858 Commits

Author SHA1 Message Date
yangdx
bed5a97ae2 Fix prompt respond cache fail when is_embedding_cache_enabled is true 2025-02-02 03:09:06 +08:00
yangdx
5d14ab03eb Fix linting 2025-02-02 01:56:32 +08:00
yangdx
b45ae1567c Refactor LLM cache handling and entity extraction
- Removed custom LLM function in entity extraction
- Simplified cache handling logic
- Added `force_llm_cache` parameter
- Updated cache handling conditions
2025-02-02 01:28:46 +08:00
yangdx
6c7d7c25d3 Refactor cache handling logic for better readability, keep function unchanged. 2025-02-02 00:10:21 +08:00
yangdx
c9481c81b9 Add cache type "extract" for entity extraction 2025-02-01 23:05:02 +08:00
yangdx
2d387fa6de Save keywords to cache only when it's no empty 2025-02-01 22:54:23 +08:00
yangdx
3c3cdba499 Fix typo error 2025-02-01 22:27:49 +08:00
yangdx
b87703aea6 Add embedding_func to llm_response_cache 2025-02-01 22:19:16 +08:00
yangdx
3bc7c4d8f1 Save cache_type to llm_response_cache 2025-02-01 22:18:59 +08:00
yangdx
c3942077a9 Use direct embedding_func from hashing_kv (do not by pass maxiumu async control) 2025-02-01 22:12:45 +08:00
yangdx
c98a675b6c remove unused parm 2025-02-01 22:07:12 +08:00
yangdx
95edf8a51e Fix linting 2025-02-01 15:22:40 +08:00
yangdx
a0710e9d65 Merge branch 'main' into fix-extract-entity-concurrent-problem 2025-02-01 14:45:42 +08:00
yangdx
b109f57ddd Refactor async call limiting to use asyncio.Semaphore for better performance.
- Replace custom counter with asyncio.Semaphore
- The existing implementation cannot follow the FIFO order
2025-02-01 10:36:25 +08:00
yangdx
635d4fd9e4 Add lock to protect file write operations in NanoVectorDBStorage
- Introduce asyncio.Lock for save operations
- Ensure thread-safe file writes
2025-02-01 10:36:25 +08:00
yangdx
6a326e2783 Revert "Refactor embedding functions and add async query limit"
This reverts commit 21481dba8f3b020797718de3d8a82aafa7f69590.
2025-02-01 10:36:25 +08:00
yangdx
2ba36f87e3 Add support for list input in quantize_embedding function
- Convert list to numpy array if needed
- Maintain existing functionality
2025-02-01 10:36:25 +08:00
yangdx
1192727be7 remove semaphore logic from EmbeddingFunc(cause num of instances is already control by limit_async_func_call) 2025-02-01 10:36:25 +08:00
yangdx
60b66b95e3 Fix typo in prompt 2025-02-01 10:36:25 +08:00
yangdx
0692635ebd Improve prompts to avoid make-up respond from LLM like qwen-plus when very long context is provided. 2025-02-01 10:36:25 +08:00
yangdx
b22a8b216c Fix linting 2025-02-01 10:36:25 +08:00
yangdx
2a010c985e Add logging for chunk truncation in mix_kg_vector_query 2025-02-01 10:36:25 +08:00
Saifeddine ALOUI
3a40772d30 Simplified file loading 2025-02-01 01:19:32 +01:00
Saifeddine ALOUI
e09cb85f37 fixed linting as well as file path 2025-02-01 01:15:06 +01:00
Saifeddine ALOUI
ef35f9a4e4 Introduced docling instead of other tools for loading files 2025-02-01 00:56:43 +01:00
zrguo
e59cb7493c fixed linting 2025-01-31 23:35:42 +08:00
zrguo
58a9ae2f83
Merge pull request #686 from gurjot-05/main
Implement Faiss Vector Storage Support for LightRAG
2025-01-31 23:34:53 +08:00
zrguo
a3f2a3f1de
Merge pull request #682 from ParisNeo/main
Few upgrades in the webui
2025-01-31 23:33:28 +08:00
Saifeddine ALOUI
78b858c03b
Finished testing api key 2025-01-31 16:19:46 +01:00
Saifeddine ALOUI
d2a550fd31
Update api.js 2025-01-31 16:08:23 +01:00
Saifeddine ALOUI
d1210851aa
Update api.js 2025-01-31 16:07:27 +01:00
Saifeddine ALOUI
e9591548b4
Update api.js 2025-01-31 16:03:31 +01:00
Gurjot Singh
2894e8faf2 Fix linting errors 2025-01-31 19:05:47 +05:30
Gurjot Singh
8a624e198a Add faiss integration for storage 2025-01-31 19:00:36 +05:30
yangdx
2d4991dfbd Fix typo in prompt 2025-01-31 20:40:37 +08:00
Saifeddine ALOUI
2444975bf1
Update api.js 2025-01-31 13:22:19 +01:00
Saifeddine ALOUI
6889606a48
Update lightrag_server.py 2025-01-31 11:19:12 +01:00
yangdx
b0d87b2e29 Fix linting 2025-01-31 15:33:50 +08:00
yangdx
389f4ee872 Shorten log message for cosine similarity threshold. 2025-01-31 15:33:41 +08:00
yangdx
21481dba8f Refactor embedding functions and add async query limit
- Separate insert/query embedding funcs
- Add query-specific async limit
- Update storage classes to use new funcs
- Protect vector DB save with lock
- Improve config handling for thresholds
2025-01-31 15:00:56 +08:00
Saifeddine ALOUI
381f7deec6 linting 2025-01-30 23:29:21 +01:00
Saifeddine ALOUI
219cbab1e3 Added progress when scanning files and fixed some bugs in the API 2025-01-30 23:27:43 +01:00
yangdx
54b68074a1 Merge branch 'main' into fix-extract-entity-concurrent-problem 2025-01-31 01:19:44 +08:00
yangdx
fb63e5ed39 Improve prompts to avoid make-up respond from LLM like qwen-plus when very long context is provided. 2025-01-30 22:38:32 +08:00
yangdx
601f10b132 Fix linting 2025-01-30 22:26:28 +08:00
yangdx
82460088da Merge commit '59617da83e224e3af1c97fb21dd661b088effc2d' into Improve-prompt-prevent-make-up-answer 2025-01-30 22:24:13 +08:00
yangdx
81655829bc Add logging for chunk truncation in mix_kg_vector_query 2025-01-30 22:21:52 +08:00
zrguo
59617da83e
Merge pull request #675 from danielaskdd/Fix-get_conversation_turns
Fix get conversation turns
2025-01-30 20:36:59 +08:00
yangdx
3150c2812f Fix loop step from 1 to 2 in get_conversation_turns, ensure user-assistant pairing 2025-01-30 13:08:27 +08:00
yangdx
cc50ade14e Fix concurrent problem on extract_entities function.
- Abandon the approach of temporarily replacing the global llm_model_func configuration
- Introduce custom_llm function with new_config for handle_cache while extracting entities
- Update handle_cache to accept custom_llm
2025-01-30 02:45:33 +08:00