2573 Commits

Author SHA1 Message Date
yangdx
1d24e8ca3c Bump api version to 0179 2025-07-07 01:40:26 +08:00
yangdx
db22cad2c8 feat: add workspace and MAX_GRAPH_NODES to /health endpoint and webui 2025-07-07 01:39:48 +08:00
yangdx
253833475f Add workspace info to splash screen display 2025-07-07 01:26:27 +08:00
yangdx
033098c1bc Feat: Add WORKSPACE support to all storage types 2025-07-07 00:57:21 +08:00
yangdx
1b2d295a4f Remove namespace_prefix 2025-07-06 00:16:47 +08:00
yangdx
98150e80b8 Improved empty/whitespace file handling
- Better detection of whitespace-only files
- Changed error to warning for empty chunks
2025-07-05 23:16:39 +08:00
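A minimal sketch of the behaviour this commit describes: detect whitespace-only content, and warn instead of raising when a chunk is empty. The helper names `is_blank` and `keep_non_empty_chunks` are hypothetical and do not come from the repository.

```python
import logging

logger = logging.getLogger("empty_chunk_example")


def is_blank(content: str) -> bool:
    """Return True when the text is empty or contains only whitespace."""
    return not content or not content.strip()


def keep_non_empty_chunks(chunks: list) -> list:
    """Drop blank chunks, logging a warning rather than raising an error."""
    kept = []
    for index, chunk in enumerate(chunks):
        if is_blank(chunk):
            # Assumption: a blank chunk is skipped, not treated as fatal.
            logger.warning("Skipping empty chunk at index %d", index)
            continue
        kept.append(chunk)
    return kept
```

Under these assumptions a file consisting only of spaces, tabs, or newlines yields no chunks and produces warnings rather than a failed run.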
xuewei
648a87653f Text chunk is blank 2025-07-05 14:28:42 +08:00
xuewei
49cb51b5dc No content parsed from PDF file 2025-07-05 13:47:47 +08:00
yangdx
706da5ad23 Merge branch 'main' of github.com:HKUDS/LightRAG 2025-07-05 13:12:27 +08:00
yangdx
86a00ed333 Remove deprecated storage type 2025-07-05 13:11:32 +08:00
yangdx
fe13475234 Fix linting 2025-07-05 12:07:37 +08:00
yangdx
a2e59dd078 fix: prevent empty entity names after normalization in extraction
Added validation checks in entity and relationship extraction functions to filter out entities that become empty strings after normalize_extracted_info processing. This prevents empty labels from appearing in get_all_labels() results and maintains knowledge graph data integrity.
2025-07-05 12:06:34 +08:00
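The validation described above might look roughly like the sketch below. `normalize_name` is a hypothetical stand-in for `normalize_extracted_info`, whose real behaviour is not shown in this log. Only the filtering step after normalization is illustrated.

```python
def normalize_name(raw: str) -> str:
    """Hypothetical stand-in for normalize_extracted_info.

    Assumption: normalization strips surrounding whitespace and quotes.
    """
    return raw.strip().strip("\"'").strip()


def filter_entities(raw_names: list) -> list:
    """Keep only names that are still non-empty after normalization."""
    result = []
    for raw in raw_names:
        name = normalize_name(raw)
        if not name:
            # An empty name would surface as an empty label in the graph.
            continue
        result.append(name)
    return result
```

With this stand-in, `filter_entities(['"Alice"', '  ', '""', 'Bob '])` keeps only `Alice` and `Bob`.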
yangdx
bdfd2d53c7 Fix linting 2025-07-05 11:43:45 +08:00
yangdx
2e2b9f3b48 Refactor setup.py to utilize pyproject.toml for project installation. 2025-07-05 11:19:00 +08:00
yangdx
99bb25a282 Update logger message 2025-07-05 08:21:54 +08:00
yangdx
2d78d74055 Fix milvus compatible check error 2025-07-05 07:55:36 +08:00
yangdx
b842a80aab Fix linting 2025-07-05 07:09:01 +08:00
yangdx
9878dc7f51 fix: ensure Milvus collections are loaded before operations
- Resolves "collection not loaded" MilvusException errors
2025-07-05 00:40:25 +08:00
yangdx
7ec4ba5f9a Fix: recover faiss_impl.py 2025-07-04 23:50:27 +08:00
yangdx
951ea3b851 Remove deprecated storage type 2025-07-04 21:48:18 +08:00
yangdx
6201c7fb56 Refactoring Milvus implementation 2025-07-04 21:42:10 +08:00
yangdx
810676d54c Fix faiss import error 2025-07-04 19:12:23 +08:00
yangdx
531502677e fix: Use create_time when update_time is 0 in PGKVStorage queries 2025-07-03 23:38:53 +08:00
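The fallback named in this commit can be sketched in plain Python, leaving the actual SQL aside. The record layout (a dict with integer `create_time` and `update_time` fields) and the function name are assumptions made for illustration.

```python
def effective_update_time(record: dict) -> int:
    """Return update_time, falling back to create_time when it is 0.

    Assumption: both fields are integer timestamps and 0 means "never updated".
    """
    update_time = record.get("update_time", 0)
    if update_time:
        return update_time
    return record.get("create_time", 0)
```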
yangdx
3355a0ce95 Fix create_time conflict in MongoKVStorage updates 2025-07-03 22:58:08 +08:00
yangdx
04d793abbd Update logger message 2025-07-03 22:15:32 +08:00
yangdx
67f51597c2 Bump api version to 0178 2025-07-03 21:37:47 +08:00
yangdx
05231233f1 Feat: Check request_pending after document deletion
- Add double-check for pipeline status to prevent race conditions
- Implement automatic processing of pending indexing requests after deletion
2025-07-03 21:36:35 +08:00
yangdx
6c2ae40d7d Refac: Enhance KG rebuild stability by incorporating create_time into the LLM cache 2025-07-03 17:08:29 +08:00
yangdx
a9e10ae810 Update logger messages 2025-07-03 14:08:19 +08:00
yangdx
6b6d14bc3a fix: Deduplicate entities and relationships in a single chunk with multiple gleaning results during KG rebuild 2025-07-03 13:47:52 +08:00
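One way to picture this fix: when a single chunk has several gleaning results, merge them so that each entity and each relationship appears only once. The data layout below (dicts with `entities` and `relationships` lists, keyed by `name` and by `source`/`target`) is assumed for illustration and is not taken from the repository.

```python
def dedupe_extractions(gleaning_results: list) -> tuple:
    """Merge several gleaning results for one chunk, keeping first occurrences."""
    entities = {}
    relationships = {}
    for result in gleaning_results:
        for entity in result.get("entities", []):
            # Assumption: an entity is identified by its name.
            entities.setdefault(entity["name"], entity)
        for relation in result.get("relationships", []):
            # Assumption: a relationship is identified by its ordered endpoints.
            key = (relation["source"], relation["target"])
            relationships.setdefault(key, relation)
    return list(entities.values()), list(relationships.values())
```

Keeping the first occurrence is a design choice made for this sketch. A real implementation could instead merge descriptions from the duplicates.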
yangdx
70e154b0aa Fix linting 2025-07-03 12:26:05 +08:00
yangdx
ff1b1c61c7 Implemented storage types: PostgreSQL and MongoDB 2025-07-03 11:46:24 +08:00
yangdx
e56734cb8b Refac: Optimize document deletion performance
- Adding chunks_list to doc_status
- Adding llm_cache_list to text_chunks
- Implemented storage types: JsonKV and Redis
2025-07-03 04:18:25 +08:00
Daniel.y
d0f04383cc
Merge pull request #1729 from danielaskdd/refac-kv-cache
feat: Flatten LLM cache structure for improved recall efficiency
2025-07-02 16:36:45 +08:00
yangdx
3be8727e3e Bump api version to 0177 2025-07-02 16:35:22 +08:00
yangdx
86c9a0cda2 Fix linting 2025-07-02 16:29:43 +08:00
yangdx
271722405f feat: Flatten LLM cache structure for improved recall efficiency
Refactored the LLM cache to a flat Key-Value (KV) structure, replacing the previous nested format. The old structure used the 'mode' as a key and stored specific cache content as JSON nested under it. This change significantly enhances cache recall efficiency.
2025-07-02 16:11:53 +08:00
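To illustrate the change from a nested to a flat layout, the sketch below converts a cache keyed by mode into a single-level key-value mapping. The flat key format `"<mode>:<hash>"` is an assumption made for this example. The log does not state the exact key scheme.

```python
def flatten_cache(nested: dict) -> dict:
    """Convert {mode: {hash: entry}} into {"mode:hash": entry}."""
    flat = {}
    for mode, entries in nested.items():
        for args_hash, entry in entries.items():
            # Assumed key format; one lookup replaces load-then-index.
            flat[f"{mode}:{args_hash}"] = entry
    return flat


def lookup(flat: dict, mode: str, args_hash: str):
    """Fetch one cache entry with a single key lookup."""
    return flat.get(f"{mode}:{args_hash}")
```

With the nested layout, reading one entry means loading the whole per-mode JSON object first. With a flat layout, each entry is addressed directly by its own key.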
zrguo
479865a271 Add max_gleaning to env 2025-07-01 17:13:33 +08:00
SLKun
4e88ee3662 update ollama compatible api 2025-06-30 10:41:35 +08:00
yangdx
4c2b4b4b6b Revert "Fix LLM cache handling for Redis to address document deletion scenarios."
This reverts commit 14cda93988dc110c11cf1fe8286ded7b5db7b8fb.
2025-06-29 22:35:40 +08:00
yangdx
10cd9c90e7 Revert "Fix linting"
This reverts commit abd9de2a63b39021320855fab9f5c3b332dee162.
2025-06-29 22:35:26 +08:00
yangdx
75ea0bc38d Remove default get_nodes_by_chunk_ids implementation
- Deletes inefficient default implementation
- Makes method purely abstract
2025-06-29 22:05:48 +08:00
yangdx
e70f5a35e5 Refac: Add KG rebuild logging with pipeline status
- Logs detailed progress, including warnings and failures, to the pipeline status.
- Adds counters to report the total number of successfully rebuilt entities and relationships upon completion.
2025-06-29 21:27:12 +08:00
yangdx
abd9de2a63 Fix linting 2025-06-29 15:15:49 +08:00
yangdx
14cda93988 Fix LLM cache handling for Redis to address document deletion scenarios.
- Implements bulk scan for "extract" cache entries
- Maintains backward compatibility for normal IDs
2025-06-29 15:13:42 +08:00
yangdx
e2824b721e Fix LLM cache handling for MongoKVStorage to address document deletion scenarios.
- Support fetching all "default_" prefixed documents
- Maintain original behavior for other IDs
- Return dictionary of documents for "default"
- Keep backward compatibility
2025-06-29 15:03:57 +08:00
yangdx
37bf341a69 Fix LLM cache handling for PGKVStorage to address document deletion scenarios.
- Add dynamic cache_type field
- Support mode parameter for LLM cache
- Maintain backward compatibility
2025-06-29 14:39:50 +08:00
Moddy Surala
319d12bb6a Fix GRAPH_FIELD_SEP import typo 2025-06-29 01:28:39 +05:00
yangdx
6d5e73a251 Merge branch 'kenspirit/main' 2025-06-29 00:30:58 +08:00
yangdx
28aedd8b3c Update comments 2025-06-29 00:30:39 +08:00