4156 Commits

Author SHA1 Message Date
yangdx
efc359c411 Update webui assets 2025-07-13 00:57:41 +08:00
Daniel.y
0e52582bf2
Merge pull request #1773 from danielaskdd/cleanup-lock-in-healthcheck
Feat: Added reranker config and lock status to status card of WebUI
2025-07-13 00:52:57 +08:00
Daniel.y
9674ade611
Merge pull request #1772 from danielaskdd/rebuild-chunks-in-parallel
Optimize knowledge graph rebuild with parallel processing
2025-07-13 00:49:52 +08:00
yangdx
eb31ff0f90 Update i18n translation 2025-07-13 00:46:12 +08:00
yangdx
ab561196ff Feat: Added reranker config and lock status to status card of WebUI 2025-07-13 00:41:54 +08:00
yangdx
0e3aaa318f Feat: Add keyed lock cleanup and status monitoring 2025-07-13 00:09:00 +08:00
Daniel.y
9f3af332ec
Merge pull request #1771 from danielaskdd/namespace-keyed-lock
Refac: Generalize keyed lock with namespace support
2025-07-12 13:29:38 +08:00
yangdx
e4bf4d19a0 Optimize knowledge graph rebuild with parallel processing
- Add parallel processing for KG rebuild
- Implement keyed locks for data consistency
2025-07-12 13:22:56 +08:00
yangdx
a85d7054d4 fix: move node existence check inside lock to prevent race condition
Move knowledge_graph_inst.has_node check inside get_storage_keyed_lock
in _merge_edges_then_upsert to ensure atomic check-then-act operations
and prevent duplicate node creation during concurrent updates.
2025-07-12 12:22:32 +08:00
yangdx
2ade3067f8 Refac: Generalize keyed lock with namespace support
Refactored the `KeyedUnifiedLock` to be generic and support dynamic namespaces. This decouples the locking mechanism from a specific "GraphDB" implementation, allowing it to be reused across different components and workspaces safely.

Key changes:
- `KeyedUnifiedLock` now takes a `namespace` parameter on lock acquisition.
- Renamed `_graph_db_lock_keyed` to a more generic _storage_keyed_lock`
- Replaced `get_graph_db_lock_keyed` with get_storage_keyed_lock` to support namespaces
2025-07-12 12:10:12 +08:00
yangdx
f2d875f8ab Update comments 2025-07-12 11:05:25 +08:00
yangdx
943ead8b1d Bump api version to 0181 2025-07-12 05:59:13 +08:00
Daniel.y
b0ca25e5f1
Merge pull request #1768 from schmidt-marvin/main
fix(build): pyproject.toml setup
2025-07-12 05:44:48 +08:00
Daniel.y
ad7d7d0854
Merge pull request #1770 from danielaskdd/merge_lock_with_key
Refac: Optimize keyed lock cleanup logic with time and size tracking
2025-07-12 05:24:36 +08:00
yangdx
5ee509e671 Fix linting 2025-07-12 05:17:44 +08:00
yangdx
964293f21b Optimize lock cleanup with time tracking and intervals
- Add cleanup time tracking variables
- Implement minimum cleanup intervals
- Track earliest cleanup times
- Handle time rollback cases
- Improve cleanup logging
2025-07-12 04:34:26 +08:00
yangdx
39965d7ded Move merging stage back controled by max parallel insert semhore 2025-07-12 03:32:08 +08:00
yangdx
7490a18481 Optimize lock cleanup parameters 2025-07-12 03:10:03 +08:00
yangdx
3d8e6924bc Show lock clean up message 2025-07-12 02:58:05 +08:00
yangdx
22c36f2fd2 Optimize log messages 2025-07-12 02:41:31 +08:00
yangdx
a64c767298 optimize: improve lock cleanup performance with threshold-based strategy
- Add CLEANUP_THRESHOLD constant (100) to control cleanup frequency
- Modify _release_shared_raw_mp_lock to only scan when cleanup list exceeds threshold
- Modify _release_async_lock to only scan when cleanup list exceeds threshold
2025-07-11 23:43:40 +08:00
yangdx
ad99d9ba5a Improve code organization and comments 2025-07-11 22:13:02 +08:00
yangdx
c52c451cf7 Fix linting 2025-07-11 20:40:50 +08:00
yangdx
3afdd1b67c Fix initial count error for multi-process lock with key 2025-07-11 20:39:08 +08:00
Marvin Schmidt
42a1da0041 fix(build): pyproject.toml setup 2025-07-11 12:01:34 +02:00
yangdx
c47747da9e Merge branch 'main' into merge_lock_with_key 2025-07-11 16:37:10 +08:00
yangdx
ef4870fda5 Combined entity and edge processing tasks and optimize merging with semaphore 2025-07-11 16:34:54 +08:00
Daniel.y
ba0cffd853
Merge pull request #1753 from HKUDS/rerank
Add rerank optional for chunks
2025-07-09 16:06:55 +08:00
zrguo
b0479c078a fix process_chunks_unified() 2025-07-09 15:55:38 +08:00
yangdx
9aa2ed0837 Merge branch 'main' into rerank 2025-07-09 15:33:39 +08:00
yangdx
e457374224 Fix linting 2025-07-09 15:33:05 +08:00
yangdx
bfa0844ecb Update README 2025-07-09 15:17:05 +08:00
zrguo
e1541caea9 Update webui setting 2025-07-09 12:10:06 +08:00
yangdx
207f0a7f2a Merge branch 'main' into merge_lock_with_key 2025-07-09 09:25:28 +08:00
yangdx
cb3bfc0e5b Release semphore before merge stage 2025-07-09 09:24:44 +08:00
Daniel.y
feb30d8987
Merge pull request #1759 from antonvice/bugfix/handle-none-filepath
Fix: Handle NoneType error when processing documents without a file path
2025-07-09 09:21:04 +08:00
Anton Vice
b192f8c9a3 Fix: Handle NoneType error when processing documents without a file path
The document processing pipeline would crash with a TypeError when a document was submitted as raw text via the API, as the file_path attribute would be None. This change adds a check to handle the None case gracefully, preventing the crash and allowing text-based documents to be indexed correctly.
2025-07-08 19:35:22 -03:00
yangdx
4705a22861 Bump core version to 1.4.0 2025-07-09 04:43:20 +08:00
yangdx
2056c3c809 Increase default CHUNK_TOP_K from 5 to 15 2025-07-09 04:41:51 +08:00
yangdx
78033edabb Update env.example 2025-07-09 04:37:04 +08:00
yangdx
e9c3503f77 Update logger info 2025-07-09 04:36:52 +08:00
yangdx
5d4484882a Merge branch 'main' into rerank 2025-07-09 03:59:04 +08:00
Daniel.y
1497f334f6
Merge pull request #1703 from DavIvek/add-Memgraph-graph-db
Add Memgraph as a Supported Graph Database Backend.
2025-07-09 03:45:22 +08:00
Daniel.y
565b274490
Merge pull request #1758 from HKUDS/memgraph
Merge Memgraph branch
2025-07-09 03:43:25 +08:00
yangdx
14d51518dd Merge branch 'add-Memgraph-graph-db' into memgraph 2025-07-09 03:38:07 +08:00
yangdx
3a0249a6b9 Update env.example 2025-07-09 03:36:17 +08:00
DavIvek
1854d7c75a Merge remote-tracking branch 'upstream/memgraph' into add-Memgraph-graph-db 2025-07-08 21:00:24 +02:00
DavIvek
08eb68b8ed run pre-commit 2025-07-08 20:21:20 +02:00
yangdx
75ce636084 Merge branch 'main' into add-Memgraph-graph-db 2025-07-09 02:09:35 +08:00
DavIvek
4438897b6b add changes based on review 2025-07-08 16:28:06 +02:00