327 Commits

Author SHA1 Message Date
yangdx
3eb3b170ab Remove list_of_list_to_dict function 2025-05-07 18:01:23 +08:00
yangdx
156244e260 Refactor: Unify naive context to JSON format
- Merges 'mix' mode query handling into 'hybrid' mode, simplifying query logic by removing the dedicated `mix_kg_vector_query` function
- Standardizes vector search result by using JSON string format to build context
- Fixes a bug in `query_with_keywords` ensuring `hl_keywords` and `ll_keywords` are correctly passed to `kg_query_with_keywords`
2025-05-07 17:42:14 +08:00
yangdx
59771b60df Optimize relationship title to entity1 and entity2 2025-05-07 13:02:22 +08:00
yangdx
1e03888cef Change function name get_kg_context to _get_kg_context 2025-05-07 10:57:33 +08:00
yangdx
3146309fde Change function name from list_of_list_to_json to list_of_list_to_dict 2025-05-07 10:52:26 +08:00
yangdx
edb3d6ac11 Improve query context format for mix mode 2025-05-07 10:51:44 +08:00
yangdx
2485bfe53c Fix linting 2025-05-07 03:57:14 +08:00
yangdx
910a7a8936 Unified vector retrieval logic for mix and naive queries 2025-05-07 03:47:09 +08:00
yangdx
1794b57b43 Ignore chat history in vector search 2025-05-07 03:20:39 +08:00
yangdx
c984ebd462 Improve mix query context format 2025-05-07 03:11:59 +08:00
yangdx
098846b651 Improve naive query context format 2025-05-07 02:52:05 +08:00
yangdx
b1f874b489 Fix linting 2025-05-07 01:51:58 +08:00
yangdx
52d8815230 Eliminate redundant chunk data fetch for naive query mode 2025-05-07 01:46:23 +08:00
yangdx
027c67a73c Skip self-referential relationships in edge processing 2025-05-05 11:58:33 +08:00
yangdx
9ff3542ab2 Fix time handling bugs for graph data 2025-05-01 15:14:15 +08:00
yangdx
6ec36ee538 Fix time handling bugs for _get_edge_data 2025-05-01 11:33:42 +08:00
yangdx
48fd250257 Optimize log messages 2025-04-29 19:02:08 +08:00
yangdx
43f255a5b4 Optimize log message 2025-04-29 18:20:35 +08:00
yangdx
a4a0643ad0 Enable graph_db_lock log 2025-04-28 23:21:07 +08:00
yangdx
90a07b0420 Remove unused params 2025-04-28 21:14:19 +08:00
yangdx
ef69009c15 Increase the priority of queries related to LLM requests 2025-04-28 19:36:21 +08:00
yangdx
5a393e563e remove duplicate priority setting for merge summarization 2025-04-28 18:37:51 +08:00
yangdx
140b1b3cbb Add priority control for limited async decorator 2025-04-28 18:12:29 +08:00
yangdx
3e385b5f81 Optimize logger info 2025-04-28 02:39:18 +08:00
yangdx
ef4a4ac8ed Optimize logger info 2025-04-28 02:35:35 +08:00
yangdx
594e7b751a Fix linting 2025-04-28 02:15:25 +08:00
yangdx
18040aa95c Improve parallel handling logic between extraction and merge operation 2025-04-28 01:14:00 +08:00
yangdx
dc498b88e0 Update logger message 2025-04-23 20:40:19 +08:00
yangdx
d89e0f7208 Fix linting 2025-04-23 19:35:05 +08:00
yangdx
a6f0c0b33b Simplify entity extraction prompt formatting logic 2025-04-23 18:55:03 +08:00
yangdx
2bfef70960 Fix context format 2025-04-22 21:17:01 +08:00
Daniel.y
a09f6eb170
Merge pull request #1423 from tackhwa/main
friendly implementation of entity extraction and relationship weight extract for Low-Capability LLMs
2025-04-22 19:11:04 +08:00
tackhwa
2e186ba488 remove regex 2025-04-22 15:22:37 +08:00
yangdx
1eef9b7205 Set max parallel chunks processing according to MAX_SYNC of LLM 2025-04-22 15:03:46 +08:00
yangdx
21c0bb7abf Merge branch 'context_format_csv_to_json' 2025-04-22 12:25:50 +08:00
yangdx
6a727103d6 Simplified logger messages 2025-04-22 12:19:40 +08:00
yangdx
9f958db328 Improve logger messages 2025-04-22 12:10:39 +08:00
yangdx
ff65cba544 Add null check for edge data 2025-04-21 18:32:33 +08:00
tackhwa
f3c57b606e friendly implementation of entity extraction and relationship weight extract for Low-Capability LLMs 2025-04-21 16:52:13 +08:00
yangdx
1a7b225e90 Fix stream response error for naive query mode 2025-04-21 00:06:15 +08:00
mengchao
f2f3a2721d Refactor context handling to convert data from CSV to JSON format for improved compatibility with LLM, replacing the list_of_list_to_csv function with list_of_list_to_json 2025-04-20 19:24:05 +08:00
yangdx
4ae5246a7e Remove summary length check for entity relations
- Summary now determined by num_fragment
2025-04-20 12:36:32 +08:00
孟超
a20d68d865 Revise the context format of chunks from CSV to JSON to enhance compatibility with LLM 2025-04-19 15:18:33 +08:00
drahnreb
9c6b5aefcb fix linting 2025-04-18 16:24:43 +02:00
drahnreb
e71f466910 fix: take global_config from storage class 2025-04-18 16:24:43 +02:00
drahnreb
0f949dd5d7 fix truncation with global_config tokenizer 2025-04-18 16:24:43 +02:00
drahnreb
20ba1eb9c2 add: to optionally replace default tiktoken Tokenizer with a custom one 2025-04-18 16:24:43 +02:00
drahnreb
0aa994163e fix: correct parentheses. system_prompt was never formatted. 2025-04-17 23:44:14 +02:00
yangdx
a3ca134e97 Fix special chars problem for Postgres 2025-04-17 22:58:36 +08:00
yangdx
a185e48b87 fix: cancel pending tasks when any chunk processing fails
Modify extract_entities function to terminate all pending text chunk processing tasks when any single chunk processing fails.
2025-04-17 03:57:38 +08:00
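
The fail-fast behavior described in commit a185e48b87 (cancel all pending chunk tasks as soon as one chunk fails) can be sketched with plain asyncio. This is only an illustration of the general pattern, not the code from that commit: `process_chunk`, `process_all_chunks`, and the "bad" chunk marker are hypothetical names, and the real `extract_entities` function may be structured differently.

```python
import asyncio


async def process_chunk(chunk: str) -> str:
    # Hypothetical worker standing in for per-chunk entity extraction.
    if chunk == "bad":
        raise ValueError("chunk processing failed")
    await asyncio.sleep(0.05)  # simulate slow work such as an LLM call
    return chunk.upper()


async def process_all_chunks(chunks: list[str]) -> list[str]:
    # Start one task per chunk so they run concurrently.
    tasks = [asyncio.create_task(process_chunk(c)) for c in chunks]

    # Return as soon as any task raises, or when all tasks finish.
    done, pending = await asyncio.wait(
        tasks, return_when=asyncio.FIRST_EXCEPTION
    )

    for task in done:
        error = task.exception()
        if error is not None:
            # One chunk failed: cancel everything still running.
            for p in pending:
                p.cancel()
            # Let the cancellations settle before propagating the error.
            await asyncio.gather(*pending, return_exceptions=True)
            raise error

    # No failures: collect results in the original chunk order.
    return [t.result() for t in tasks]
```

Calling `asyncio.run(process_all_chunks(["a", "b"]))` returns the processed chunks in order, while a list containing the failing chunk raises the original error after the remaining tasks have been cancelled.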