Larfii
|
2ba20910bb
|
fix naive_query
|
2024-12-09 17:45:01 +08:00 |
|
zrguo
|
71af34196f
|
Merge branch 'main' into fix-entity-name-string
|
2024-12-09 17:30:40 +08:00 |
|
Larfii
|
ffa95e0461
|
Fix jina embedding
|
2024-12-09 17:05:17 +08:00 |
|
zrguo
|
0a8d88212a
|
Merge pull request #423 from davidleon/feature/jina_embedding
add jina embedding
|
2024-12-09 10:18:50 +08:00 |
|
david
|
97d1894077
|
add jina embedding
|
2024-12-08 22:20:41 +08:00 |
|
Magic_yuan
|
779ed604d8
|
清理多余注释
|
2024-12-08 17:38:49 +08:00 |
|
Magic_yuan
|
39c2cb11f3
|
清理多余注释
|
2024-12-08 17:37:58 +08:00 |
|
Magic_yuan
|
ccf44dc334
|
feat(cache): 增加 LLM 相似性检查功能并优化缓存机制
- 在 embedding 缓存配置中添加 use_llm_check 参数
- 实现 LLM 相似性检查逻辑,作为缓存命中的二次验证- 优化 naive 模式的缓存处理流程
- 调整缓存数据结构,移除不必要的 model 字段
|
2024-12-08 17:35:52 +08:00 |
|
magicyuan876
|
4da7dd1865
|
移除kwargs中的hashing_kv参数取为变量
|
2024-12-06 15:35:09 +08:00 |
|
yuanxiaobin
|
6a010abb62
|
移除kwargs中的hashing_kv参数取为变量
|
2024-12-06 15:35:09 +08:00 |
|
magicyuan876
|
efdd4b8b8e
|
移除kwargs中的hashing_kv参数取为变量
|
2024-12-06 15:23:18 +08:00 |
|
yuanxiaobin
|
a1c4a036fd
|
移除kwargs中的hashing_kv参数取为变量
|
2024-12-06 15:23:18 +08:00 |
|
magicyuan876
|
8d9fc01b4c
|
解决冲突
|
2024-12-06 15:09:50 +08:00 |
|
yuanxiaobin
|
633fb55b5b
|
解决冲突
|
2024-12-06 15:09:50 +08:00 |
|
magicyuan876
|
8924d2b8fc
|
Merge remote-tracking branch 'origin/main'
# Conflicts:
# lightrag/llm.py
# lightrag/operate.py
|
2024-12-06 15:06:00 +08:00 |
|
yuanxiaobin
|
ad4b0d1ba9
|
Merge remote-tracking branch 'origin/main'
# Conflicts:
# lightrag/llm.py
# lightrag/operate.py
|
2024-12-06 15:06:00 +08:00 |
|
magicyuan876
|
e619b09c8a
|
重构缓存处理逻辑
- 提取通用缓存处理逻辑到新函数 handle_cache 和 save_to_cache
- 使用 CacheData 类统一缓存数据结构
- 优化嵌入式缓存和常规缓存的处理流程
- 添加模式参数以支持不同查询模式的缓存策略
- 重构 get_best_cached_response 函数,提高缓存查询效率
|
2024-12-06 14:29:16 +08:00 |
|
yuanxiaobin
|
584258078f
|
重构缓存处理逻辑
- 提取通用缓存处理逻辑到新函数 handle_cache 和 save_to_cache
- 使用 CacheData 类统一缓存数据结构
- 优化嵌入式缓存和常规缓存的处理流程
- 添加模式参数以支持不同查询模式的缓存策略
- 重构 get_best_cached_response 函数,提高缓存查询效率
|
2024-12-06 14:29:16 +08:00 |
|
zrguo
|
f2a208c343
|
Merge branch 'main' into main
|
2024-12-06 11:38:27 +08:00 |
|
zrguo
|
ad991f904d
|
Merge branch 'main' into main
|
2024-12-06 11:38:27 +08:00 |
|
magicyuan876
|
5dfb74ef2d
|
修复 args_hash在使用常规缓存时候才计算导致embedding缓存时没有计算的bug
|
2024-12-06 10:40:48 +08:00 |
|
yuanxiaobin
|
7c4bbe2474
|
修复 args_hash在使用常规缓存时候才计算导致embedding缓存时没有计算的bug
|
2024-12-06 10:40:48 +08:00 |
|
magicyuan876
|
6c29a37f20
|
修复 args_hash在使用常规缓存时候才计算导致embedding缓存时没有计算的bug
|
2024-12-06 10:28:35 +08:00 |
|
yuanxiaobin
|
8a69604966
|
修复 args_hash在使用常规缓存时候才计算导致embedding缓存时没有计算的bug
|
2024-12-06 10:28:35 +08:00 |
|
magicyuan876
|
6540d11096
|
修复 args_hash在使用常规缓存时候才计算导致embedding缓存时没有计算的bug
|
2024-12-06 10:21:53 +08:00 |
|
yuanxiaobin
|
f2a1897b61
|
修复 args_hash在使用常规缓存时候才计算导致embedding缓存时没有计算的bug
|
2024-12-06 10:21:53 +08:00 |
|
partoneplay
|
e82d13e182
|
Add support for Ollama streaming output and integrate Open-WebUI as the chat UI demo
|
2024-12-06 10:13:16 +08:00 |
|
partoneplay
|
335179196a
|
Add support for Ollama streaming output and integrate Open-WebUI as the chat UI demo
|
2024-12-06 10:13:16 +08:00 |
|
magicyuan876
|
d48c6e4588
|
feat(lightrag): 添加 查询时使用embedding缓存功能
- 在 LightRAG 类中添加 embedding_cache_config配置项
- 实现基于 embedding 相似度的缓存查询和存储
- 添加量化和反量化函数,用于压缩 embedding 数据
- 新增示例演示 embedding 缓存的使用
|
2024-12-06 08:17:20 +08:00 |
|
yuanxiaobin
|
525c971a23
|
feat(lightrag): 添加 查询时使用embedding缓存功能
- 在 LightRAG 类中添加 embedding_cache_config配置项
- 实现基于 embedding 相似度的缓存查询和存储
- 添加量化和反量化函数,用于压缩 embedding 数据
- 新增示例演示 embedding 缓存的使用
|
2024-12-06 08:17:20 +08:00 |
|
Larfii
|
a2072a055a
|
fix: unexpected keyword argument error
|
2024-12-05 11:47:56 +08:00 |
|
Larfii
|
da73ba9b6b
|
fix: unexpected keyword argument error
|
2024-12-05 11:47:56 +08:00 |
|
LarFii
|
44d441a951
|
update insert custom kg
|
2024-12-04 19:44:04 +08:00 |
|
LarFii
|
db9b9f69f8
|
update insert custom kg
|
2024-12-04 19:44:04 +08:00 |
|
zrguo
|
3e69b326ec
|
Merge pull request #383 from MRX760/main
added nvidia text-embedding API and example of using nvidia API llm a…
|
2024-12-04 11:21:46 +08:00 |
|
zrguo
|
f3ae4fccfa
|
Merge pull request #383 from MRX760/main
added nvidia text-embedding API and example of using nvidia API llm a…
|
2024-12-04 11:21:46 +08:00 |
|
MRX760
|
5f13ce1ce9
|
added nvidia text-embedding API and example of using nvidia API llm and text-embedding
|
2024-12-03 17:15:10 +07:00 |
|
MRX760
|
0b87e4649f
|
added nvidia text-embedding API and example of using nvidia API llm and text-embedding
|
2024-12-03 17:15:10 +07:00 |
|
partoneplay
|
bc2b8c592e
|
embedding deprecated in favor of embed
|
2024-12-03 08:42:36 +08:00 |
|
partoneplay
|
18e86a1825
|
embedding deprecated in favor of embed
|
2024-12-03 08:42:36 +08:00 |
|
zrguo
|
e8b5498699
|
Merge pull request #360 from ahmadhatahet/azure_openai_embedding
Azure OpenAI Embedding
|
2024-12-02 16:02:13 +08:00 |
|
zrguo
|
1420669891
|
Merge pull request #360 from ahmadhatahet/azure_openai_embedding
Azure OpenAI Embedding
|
2024-12-02 16:02:13 +08:00 |
|
Ahmad Hatahet
|
f281414308
|
add api_version to azure_openai_complete_if_cache
|
2024-11-30 17:47:33 +01:00 |
|
Ahmad Hatahet
|
137315ec18
|
add api_version to azure_openai_complete_if_cache
|
2024-11-30 17:47:33 +01:00 |
|
Ahmad Hatahet
|
23cabbe7a3
|
update max_token_size according to openai doc
|
2024-11-30 17:16:07 +01:00 |
|
Ahmad Hatahet
|
9b92d425f6
|
update max_token_size according to openai doc
|
2024-11-30 17:16:07 +01:00 |
|
Ahmad Hatahet
|
7fea7d7b5e
|
add api_version to args
|
2024-11-30 17:11:38 +01:00 |
|
Ahmad Hatahet
|
38c8a6e97c
|
add api_version to args
|
2024-11-30 17:11:38 +01:00 |
|
b10902118
|
0ab55da47a
|
fix error and tested
|
2024-11-30 00:00:51 +08:00 |
|
b10902118
|
2e95c633cf
|
fix error and tested
|
2024-11-30 00:00:51 +08:00 |
|