ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-11-23 21:45:39 +00:00

Author	SHA1	Message	Date
Kevin Hu	5e8cd693a5	Refa: split services about llm. (#9450 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-08-13 16:41:01 +08:00
Yongteng Lei	83771e500c	Refa: migrate chat models to LiteLLM (#9394 ) ### What problem does this PR solve? All models pass the mock response tests, which means that if a model can return the correct response, everything should work as expected. However, not all models have been fully tested in a real environment, the real API_KEY. I suggest actively monitoring the refactored models over the coming period to ensure they work correctly and fixing them step by step, or waiting to merge until most have been tested in practical environment. ### Type of change - [x] Refactoring	2025-08-12 10:59:20 +08:00
Kevin Hu	9ca86d801e	Refa: add provider info while adding model. (#9273 ) ### What problem does this PR solve? #9248 ### Type of change - [x] Refactoring	2025-08-07 09:40:42 +08:00
Stephen Hu	1409bb30df	Refactor:Improve the logic so that it does not decode base 64 for the test image each time (#9264 ) ### What problem does this PR solve? Improve the logic so that it does not decode base 64 for the test image each time ### Type of change - [x] Refactoring - [x] Performance Improvement --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-08-06 11:42:25 +08:00
kuschzzp	b638d3f773	Image validation of the image2text model without using local paths (#9052 ) ### What problem does this PR solve? #9050 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-30 12:57:24 +08:00
Adrian Altermatt	6691532079	Feat: Add model editing functionality with improved UI labels (#8855 ) ### What problem does this PR solve? Add edit button for local LLM models <img width="1531" height="1428" alt="image" src="https://github.com/user-attachments/assets/19d62255-59a6-4a7e-9772-8b8743101f78" /> <img width="1531" height="1428" alt="image" src="https://github.com/user-attachments/assets/c3a0f77e-cc6b-4190-95a6-13835463428b" /> ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: Liu An <asiro@qq.com>	2025-07-21 19:16:53 +08:00
Kevin Hu	163e71d06f	Fix: Hunyuan model adding error. (#6531 ) ### What problem does this PR solve? #6523 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-26 10:33:33 +08:00
Kevin Hu	5748d58c74	Refa: refine the error message. (#6151 ) ### What problem does this PR solve? #6138 ### Type of change - [x] Refactoring	2025-03-17 13:07:22 +08:00
Kevin Hu	471bd92b4c	Fix: empty api-key causes problems. (#6022 ) ### What problem does this PR solve? #5926 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-13 14:57:47 +08:00
Kevin Hu	45123dcc0a	Fix: ollama model add error. (#5947 ) ### What problem does this PR solve? #5944 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-12 10:56:05 +08:00
Kevin Hu	82f5d901c8	Refa: add model. (#5820 ) ### What problem does this PR solve? #5783 ### Type of change - [x] Refactoring	2025-03-10 11:22:06 +08:00
Kevin Hu	4c9a3e918f	Fix: add image2text issue. (#5431 ) ### What problem does this PR solve? #5356 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-27 14:06:49 +08:00
Yongteng Lei	0e920a91dd	FIX: correct typo (#5387 ) ### What problem does this PR solve? Correct typo in supported_models file ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-26 17:21:09 +08:00
Yongteng Lei	cdcaae17c6	Feat: add VLLM (#5380 ) ### What problem does this PR solve? Read to add VLMM. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-02-26 16:04:53 +08:00
Kevin Hu	4f40f685d9	Code refactor (#5371 ) ### What problem does this PR solve? #5173 ### Type of change - [x] Refactoring	2025-02-26 15:40:52 +08:00
Kevin Hu	605cfdb8dc	Refine error message for re-rank model. (#5278 ) ### What problem does this PR solve? #5261 ### Type of change - [x] Refactoring	2025-02-24 13:01:34 +08:00
yrk111222	7ce675030b	Support downloading models from ModelScope Community. (#5073 ) This PR supports downloading models from ModelScope. The main modifications are as follows: -New Feature (non-breaking change which adds functionality) -Documentation Update --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-02-24 10:12:20 +08:00
Kevin Hu	ef8847eda7	Double check error of adding llm. (#5237 ) ### What problem does this PR solve? #5227 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-21 19:09:49 +08:00
Kevin Hu	78982d88e0	Reformat error message. (#4829 ) ### What problem does this PR solve? #4828 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-02-10 16:47:53 +08:00
Alex Chen	7944aacafa	Feat: add gpustack model provider (#4469 ) ### What problem does this PR solve? Add GPUStack as a new model provider. [GPUStack](https://github.com/gpustack/gpustack) is an open-source GPU cluster manager for running LLMs. Currently, locally deployed models in GPUStack cannot integrate well with RAGFlow. GPUStack provides both OpenAI compatible APIs (Models / Chat Completions / Embeddings / Speech2Text / TTS) and other APIs like Rerank. We would like to use GPUStack as a model provider in ragflow. [GPUStack Docs](https://docs.gpustack.ai/latest/quickstart/) Related issue: https://github.com/infiniflow/ragflow/issues/4064. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Testing Instructions 1. Install GPUStack and deploy the `llama-3.2-1b-instruct` llm, `bge-m3` text embedding model, `bge-reranker-v2-m3` rerank model, `faster-whisper-medium` Speech-to-Text model, `cosyvoice-300m-sft` in GPUStack. 2. Add provider in ragflow settings. 3. Testing in ragflow.	2025-01-15 14:15:58 +08:00
Kevin Hu	097aab09a2	Replace image2text model check with internal image. (#4250 ) ### What problem does this PR solve? #4243 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-26 19:46:42 +08:00
Kevin Hu	9b9039de92	Fix connection error for adding visual llm. (#4028 ) ### What problem does this PR solve? #3897 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-13 18:54:51 +08:00
Zhichang Yu	1254ecf445	Added static check at PR CI (#3921 ) ### What problem does this PR solve? Added static check at PR CI ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2024-12-08 21:23:51 +08:00
Zhichang Yu	0d68a6cd1b	Fix errors detected by Ruff (#3918 ) ### What problem does this PR solve? Fix errors detected by Ruff ### Type of change - [x] Refactoring	2024-12-08 14:21:12 +08:00
Kevin Hu	655b01a0a4	Remove token check while adding model. (#3903 ) ### What problem does this PR solve? #3820 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-12-06 17:01:19 +08:00
liuhua	d42362deb6	Add api for sessions and add max_tokens for tenant_llm (#3472 ) ### What problem does this PR solve? Add api for sessions and add max_tokens for tenant_llm ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>	2024-11-19 14:51:33 +08:00
shizzgar	4b3eeaa6ef	Added LocalAI support for rerank models (#3446 ) ### What problem does this PR solve? Hi there! LocalAI added support of rerank models https://localai.io/features/reranker/ I've implemented LocalAIRerank class (typically copied it from OpenAI_APIRerank class). Also, LocalAI model response with 500 error code if len of "documents" is less than 2 in similarity check. So I've added the second "document" on RERANK model connection check in `api/apps/llm_app.py`. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-11-18 12:05:52 +08:00
Jin Hai	1e90a1bf36	Move settings initialization after module init phase (#3438 ) ### What problem does this PR solve? 1. Module init won't connect database any more. 2. Config in settings need to be used with settings.CONFIG_NAME ### Type of change - [x] Refactoring Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-11-15 17:30:56 +08:00
Zhichang Yu	30f6421760	Use consistent log file names, introduced initLogger (#3403 ) ### What problem does this PR solve? Use consistent log file names, introduced initLogger ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-14 17:13:48 +08:00
Zhichang Yu	a2a5631da4	Rework logging (#3358 ) Unified all log files into one. ### What problem does this PR solve? Unified all log files into one. ### Type of change - [x] Refactoring	2024-11-12 17:35:13 +08:00
Zhichang Yu	185c6a0c71	Unified API response json schema (#3170 ) ### What problem does this PR solve? Unified API response json schema ### Type of change - [x] Refactoring	2024-11-05 11:02:31 +08:00
0000sir	4991107822	Fix keys of Xinference deployed models, especially has the same model name with public hosted models. (#2832 ) ### What problem does this PR solve? Fix keys of Xinference deployed models, especially has the same model name with public hosted models. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: 0000sir <0000sir@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-16 10:21:08 +08:00
Kevin Hu	190eea7097	trival (#2808 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-11 15:33:38 +08:00
Kevin Hu	2d1c83da59	fix LIGHTEN issue (#2806 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-10-11 15:01:27 +08:00
JobSmithManipulation	18f80743eb	support api-version and change default-model in adding azure-openai and openai (#2799 ) ### What problem does this PR solve? #2701 #2712 #2749 ### Type of change -[x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-10-11 11:26:42 +08:00
JobSmithManipulation	96f56a3c43	add huggingface model (#2624 ) ### What problem does this PR solve? #2469 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-09-27 19:15:38 +08:00
Kevin Hu	7bb28ca2bd	add lighten control (#2567 ) ### What problem does this PR solve? #2295 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-09-24 19:22:01 +08:00
Kevin Hu	d40041cc82	refine multi-turn chat in agent (#2560 ) ### What problem does this PR solve? #2484 ### Type of change - [x] Performance Improvement - [ ] Other (please describe):	2024-09-24 16:20:19 +08:00
Kevin Hu	7b3099b1a1	add an API of delete llm supplier (#2556 ) ### What problem does this PR solve? #1853 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-09-24 15:24:15 +08:00
liuhua	d9c2a128a5	SparkTTS (#2535 ) ### What problem does this PR solve? SparkTTS ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: liuhua <10215101452@stu.ecun.edu.cn>	2024-09-24 12:15:12 +08:00
Kevin Hu	a44f1f735d	fix self deployed llm lost (#2510 ) ### What problem does this PR solve? #2509 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-09-20 10:41:25 +08:00
Kevin Hu	5968f148bc	refactor add LLM (#2508 ) ### What problem does this PR solve? #2487 ### Type of change - [x] Refactoring	2024-09-20 10:20:35 +08:00
yungongzi	4f962d6bff	BugFix: Fixed api_key generation error for VolcEngine (#2502 ) BugFix: Fixed api_key generation error for VolcEngine with python's f-string syntax ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: 海贼宅 <stu_xyx@163.com>	2024-09-20 10:03:43 +08:00
Kevin Hu	b5d1d2fec4	refine TTS (#2500 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-09-19 19:15:16 +08:00
_Chenbing	af0b4b0828	fix(Add model api): Add VolcEngine to create api_key format error (#2490 ) ### What problem does this PR solve? Add VolcEngine to create api_key format error When constructing the json string, there was an extra "," at the end, which caused a formatting error. This commit fixed the problem. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-09-19 15:10:49 +08:00
Kevin Hu	66c54e75f3	add default model types (#2342 ) ### What problem does this PR solve? ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2024-09-10 11:39:44 +08:00
Kevin Hu	f60dfffb4b	add model types to factories API (#2341 ) ### What problem does this PR solve? ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2024-09-10 11:26:01 +08:00
Kevin Hu	0fe19f3fbc	fix QWenSeq2txt bug (#2245 ) ### What problem does this PR solve? #2243 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-09-04 18:25:43 +08:00
黄腾	87a998e9e5	fix tts add bug (#2224 ) ### What problem does this PR solve? fix tts add bug ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-09-03 18:40:20 +08:00
黄腾	5decdde182	add support for Google Cloud (#2175 ) ### What problem does this PR solve? #1853 add support for Google Cloud ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-09-02 12:06:41 +08:00

1 2

96 Commits