ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2025-12-02 18:06:57 +00:00

Author	SHA1	Message	Date
Stephen Hu	5ccdb95008	Refactor:Introduce Image Close For GeminiCV (#9147 ) ### What problem does this PR solve? Introduce Image Close For GeminiCV ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-08-01 12:38:13 +08:00
JI4JUN	aeaeb169e4	Feat/support 302ai provider (#8742 ) ### What problem does this PR solve? Support 302.AI provider. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-31 14:48:30 +08:00
Stephen Hu	20b4d88098	Refactor: Improve the try catch logic for XinferenceEmbed (#9128 ) ### What problem does this PR solve? Improve the try catch logic for XinferenceEmbed ### Type of change - [x] Refactoring	2025-07-31 12:14:50 +08:00
Kevin Hu	d9fe279dde	Feat: Redesign and refactor agent module (#9113 ) ### What problem does this PR solve? #9082 #6365 <u> WARNING: it's not compatible with the older version of `Agent` module, which means that `Agent` from older versions can not work anymore.</u> ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-30 19:41:09 +08:00
謝富祥	021e8b57ae	Fix: fix error 429 api rate limit when building knowledge graph for all chat model and Mistral embedding model (#9106 ) ### What problem does this PR solve? fix error 429 api rate limit when building knowledge graph for all chat model and Mistral embedding model. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-30 11:37:49 +08:00
Yongteng Lei	39ef2ffba9	Feat: parsing supports jsonl or ldjson format (#9087 ) ### What problem does this PR solve? Supports jsonl or ldjson format. Feature request from [discussion](https://github.com/orgs/infiniflow/discussions/8774). ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-30 09:48:20 +08:00
Stephen Hu	ba563f8095	Update embedding_model.py (#9083 ) ### What problem does this PR solve? Reduce the logic scope for DefaultEmbedding ### Type of change - [x] Refactoring	2025-07-30 09:44:30 +08:00
Zhichang Yu	342a04ec8a	Added infinity rank_feature support (#9044 ) ### What problem does this PR solve? Added infinity rank_feature support ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-29 09:14:23 +08:00
Stephen Hu	86b4da0844	Refactor: Remove Useless split for BedrockEmbed (#9067 ) ### What problem does this PR solve? Remove Useless split for BedrockEmbed ### Type of change - [x] Refactoring	2025-07-28 10:16:38 +08:00
Stephen Hu	53b0b0e583	get keep alive from env (#9039 ) ### What problem does this PR solve? get keepalive from env ### Type of change - [x] Refactoring	2025-07-25 12:16:33 +08:00
pyyuhao	49f3f26622	Bug fix: OpenSearch chunk update some api error (#9032 ) ### What problem does this PR solve? Fix a small non-blocking main workflow bug about chunk update When OpenSearch is the doc engine. When you wanna enable/disable a chunk in the web-page “Knowledge Base / Dataset / Chunk”, the bug ocurred. <img width="2388" height="662" alt="image" src="https://github.com/user-attachments/assets/575987a0-c929-4589-bfa0-ba54e137cfd9" /> The reaseon why it ocurred is that some api params between OpenSearch and ES differs. It functioned well no matter enable/disable/rewrite the chunk after I fixed. I also checked the result when using the chat web-page. <img width="2394" height="660" alt="image" src="https://github.com/user-attachments/assets/8b899dc6-d769-4e80-8dd8-ad0fbbca5f78" /> I will still focus on vector-database espeically OpenSearch. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: 张雨豪 <zhangyh80@chinatelecom.cn> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-07-25 09:57:24 +08:00
Viktor Dmitriyev	b47dcc9108	Fix issue with `keep_alive=-1` for ollama chat model by allowing a user to set an additional configuration option (#9017 ) ### What problem does this PR solve? fix issue with `keep_alive=-1` for ollama chat model by allowing a user to set an additional configuration option. It is no-breaking change because it still uses a previous default value such as: `keep_alive=-1` ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [X] Performance Improvement - [X] Other (please describe): - Additional configuration option has been added to control behavior of RAGFlow while working with ollama LLM	2025-07-24 11:20:14 +08:00
Yongteng Lei	a2f73af1a4	Fix: typo Bearer token (#8998 ) ### What problem does this PR solve? Typo Bearer token. #8960 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-23 18:10:51 +08:00
Yongteng Lei	7ebc1f0943	Feat: add model provider DeepInfra (#9003 ) ### What problem does this PR solve? Add model provider DeepInfra. This model list comes from our community. NOTE: most endpoints haven't been tested, but they should work as OpenAI does. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-23 18:10:35 +08:00
Stephen Hu	ec21d9a98f	Refactor:remove use less convert for FastEmbed (#8984 ) ### What problem does this PR solve? remove use less convert for FastEmbed ### Type of change - [x] Refactoring	2025-07-23 10:51:48 +08:00
He Wang	175f5eaa90	use quote_plus to escape password in opendal's mysql url (#8976 ) ### What problem does this PR solve? Use `quote_plus` to escape password in opendal's mysql url to support special characters like `#`. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-23 10:17:34 +08:00
Kevin Hu	935ce872d8	Refa: remove temperature since some LLMs fail to support. (#8981 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-07-23 10:17:04 +08:00
Stephen Hu	95b9208b13	Fix:Improve float operation when rerank (#8963 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8915 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-22 10:04:00 +08:00
Kevin Hu	c783d90ba3	Perf: set timeout for building chunks. (#8940 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-21 15:56:45 +08:00
Stephen Hu	46caf6ae72	Refactor improve codes for ranker (#8936 ) ### What problem does this PR solve? Use the normalize method directly ### Type of change - [x] Refactoring	2025-07-21 10:22:20 +08:00
Stephen Hu	92cfbcb382	Fix: when parse markdown support extract image at local (#8906 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8902 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-18 17:06:58 +08:00
Kevin Hu	ecdb1701df	Perf: test llm before RAPTOR. (#8897 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-17 16:48:50 +08:00
湛露先生	fd97ce3e5a	fix s3 init config . (#8886 ) ### What problem does this PR solve? when``` if 'signature_version' in self.s3_config:``` and ```if 'addressing_style' in self.s3_config:``` both true. the config init is error, will be overwrite by last one. this pr is for fix that case. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-07-17 12:10:15 +08:00
Stephen Hu	38b34116dd	Refa: Remove useless conver and fix a bug for DefaultRerank (#8887 ) ### What problem does this PR solve? 1. bug when re-try, we need to reset i. 2. remove useless convert ### Type of change - [x] Refactoring	2025-07-17 12:09:50 +08:00
Kevin Hu	fbd115773b	Perf: set timeout of some steps in KG. (#8873 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-16 18:06:03 +08:00
Liu An	9e45fcfdb3	Fix: fix typo in OpenAI error logging message (#8865 ) ### What problem does this PR solve? Correct the logging message from "OpenAI cat_with_tools" to "OpenAI chat_with_tools" in the `_exceptions` method of the `Base` class to accurately reflect the method name and improve error traceability. ### Type of change - [x] Typo	2025-07-16 15:31:57 +08:00
Yongteng Lei	e9b14142a5	Fix: fixed invalid save() arguments for slide thumbnails (#8851 ) ### What problem does this PR solve? Fixed invalid save() arguments for slide thumbnails. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-15 17:19:45 +08:00
Kevin Hu	aa4a725529	Pref: use redis to check if canceled. (#8853 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-15 17:19:27 +08:00
Kevin Hu	24c41d2a61	Perf: make `do_cancel` quicker. (#8846 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-15 14:35:00 +08:00
Stephen Hu	5fa6f2f151	Update embedding_model.py (#8836 ) ### What problem does this PR solve? Remove useless covert for bge encode_queries ### Type of change - [x] Performance Improvement	2025-07-15 14:04:58 +08:00
Yongteng Lei	51a8604dcb	Fix: fixed context loss caused by separating markdown tables from original text (#8844 ) ### What problem does this PR solve? Fix context loss caused by separating markdown tables from original text. #6871, #8804. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-15 13:03:01 +08:00
Yongteng Lei	dbc2a8689a	Fix: no chunks parsed out for Law (#8842 ) ### What problem does this PR solve? Fixes no chunks parsed out for Law. #5113 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-15 13:01:56 +08:00
Kevin Hu	c642dbefca	Perf: Enhance timeout handling. (#8826 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2025-07-15 09:36:45 +08:00
Stephen Hu	ce140f1393	Fix:Better Support Table Value Type (#8822 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8782 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-14 17:51:26 +08:00
Stephen Hu	5383e254c4	Perf:Remove Useless Convert When BGE Embedding (#8816 ) ### What problem does this PR solve? FlagModel internal support returns as numpy ### Type of change - [x] Performance Improvement	2025-07-14 14:02:48 +08:00
Stephen Hu	f569401398	Fix: better_handle_different_types (#8775 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8719#issuecomment-3055883271 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-11 18:21:39 +08:00
Stephen Hu	2b7adbd2d1	Fix: Improve Memory Usage For Presentation (#8792 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8791 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-11 11:35:25 +08:00
Stephen Hu	07208e519b	Fix: Wrong_Input_type_for_Gemin (#8783 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8763#issuecomment-3055317110 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-11 11:34:04 +08:00
Yongteng Lei	1895667573	Feat: add xAI provider (#8781 ) ### What problem does this PR solve? Add xAI provider (experimental feature, requires user feedback). ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-11 10:35:23 +08:00
Kevin Hu	8281ceb406	Refa: refine retry gap. (#8773 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring - [x] Performance Improvement	2025-07-10 14:28:57 +08:00
Stephen Hu	8d027813f5	Refactor: Improve How To Handle QWenEmbed (#8765 ) ### What problem does this PR solve? Based on https://github.com/infiniflow/ragflow/issues/8740 1. A better handle for 'NoneType' object is not subscriptable 2. Add some logs to get the internal message ### Type of change - [x] Refactoring	2025-07-10 10:30:18 +08:00
Stephen Hu	19419281c3	Fix: Change Ollama Embedding Keep Alive (#8734 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8733 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-09 12:17:26 +08:00
Stephen Hu	00c954755e	Fix:use the same logic to handle pos in tokenize_chunks_with_images (#8732 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8719 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-09 09:31:40 +08:00
Stephen Hu	8af0d04ad0	Refactor:Improve the logic in search.py (#8716 ) ### What problem does this PR solve? 1. Remove the useless pop logic due to already been checked at the if logic 2. merge log logic ### Type of change - [x] Refactoring	2025-07-08 12:32:01 +08:00
Stephen Hu	e60ec0a31b	Fix:disallowed special token while embedding (#8692 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8567 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-07 14:13:37 +08:00
6607changchun	9580e99650	fix: retry embedding with Qwen family models when limits temporarily reached. (#8690 ) fix: retry embedding with Qwen family models when limits temporarily reached. APIs of Qwen family models are limited by calling rates. When reached, the "output" attribute of the "resp" will be None, and in turn cause TypeError when trying to retrieve "embeddings". Since these limits are almost temporary, I have added a simple retry mechanism to avoid it. Besides, if retry_max reached, the error can be early raised, instead of hidden behind "TypeError". ### What problem does this PR solve? Sometimes Qwen blocks calling due to rate limits, but it will cause the whole parsing procedure stops when creating knowledge base. In this situation, resp["output"] will be None, and resp["output"]["embeddings"] will cause TypeError. Since the limits are temporary, I apply a simple retry mechanism to solve it. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-07-07 12:15:52 +08:00
Kevin Hu	1e6bda735a	Fix: add ES re-connect once request timeout. (#8678 ) ### What problem does this PR solve? #8669 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-07 09:22:25 +08:00
kwrobel.eth	8a3b5d1d76	Fix a small typo in count of used fragments (#8673 ) ### What problem does this PR solve? Fix a small typo in count of used fragments. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-07-04 19:46:31 +08:00
Yongteng Lei	a306a6f158	Refa: refactor prompts into markdown-style structure using Jinja2 (#8667 ) ### What problem does this PR solve? Refactor prompts into markdown-style structure using Jinja2. ### Type of change - [x] Refactoring	2025-07-04 15:59:41 +08:00
Stephen Hu	d5f6335f99	Fix: The data set created by API call failed to parse after uploading the file. (#8657 ) ### What problem does this PR solve? https://github.com/infiniflow/ragflow/issues/8656 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-07-04 12:41:28 +08:00

1 2 3 4 5 ...

808 Commits