LightRAG

mirror of https://github.com/HKUDS/LightRAG.git synced 2025-11-26 15:07:11 +00:00

Author	SHA1	Message	Date
yangdx	6cab68bb47	Improve KG chunk selection documentation and configuration clarity	2025-08-15 10:09:44 +08:00
yangdx	2a46667ac9	Add OpenAI frequency penalty sample env params	2025-08-14 01:50:27 +08:00
yangdx	5a40ff654e	Change KG chunk selection default to VECTOR - Set KG_CHUNK_PICK_METHOD default to VECTOR - Update env.example with new config option	2025-08-13 23:10:42 +08:00
yangdx	095e0cbfa2	Refac: Add workspace infomation to all logger output for all storage type	2025-08-12 01:19:09 +08:00
yangdx	f5ac6a9f4b	Add default Ollama embedding context length - Set default context length to 8192 - Overide the default context lenght for LLM in binding_options.py	2025-08-08 13:51:25 +08:00
yangdx	dec4148075	Merge branch 'main' into Matt23-star/main	2025-08-08 02:24:34 +08:00
yangdx	f38e10559e	Update PostgreSQL vector index configuration - Remove FLAT index support - Standardize on HNSW as default - Add dimension validation - Improve error logging - Clean up index creation code	2025-08-08 02:21:06 +08:00
yangdx	f4ef254de2	fix(neo4j): enhance connection lifecycle management to prevent timeout errors - Add max_connection_lifetime, liveness_check_timeout, keep_alive parameters - Extend retry mechanisms for connection reset scenarios - Update config examples with new Neo4j connection options - Resolves ClientTimeoutException during data insertion operations	2025-08-08 01:07:45 +08:00
Matt23-star	727ca43d3c	feat: add vector index creation functionality for PostgreSQL	2025-08-07 23:07:18 +08:00
yangdx	7780776af6	Update env.example	2025-08-06 18:50:58 +08:00
yangdx	8dd3069005	feat: add tiktoken cache directory support for offline deployment - Mount tiktoken cache volume in docker-compose - Add TIKTOKEN_CACHE_DIR environment variable - Update env.example with tiktoken cache configuration - Added /data to gitignore - Added tiktoken env example	2025-08-05 14:22:19 +08:00
yangdx	c5babf61d7	Feat: Change embedding formats from float to base64 for efficiency - Add base64 support for Jina embeddings - Add base64 support for OpenAI embeddings - Update env.example with new embedding options	2025-08-05 11:38:40 +08:00
yangdx	adf7ec8e35	feat: Add OpenAI LLM Options support with BindingOptions framework - Add OpenAILLMOptions dataclass with full OpenAI API parameter support - Integrate OpenAI options in config.py for automatic binding detection - Update server functions to inject OpenAI options for openai/azure_openai bindings	2025-08-05 03:47:26 +08:00
yangdx	3099748668	Add temperature fallback for Ollama LLM binding - Implement OLLAMA_LLM_TEMPERATURE env var - Fallback to global TEMPERATURE if unset - Remove redundant OllamaLLMOptions logic - Update env.example with new setting	2025-08-05 01:50:09 +08:00
yangdx	e5e3f0f878	Fix(Ollama option): change stop option from string to list and add fallback global temperature setting	2025-08-04 19:43:14 +08:00
yangdx	9d5603d35e	Set the default LLM temperature to 1.0 and centralize constant management	2025-07-31 17:15:10 +08:00
yangdx	3c530b21b6	Update README	2025-07-31 13:00:09 +08:00
yangdx	c6bd9f0329	Disable conversation history by default - Set default history_turns to 0 - Mark history_turns as deprecated - Remove history_turns from example - Update documentation comments	2025-07-31 12:28:42 +08:00
yangdx	9923821d75	refactor: Remove deprecated `max_token_size` from embedding configuration This parameter is no longer used. Its removal simplifies the API and clarifies that token length management is handled by upstream text chunking logic rather than the embedding wrapper.	2025-07-29 10:49:35 +08:00
yangdx	75d1b1e9f8	Update Ollama context length configuration - Rename OLLAMA_NUM_CTX to OLLAMA_LLM_NUM_CTX - Increase default context window size - Add requirement for minimum context size - Update documentation examples	2025-07-29 09:53:37 +08:00
yangdx	f2ffff063b	feat: refactor ollama server configuration management - Add ollama_server_infos attribute to LightRAG class with default initialization - Move default values to constants.py for centralized configuration - Refactor OllamaServerInfos class with property accessors and CLI support - Update OllamaAPI to get configuration through rag object instead of direct import - Add command line arguments for simulated model name and tag - Fix type imports to avoid circular dependencies	2025-07-28 01:38:35 +08:00
yangdx	519e81aaeb	Update env.example	2025-07-27 18:21:10 +08:00
yangdx	ebaff228aa	feat: Add rerank score filtering with configurable threshold - Add DEFAULT_MIN_RERANK_SCORE constant (default: 0.0) - Add MIN_RERANK_SCORE environment variable support - Filter chunks with rerank scores below threshold in process_chunks_unified - Add info-level logging for filtering operations - Handle empty results gracefully after filtering - Maintain backward compatibility with non-reranked chunks	2025-07-27 16:37:44 +08:00
yangdx	055629d30d	Reduce default max total tokens to 30k	2025-07-27 10:33:06 +08:00
yangdx	b3c2987006	Reduce default MAX_TOKENS from 32000 to 10000	2025-07-26 08:13:49 +08:00
yangdx	ccfe2e73d1	Update env.example	2025-07-25 11:13:15 +08:00
zrguo	55ddc0ee86	fix typo	2025-07-24 15:20:05 +08:00
yangdx	29d1220f26	Merge branch 'main' into context-builder	2025-07-24 14:18:12 +08:00
yangdx	b1dd015e3e	Update env.example	2025-07-24 14:17:49 +08:00
yangdx	f57ed21593	Merge branch 'main' into context-builder	2025-07-24 14:07:05 +08:00
yangdx	2767212ba0	Fix linting	2025-07-24 12:25:50 +08:00
yangdx	d979e9078f	feat: Integrate Jina embeddings API support - Implemented Jina embedding function - Add new EMBEDDING_BINDING type of jina for LightRAG Server - Add env var sample	2025-07-24 12:15:00 +08:00
yangdx	edeb89abc6	Update comments	2025-07-24 10:09:24 +08:00
yangdx	795ce4cbe7	Update RELATED_CHUNK_NUMBER comment	2025-07-24 10:06:33 +08:00
yangdx	2c940f0728	reduce RELATED_CHUNK_NUMBER from 10 to 5	2025-07-24 02:49:05 +08:00
yangdx	df8b4202f3	feat: Add SSL support for PostgreSQL database connections - Add SSL configuration options (ssl_mode, ssl_cert, ssl_key, ssl_root_cert, ssl_crl) - Support all PostgreSQL SSL modes (disable, allow, prefer, require, verify-ca, verify-full) - Add SSL context creation with certificate validation - Update initdb() method to handle SSL connection parameters - Add SSL environment variables to env.example - Maintain backward compatibility with existing non-SSL configurations	2025-07-21 02:03:06 +08:00
yangdx	2c7d2b3f5f	Increase Neo4j connection pool size and timeouts - Bump default connection pool size to 100 - Add new Neo4j timeout env variables to env.example	2025-07-19 13:27:34 +08:00
yangdx	96b94acc83	Enhance Redis connection handling with retries and timeouts - Added Redis connection timeout configurations - Implemented retry logic for Redis operations - Updated error handling for timeout cases - Improved connection pool management - Added environment variable support	2025-07-19 10:15:26 +08:00
yangdx	8103b200db	Set DEFAULT_HISTORY_TURNS to 0	2025-07-16 02:20:27 +08:00
yangdx	6e084bfae1	Increase default related chunk number from 5 to 10	2025-07-16 00:22:34 +08:00
yangdx	5f7cb437e8	Centralize query parameters into LightRAG class This commit refactors query parameter management by consolidating settings like `top_k`, token limits, and thresholds into the `LightRAG` class, and consistently sourcing parameters from a single location.	2025-07-15 23:56:49 +08:00
yangdx	26865c2cd0	Update env.example	2025-07-15 17:08:23 +08:00
yangdx	47341d3a71	Merge branch 'main' into rerank	2025-07-15 16:12:33 +08:00
yangdx	e8e1f6ab56	feat: centralize environment variable defaults in constants.py	2025-07-15 16:11:50 +08:00
zrguo	91d0f65476	Update QueryParam	2025-07-15 14:21:58 +08:00
yangdx	ccc2a20071	feat: remove deprecated MAX_TOKEN_SUMMARY parameter to prevent LLM output truncation - Remove MAX_TOKEN_SUMMARY parameter and related configurations - Eliminate forced token-based truncation in entity/relationship descriptions - Switch to fragment-count based summarization logic using FORCE_LLM_SUMMARY_ON_MERGE - Update FORCE_LLM_SUMMARY_ON_MERGE default from 6 to 4 for better summarization - Clean up documentation, environment examples, and API display code - Preserve backward compatibility by graceful parameter removal This change resolves issues where LLMs were forcibly truncating entity relationship descriptions mid-sentence, leading to incomplete and potentially inaccurate knowledge graph content. The new approach allows LLMs to generate complete descriptions while still providing summarization when multiple fragments need to be merged. Breaking Change: None - parameter removal is backward compatible Fixes: Entity relationship description truncation issues	2025-07-15 12:26:33 +08:00
zrguo	7c882313bb	remove chunk_rerank_top_k	2025-07-15 11:52:34 +08:00
zrguo	4e425b1b59	Revert "update from main" This reverts commit 1d0376d6a926ef60d641af4406dacf5b8bbb430f.	2025-07-14 16:29:00 +08:00
zrguo	1d0376d6a9	update from main	2025-07-14 16:27:49 +08:00
zrguo	c9cbd2d3e0	Merge branch 'main' into rerank	2025-07-14 16:24:29 +08:00

1 2 3

120 Commits