4753 Commits

Author SHA1 Message Date
Daniel.y
c4544c23a7
Merge pull request #1876 from Ja1aia/main
Fix: corrected unterminated f-string in config.py
2025-07-29 14:12:57 +08:00
administrator
c26dfa33de Fix: corrected unterminated f-string in config.py 2025-07-29 11:21:23 +07:00
Daniel.y
7a5df185a5
Merge pull request #1875 from danielaskdd/remove-embedding-max-token-size
refactor: Remove deprecated `max_token_size` from embedding configura…
2025-07-29 11:23:54 +08:00
yangdx
9923821d75 refactor: Remove deprecated max_token_size from embedding configuration
This parameter is no longer used. Its removal simplifies the API and clarifies that token length management is handled by upstream text chunking logic rather than the embedding wrapper.
2025-07-29 10:49:35 +08:00
yangdx
d26d413d97 Merge branch 'patch-1' 2025-07-29 09:57:58 +08:00
yangdx
f4c2dc327d Fix linting 2025-07-29 09:57:41 +08:00
yangdx
75d1b1e9f8 Update Ollama context length configuration
- Rename OLLAMA_NUM_CTX to OLLAMA_LLM_NUM_CTX
- Increase default context window size
- Add requirement for minimum context size
- Update documentation examples
2025-07-29 09:53:37 +08:00
yangdx
645f81f7c8 fixes a critical bug where Ollama options were not being applied correctly
`dict.update()` modifies the dictionary in-place and returns `None`.
2025-07-29 09:52:25 +08:00
yangdx
9bdbdae120 Update README 2025-07-29 08:22:48 +08:00
Michele Comitini
bd94714b15 options needs to be passed to ollama client embed() method
Fix line length

Create binding_options.py

Remove test property

Add dynamic binding options to CLI and environment config

Automatically generate command-line arguments and environment variable
support for all LLM provider bindings using BindingOptions. Add sample
.env generation and extensible framework for new providers.

Add example option definitions and fix test arg check in OllamaOptions

Add options_dict method to BindingOptions for argument parsing

Add comprehensive Ollama binding configuration options

ruff formatting Apply ruff formatting to binding_options.py

Add Ollama separate options for embedding and LLM

Refactor Ollama binding options and fix class var handling

The changes improve how class variables are handled in binding options
and better organize the Ollama-specific options into LLM and embedding
subclasses.

Fix typo in arg test.

Rename cls parameter to klass to avoid keyword shadowing

Fix Ollama embedding binding name typo

Fix ollama embedder context param name

Split Ollama options into LLM and embedding configs with mixin base

Add Ollama option configuration to LLM and embeddings in lightrag_server

Update sample .env generation and environment handling

Conditionally add env vars and cmdline options only when ollama bindings
are used. Add example env file for Ollama binding options.
2025-07-28 12:05:40 +02:00
yangdx
ee53e43568 Update webui assets 2025-07-28 02:52:32 +08:00
yangdx
84b09aa5da feat: add threshold status line to StatusCard with i18n support
- Add cosine_threshold, min_rerank_score, related_chunk_number to LightragStatus type
2025-07-28 02:51:36 +08:00
Daniel.y
1613f4410e
Merge pull request #1873 from danielaskdd/add-status
Feat(webui): enhance status card with new settings from health endpoint
2025-07-28 02:34:44 +08:00
yangdx
769f77ef8f Update webui assets 2025-07-28 02:26:07 +08:00
yangdx
958ed80b66 Update translation strings for document processing
- Renamed "maxParallelInsert" to be more descriptive
- Removed "maxTokens" from all translations
2025-07-28 02:24:41 +08:00
yangdx
5aceca0052 feat(webui): enhance status card with new health endpoint data
- Update StatusCard to display consolidated server info with parallel insert limits and summary settings
- Merge LLM and embedding configurations with async parameters for cleaner display
- Add new status fields to TypeScript interface (summary_language, max_parallel_insert, etc.)
2025-07-28 02:19:27 +08:00
yangdx
98ac6fb3f0 Bump api version to 0192 2025-07-28 01:42:51 +08:00
yangdx
f2ffff063b feat: refactor ollama server configuration management
- Add ollama_server_infos attribute to LightRAG class with default initialization
- Move default values to constants.py for centralized configuration
- Refactor OllamaServerInfos class with property accessors and CLI support
- Update OllamaAPI to get configuration through rag object instead of direct import
- Add command line arguments for simulated model name and tag
- Fix type imports to avoid circular dependencies
2025-07-28 01:38:35 +08:00
yangdx
598eecd06d Refactor: Rename llm_model_max_token_size to summary_max_tokens
This commit renames the parameter 'llm_model_max_token_size' to 'summary_max_tokens' for better clarity, as it specifically controls the token limit for entity relation summaries.
2025-07-28 00:49:08 +08:00
yangdx
d0d57a45b6 feat: add environment variables to /health endpoint and centralize defaults
- Add 9 environment variables to /health endpoint configuration section
- Centralize default constants in lightrag/constants.py for consistency
- Update config.py to use centralized defaults for better maintainability
2025-07-28 00:30:56 +08:00
yangdx
9c4e98ec3b Unify entity extraction prompt between passes
- Disallow hallucinated info in descriptions
- Align reminder steps with main extraction
2025-07-27 23:06:55 +08:00
Daniel.y
4eef9f3778
Merge pull request #1845 from AkosLukacs/patch-2
Better prompt for entity description extraction to avoid hallucinations
2025-07-27 22:38:08 +08:00
yangdx
3951a44666 Revert file_path build method, built from related chunks 2025-07-27 21:56:20 +08:00
yangdx
d70c584d80 Bump api version to 0191 2025-07-27 21:24:53 +08:00
Daniel.y
7da485dd40
Merge pull request #1872 from danielaskdd/main
Fix: Improve keyword extraction prompt for robust JSON output.
2025-07-27 21:21:47 +08:00
yangdx
35734baa5c Merge remote-tracking branch 'upstream/main' 2025-07-27 21:12:29 +08:00
yangdx
f2d051eea5 Fix: Improve keyword extraction prompt for robust JSON output.
*   Emphasize strict JSON output in key extration prompt
*   Clean up prompt examples in key extration prompt
*   Log raw LLM response on JSON error
2025-07-27 21:10:47 +08:00
yangdx
519e81aaeb Update env.example 2025-07-27 18:21:10 +08:00
yangdx
3f5ade47cd Update README 2025-07-27 17:26:49 +08:00
Daniel.y
2054bba7e6
Merge pull request #1871 from danielaskdd/min-rank-score
feat: Add rerank score filtering with configurable threshold
2025-07-27 17:01:22 +08:00
yangdx
e09929b42e Refine rerank filtering log message for clarity 2025-07-27 16:57:38 +08:00
yangdx
f4bca7bfb2 Fix linting 2025-07-27 16:50:45 +08:00
yangdx
a9565d7379 feat: Skip rerank filtering when min_rerank_score is 0.0 2025-07-27 16:50:12 +08:00
yangdx
ebaff228aa feat: Add rerank score filtering with configurable threshold
- Add DEFAULT_MIN_RERANK_SCORE constant (default: 0.0)
- Add MIN_RERANK_SCORE environment variable support
- Filter chunks with rerank scores below threshold in process_chunks_unified
- Add info-level logging for filtering operations
- Handle empty results gracefully after filtering
- Maintain backward compatibility with non-reranked chunks
2025-07-27 16:37:44 +08:00
Daniel.y
358fbd689f
Merge pull request #1869 from danielaskdd/file_paths
refactor: unify file_path handling across merge and rebuild functions
2025-07-27 12:38:44 +08:00
yangdx
99e3812c38 refactor: unify file_path handling across merge and rebuild functions
- Replace simple string concatenation with build_file_path() in:
  - _merge_edges_then_upsert
  - _rebuild_single_entity
  - _rebuild_single_relationship
- Ensures consistent deduplication, length limiting, and error handling
- Aligns with existing _merge_nodes_then_upsert implementation
2025-07-27 12:37:24 +08:00
Daniel.y
c6cfbee3e8
Merge pull request #1868 from danielaskdd/optimize-prompt
Refine entity continuation prompt to avoid duplicates.
2025-07-27 10:49:39 +08:00
yangdx
cf1ca39b3f Refine entity continuation prompt to avoid duplicates.
- Clarify finding missing entities
- Instruct not to repeat extractions
2025-07-27 10:48:29 +08:00
yangdx
0dfbce0bb4 Update the README to clarify the explanation of concurrent processes. 2025-07-27 10:39:28 +08:00
yangdx
055629d30d Reduce default max total tokens to 30k 2025-07-27 10:33:06 +08:00
yangdx
a67f93acc9 Replace hardcoded max tokens with DEFAULT_MAX_TOTAL_TOKENS constant
- Use constant in process_chunks_unified
- Update WebUI default to match (32000)
2025-07-26 11:23:54 +08:00
yangdx
7b915b34f6 Refactor: move build_file_path function from operate.py to utils.py 2025-07-26 10:52:59 +08:00
yangdx
c8c3545454 refactor: extract file path length limit to shared constant
• Add DEFAULT_MAX_FILE_PATH_LENGTH constant
• Replace hardcoded 4090 in Milvus impl
2025-07-26 10:45:03 +08:00
yangdx
8e7014d366 Merge branch 'separator_file_path' 2025-07-26 10:39:03 +08:00
yangdx
a943265257 fix: preserve file path order in build_file_path function 2025-07-26 10:21:32 +08:00
yangdx
6efa8ab263 Improve file path length warning message clarity and urgency
• Change debug to warning level
• Simplify message wording
2025-07-26 10:00:18 +08:00
Daniel.y
2ed046171e
Merge pull request #1863 from HKUDS/fix-sigma-null-blendFunc
fix(webui): Correct edge renderer for sigma.js v3
2025-07-26 08:47:47 +08:00
yangdx
e7baf54ec2 Update webui assets 2025-07-26 08:43:12 +08:00
yangdx
2c4f621ded fix(webui): Correct edge renderer for sigma.js v3
The `curvedNoArrow` edge type was incorrectly configured, causing a `TypeError` during graph rendering. This commit updates the `edgeProgramClasses` in `GraphViewer.tsx` to use the `createEdgeCurveProgram()` factory function as required by `@sigma/edge-curve` v3, resolving the crash.
2025-07-26 08:42:19 +08:00
xuewei
56c3cb2dbe Improve build_file_path log 2025-07-26 08:38:02 +08:00