ragflow/docs/guides/dataset/construct_knowledge_graph.md

---
sidebar_position: 8
slug: /construct_knowledge_graph
---

# Construct knowledge graph

Generate a knowledge graph for your dataset.

---

To enhance multi-hop question-answering, RAGFlow adds a knowledge graph construction step between data extraction and indexing, as illustrated below. This step creates additional chunks from existing ones generated by your specified chunking method.

![Image](https://github.com/user-attachments/assets/1ec21d8e-f255-4d65-9918-69b72dfa142b)

From v0.16.0 onward, RAGFlow supports constructing a knowledge graph on a dataset, allowing you to construct a *unified* graph across multiple files within your dataset. When a newly uploaded file starts parsing, the generated graph will automatically update.

:::danger WARNING
Constructing a knowledge graph requires significant memory, computational resources, and tokens.
:::

## Scenarios

Knowledge graphs are especially useful for multi-hop question-answering involving *nested* logic. They outperform traditional extraction approaches when you are performing question answering on books or works with complex entities and relationships.

:::tip NOTE
RAPTOR (Recursive Abstractive Processing for Tree Organized Retrieval) can also be used for multi-hop question-answering tasks. See [Enable RAPTOR](./enable_raptor.md) for details. You may use either approach or both, but ensure you understand the memory, computational, and token costs involved.
:::

## Prerequisites

The system's default chat model is used to generate knowledge graph. Before proceeding, ensure that you have a chat model properly configured:

![Set default models](https://raw.githubusercontent.com/infiniflow/ragflow-docs/main/images/set_default_models.jpg)

## Configurations

### Entity types (*Required*)

The types of the entities to extract from your dataset. The default types are: **organization**, **person**, **event**, and **category**. Add or remove types to suit your specific dataset.

### Method

The method to use to construct knowledge graph:

- **General**: Use prompts provided by [GraphRAG](https://github.com/microsoft/graphrag) to extract entities and relationships.
- **Light**: (Default) Use prompts provided by [LightRAG](https://github.com/HKUDS/LightRAG) to extract entities and relationships. This option consumes fewer tokens, less memory, and fewer computational resources.

### Entity resolution

Whether to enable entity resolution. You can think of this as an entity deduplication switch. When enabled, the LLM will combine similar entities - e.g., '2025' and 'the year of 2025', or 'IT' and 'Information Technology' - to construct a more effective graph.

- (Default) Disable entity resolution.
- Enable entity resolution. This option consumes more tokens.

### Community report generation

In a knowledge graph, a community is a cluster of entities linked by relationships. You can have the LLM generate an abstract for each community, known as a community report. See [here](https://www.microsoft.com/en-us/research/blog/graphrag-improving-global-search-via-dynamic-community-selection/) for more information. This indicates whether to generate community reports:

- Generate community reports. This option consumes more tokens.
- (Default) Do not generate community reports.

## Procedure

1. On the **Configuration** page of your dataset, switch on **Extract knowledge graph** or adjust its settings as needed, and click **Save** to confirm your changes.

   - *The default knowledge graph configurations for your dataset are now set and files uploaded from this point onward will automatically use these settings during parsing.*
   - *Files parsed before this update will retain their original knowledge graph settings.*

2. The knowledge graph of your dataset does *not* automatically update *until* a newly uploaded file is parsed.

   _A **Knowledge graph** entry appears under **Configuration** once a knowledge graph is created._

3. Click **Knowledge graph** to view the details of the generated graph.
4. To use the created knowledge graph, do either of the following:
   
   - In the **Chat setting** panel of your chat app, switch on the **Use knowledge graph** toggle.
   - If you are using an agent, click the **Retrieval** agent component to specify the dataset(s) and switch on the **Use knowledge graph** toggle.

## Frequently asked questions

### Can I have different knowledge graph settings for different files in my dataset?

Yes, you can. Just one graph is generated per dataset. The smaller graphs of your files will be *combined* into one big, unified graph at the end of the graph extraction process.

### Does the knowledge graph automatically update when I remove a related file?

Nope. The knowledge graph does *not* automatically update *until* a newly uploaded document is parsed.

### How to remove a generated knowledge graph?

To remove the generated knowledge graph, delete all related files in your dataset. Although the **Knowledge graph** entry will still be visible, the graph has actually been deleted.

### Where is the created knowledge graph stored?

All chunks of the created knowledge graph are stored in RAGFlow's document engine: either Elasticsearch or [Infinity](https://github.com/infiniflow/infinity).

### How to export a created knowledge graph?

Nope. Exporting a created knowledge graph is not supported. If you still consider this feature essential, please [raise an issue](https://github.com/infiniflow/ragflow/issues) explaining your use case and its importance.
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00			`---`
Miscellaneous editorial updates. (#7047) ### What problem does this PR solve? #6910 ### Type of change - [x] Documentation Update 2025-04-16 10:31:10 +08:00			`sidebar_position: 8`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00			`slug: /construct_knowledge_graph`
			`---`

			`# Construct knowledge graph`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`Generate a knowledge graph for your dataset.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`---`

DOC: Miscellaneous UI and editorial updates (#7324) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-04-27 11:44:08 +08:00			`To enhance multi-hop question-answering, RAGFlow adds a knowledge graph construction step between data extraction and indexing, as illustrated below. This step creates additional chunks from existing ones generated by your specified chunking method.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Added a guide on running a retrieval test, with and without knowledge graph (#5200) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-21 19:36:20 +08:00			`![Image](https://github.com/user-attachments/assets/1ec21d8e-f255-4d65-9918-69b72dfa142b)`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`From v0.16.0 onward, RAGFlow supports constructing a knowledge graph on a dataset, allowing you to construct a unified graph across multiple files within your dataset. When a newly uploaded file starts parsing, the generated graph will automatically update.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`:::danger WARNING`
			`Constructing a knowledge graph requires significant memory, computational resources, and tokens.`
			`:::`

			`## Scenarios`

			`Knowledge graphs are especially useful for multi-hop question-answering involving nested logic. They outperform traditional extraction approaches when you are performing question answering on books or works with complex entities and relationships.`

Added a RAPTOR guide (#7211) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-04-22 20:56:30 +08:00			`:::tip NOTE`
			`RAPTOR (Recursive Abstractive Processing for Tree Organized Retrieval) can also be used for multi-hop question-answering tasks. See [Enable RAPTOR](./enable_raptor.md) for details. You may use either approach or both, but ensure you understand the memory, computational, and token costs involved.`
			`:::`

Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00			`## Prerequisites`

Fix typo and error (#5479) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Documentation Update --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com> 2025-02-28 16:09:40 +08:00			`The system's default chat model is used to generate knowledge graph. Before proceeding, ensure that you have a chat model properly configured:`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
UI updates (#9836) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-08-30 21:44:58 +08:00			`![Set default models](https://raw.githubusercontent.com/infiniflow/ragflow-docs/main/images/set_default_models.jpg)`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`## Configurations`

			`### Entity types (Required)`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`The types of the entities to extract from your dataset. The default types are: organization, person, event, and category. Add or remove types to suit your specific dataset.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`### Method`

			`The method to use to construct knowledge graph:`

			`- General: Use prompts provided by [GraphRAG](https://github.com/microsoft/graphrag) to extract entities and relationships.`
			`- Light: (Default) Use prompts provided by [LightRAG](https://github.com/HKUDS/LightRAG) to extract entities and relationships. This option consumes fewer tokens, less memory, and fewer computational resources.`

			`### Entity resolution`

Added an FAQ (#5092) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 19:29:40 +08:00			`Whether to enable entity resolution. You can think of this as an entity deduplication switch. When enabled, the LLM will combine similar entities - e.g., '2025' and 'the year of 2025', or 'IT' and 'Information Technology' - to construct a more effective graph.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Added an FAQ (#5092) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 19:29:40 +08:00			`- (Default) Disable entity resolution.`
			`- Enable entity resolution. This option consumes more tokens.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`### Community report generation`

			`In a knowledge graph, a community is a cluster of entities linked by relationships. You can have the LLM generate an abstract for each community, known as a community report. See [here](https://www.microsoft.com/en-us/research/blog/graphrag-improving-global-search-via-dynamic-community-selection/) for more information. This indicates whether to generate community reports:`

Added an FAQ (#5092) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 19:29:40 +08:00			`- Generate community reports. This option consumes more tokens.`
			`- (Default) Do not generate community reports.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`## Procedure`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`1. On the Configuration page of your dataset, switch on Extract knowledge graph or adjust its settings as needed, and click Save to confirm your changes.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`- The default knowledge graph configurations for your dataset are now set and files uploaded from this point onward will automatically use these settings during parsing.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00			`- Files parsed before this update will retain their original knowledge graph settings.`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`2. The knowledge graph of your dataset does not automatically update until a newly uploaded file is parsed.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Miscelleneous editorial updates (#5390) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-26 19:03:50 +08:00			`_A Knowledge graph entry appears under Configuration once a knowledge graph is created._`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Miscelleneous editorial updates (#5390) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-26 19:03:50 +08:00			`3. Click Knowledge graph to view the details of the generated graph.`
0321 chunkmethods (#6520) ### What problem does this PR solve? #6061 ### Type of change - [x] Documentation Update 2025-03-26 09:03:18 +08:00			`4. To use the created knowledge graph, do either of the following:`

UI updates (#9836) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-08-30 21:44:58 +08:00			`- In the Chat setting panel of your chat app, switch on the Use knowledge graph toggle.`
Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`- If you are using an agent, click the Retrieval agent component to specify the dataset(s) and switch on the Use knowledge graph toggle.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`## Frequently asked questions`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`### Can I have different knowledge graph settings for different files in my dataset?`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`Yes, you can. Just one graph is generated per dataset. The smaller graphs of your files will be combined into one big, unified graph at the end of the graph extraction process.`
Added a guide on running a retrieval test, with and without knowledge graph (#5200) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-21 19:36:20 +08:00
			`### Does the knowledge graph automatically update when I remove a related file?`

Docs: Restructured MCP-specific documents (#7565) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-05-12 09:49:54 +08:00			`Nope. The knowledge graph does not automatically update until a newly uploaded document is parsed.`
Added a guide on running a retrieval test, with and without knowledge graph (#5200) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-21 19:36:20 +08:00
			`### How to remove a generated knowledge graph?`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`To remove the generated knowledge graph, delete all related files in your dataset. Although the Knowledge graph entry will still be visible, the graph has actually been deleted.`
Docs: Initial editorial pass to MCP server (#7359) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-05-07 19:40:45 +08:00
			`### Where is the created knowledge graph stored?`

Docs: exporting created knowledge graphs is not supported (#8465) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [x] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): 2025-06-25 09:21:54 +08:00			`All chunks of the created knowledge graph are stored in RAGFlow's document engine: either Elasticsearch or [Infinity](https://github.com/infiniflow/infinity).`

			`### How to export a created knowledge graph?`

			`Nope. Exporting a created knowledge graph is not supported. If you still consider this feature essential, please [raise an issue](https://github.com/infiniflow/ragflow/issues) explaining your use case and its importance.`