ragflow/docs/guides/dataset/construct_knowledge_graph.md

---
sidebar_position: 8
slug: /construct_knowledge_graph
---

# Construct knowledge graph

Generate a knowledge graph for your dataset.

---

To enhance multi-hop question-answering, RAGFlow adds a knowledge graph construction step between data extraction and indexing, as illustrated below. This step creates additional chunks from existing ones generated by your specified chunking method.

![Image](https://github.com/user-attachments/assets/1ec21d8e-f255-4d65-9918-69b72dfa142b)

From v0.16.0 onward, RAGFlow supports constructing a knowledge graph on a dataset, allowing you to construct a *unified* graph across multiple files within your dataset. When a newly uploaded file starts parsing, the generated graph will automatically update.

:::danger WARNING
Constructing a knowledge graph requires significant memory, computational resources, and tokens.
:::

## Scenarios

Knowledge graphs are especially useful for multi-hop question-answering involving *nested* logic. They outperform traditional extraction approaches when you are performing question answering on books or works with complex entities and relationships.

:::tip NOTE
RAPTOR (Recursive Abstractive Processing for Tree Organized Retrieval) can also be used for multi-hop question-answering tasks. See [Enable RAPTOR](./enable_raptor.md) for details. You may use either approach or both, but ensure you understand the memory, computational, and token costs involved.
:::

## Prerequisites

The system's default chat model is used to generate knowledge graph. Before proceeding, ensure that you have a chat model properly configured:

![Set default models](https://raw.githubusercontent.com/infiniflow/ragflow-docs/main/images/set_default_models.jpg)

## Configurations

### Entity types (*Required*)

The types of the entities to extract from your dataset. The default types are: **organization**, **person**, **event**, and **category**. Add or remove types to suit your specific dataset.

### Method

The method to use to construct knowledge graph:

- **General**: Use prompts provided by [GraphRAG](https://github.com/microsoft/graphrag) to extract entities and relationships.
- **Light**: (Default) Use prompts provided by [LightRAG](https://github.com/HKUDS/LightRAG) to extract entities and relationships. This option consumes fewer tokens, less memory, and fewer computational resources.

### Entity resolution

Whether to enable entity resolution. You can think of this as an entity deduplication switch. When enabled, the LLM will combine similar entities - e.g., '2025' and 'the year of 2025', or 'IT' and 'Information Technology' - to construct a more effective graph.

- (Default) Disable entity resolution.
- Enable entity resolution. This option consumes more tokens.

### Community reports

In a knowledge graph, a community is a cluster of entities linked by relationships. You can have the LLM generate an abstract for each community, known as a community report. See [here](https://www.microsoft.com/en-us/research/blog/graphrag-improving-global-search-via-dynamic-community-selection/) for more information. This indicates whether to generate community reports:

- Generate community reports. This option consumes more tokens.
- (Default) Do not generate community reports.

## Quickstart

1. Navigate to the **Configuration** page of your dataset and update:
   
   - Entity types: *Required* - Specifies the entity types in the knowledge graph to generate. You don't have to stick with the default, but you need to customize them for your documents.
   - Method: *Optional*
   - Entity resolution: *Optional*
   - Community reports: *Optional*
   *The default knowledge graph configurations for your dataset are now set.*

2. Navigate to the **Files** page of your dataset, click the **Generate** button on the top right corner of the page, then select **Knowledge graph** from the dropdown to initiate the knowledge graph generation process.

   *You can click the pause button in the dropdown to halt the build process when necessary.*

3. Go back to the **Configuration** page:  
   
   *Once a knowledge graph is generated, the **Knowledge graph** field changes from `Not generated` to `Generated at a specific timestamp`. You can delete it by clicking the recycle bin button to the right of the field.*

4. To use the created knowledge graph, do either of the following:
   
   - In the **Chat setting** panel of your chat app, switch on the **Use knowledge graph** toggle.
   - If you are using an agent, click the **Retrieval** agent component to specify the dataset(s) and switch on the **Use knowledge graph** toggle.

## Frequently asked questions

### Does the knowledge graph automatically update when I remove a related file?

Nope. The knowledge graph does *not* update *until* you regenerate a knowledge graph for your dataset.

### How to remove a generated knowledge graph?

On the **Configuration** page of your dataset, find the **Knoweledge graph** field and click the recycle bin button to the right of the field.

### Where is the created knowledge graph stored?

All chunks of the created knowledge graph are stored in RAGFlow's document engine: either Elasticsearch or [Infinity](https://github.com/infiniflow/infinity).

### How to export a created knowledge graph?

Nope. Exporting a created knowledge graph is not supported. If you still consider this feature essential, please [raise an issue](https://github.com/infiniflow/ragflow/issues) explaining your use case and its importance.
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00			`---`
Miscellaneous editorial updates. (#7047) ### What problem does this PR solve? #6910 ### Type of change - [x] Documentation Update 2025-04-16 10:31:10 +08:00			`sidebar_position: 8`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00			`slug: /construct_knowledge_graph`
			`---`

			`# Construct knowledge graph`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`Generate a knowledge graph for your dataset.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`---`

DOC: Miscellaneous UI and editorial updates (#7324) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-04-27 11:44:08 +08:00			`To enhance multi-hop question-answering, RAGFlow adds a knowledge graph construction step between data extraction and indexing, as illustrated below. This step creates additional chunks from existing ones generated by your specified chunking method.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Added a guide on running a retrieval test, with and without knowledge graph (#5200) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-21 19:36:20 +08:00			`![Image](https://github.com/user-attachments/assets/1ec21d8e-f255-4d65-9918-69b72dfa142b)`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`From v0.16.0 onward, RAGFlow supports constructing a knowledge graph on a dataset, allowing you to construct a unified graph across multiple files within your dataset. When a newly uploaded file starts parsing, the generated graph will automatically update.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`:::danger WARNING`
			`Constructing a knowledge graph requires significant memory, computational resources, and tokens.`
			`:::`

			`## Scenarios`

			`Knowledge graphs are especially useful for multi-hop question-answering involving nested logic. They outperform traditional extraction approaches when you are performing question answering on books or works with complex entities and relationships.`

Added a RAPTOR guide (#7211) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-04-22 20:56:30 +08:00			`:::tip NOTE`
			`RAPTOR (Recursive Abstractive Processing for Tree Organized Retrieval) can also be used for multi-hop question-answering tasks. See [Enable RAPTOR](./enable_raptor.md) for details. You may use either approach or both, but ensure you understand the memory, computational, and token costs involved.`
			`:::`

Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00			`## Prerequisites`

Fix typo and error (#5479) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Documentation Update --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com> 2025-02-28 16:09:40 +08:00			`The system's default chat model is used to generate knowledge graph. Before proceeding, ensure that you have a chat model properly configured:`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
UI updates (#9836) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-08-30 21:44:58 +08:00			`![Set default models](https://raw.githubusercontent.com/infiniflow/ragflow-docs/main/images/set_default_models.jpg)`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`## Configurations`

			`### Entity types (Required)`

Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`The types of the entities to extract from your dataset. The default types are: organization, person, event, and category. Add or remove types to suit your specific dataset.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`### Method`

			`The method to use to construct knowledge graph:`

			`- General: Use prompts provided by [GraphRAG](https://github.com/microsoft/graphrag) to extract entities and relationships.`
			`- Light: (Default) Use prompts provided by [LightRAG](https://github.com/HKUDS/LightRAG) to extract entities and relationships. This option consumes fewer tokens, less memory, and fewer computational resources.`

			`### Entity resolution`

Added an FAQ (#5092) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 19:29:40 +08:00			`Whether to enable entity resolution. You can think of this as an entity deduplication switch. When enabled, the LLM will combine similar entities - e.g., '2025' and 'the year of 2025', or 'IT' and 'Information Technology' - to construct a more effective graph.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Added an FAQ (#5092) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 19:29:40 +08:00			`- (Default) Disable entity resolution.`
			`- Enable entity resolution. This option consumes more tokens.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`### Community reports`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`In a knowledge graph, a community is a cluster of entities linked by relationships. You can have the LLM generate an abstract for each community, known as a community report. See [here](https://www.microsoft.com/en-us/research/blog/graphrag-improving-global-search-via-dynamic-community-selection/) for more information. This indicates whether to generate community reports:`

Added an FAQ (#5092) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 19:29:40 +08:00			`- Generate community reports. This option consumes more tokens.`
			`- (Default) Do not generate community reports.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`## Quickstart`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`1. Navigate to the Configuration page of your dataset and update:`

			`- Entity types: Required - Specifies the entity types in the knowledge graph to generate. You don't have to stick with the default, but you need to customize them for your documents.`
			`- Method: Optional`
			`- Entity resolution: Optional`
			`- Community reports: Optional`
			`The default knowledge graph configurations for your dataset are now set.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`2. Navigate to the Files page of your dataset, click the Generate button on the top right corner of the page, then select Knowledge graph from the dropdown to initiate the knowledge graph generation process.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`You can click the pause button in the dropdown to halt the build process when necessary.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`3. Go back to the Configuration page:`

			Once a knowledge graph is generated, the Knowledge graph* field changes from `Not generated` to `Generated at a specific timestamp`. You can delete it by clicking the recycle bin button to the right of the field.*
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
0321 chunkmethods (#6520) ### What problem does this PR solve? #6061 ### Type of change - [x] Documentation Update 2025-03-26 09:03:18 +08:00			`4. To use the created knowledge graph, do either of the following:`

UI updates (#9836) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-08-30 21:44:58 +08:00			`- In the Chat setting panel of your chat app, switch on the Use knowledge graph toggle.`
Docs: Knowledge base renamed to dataset. (#10269) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-09-25 09:45:27 +08:00			`- If you are using an agent, click the Retrieval agent component to specify the dataset(s) and switch on the Use knowledge graph toggle.`
Added a graphrag guide (#4978) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-18 13:42:06 +08:00
			`## Frequently asked questions`

Added a guide on running a retrieval test, with and without knowledge graph (#5200) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-21 19:36:20 +08:00			`### Does the knowledge graph automatically update when I remove a related file?`

Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`Nope. The knowledge graph does not update until you regenerate a knowledge graph for your dataset.`
Added a guide on running a retrieval test, with and without knowledge graph (#5200) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-02-21 19:36:20 +08:00
			`### How to remove a generated knowledge graph?`

Docs: Updated dataset configuration, KG building and RAPTOR building for v0.21.0 (#10584) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-10-15 16:39:26 +08:00			`On the Configuration page of your dataset, find the Knoweledge graph field and click the recycle bin button to the right of the field.`
Docs: Initial editorial pass to MCP server (#7359) ### What problem does this PR solve? ### Type of change - [x] Documentation Update 2025-05-07 19:40:45 +08:00
			`### Where is the created knowledge graph stored?`

Docs: exporting created knowledge graphs is not supported (#8465) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [x] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): 2025-06-25 09:21:54 +08:00			`All chunks of the created knowledge graph are stored in RAGFlow's document engine: either Elasticsearch or [Infinity](https://github.com/infiniflow/infinity).`

			`### How to export a created knowledge graph?`

			`Nope. Exporting a created knowledge graph is not supported. If you still consider this feature essential, please [raise an issue](https://github.com/infiniflow/ragflow/issues) explaining your use case and its importance.`