ragflow/docs/guides/dataset/autokeyword_autoquestion.mdx

---
sidebar_position: 3
slug: /autokeyword_autoquestion
---

# Auto-keyword Auto-question
import APITable from '@site/src/components/APITable';

Use a chat model to generate keywords or questions from each chunk in the knowledge base.

---

When selecting a chunking method, you can also enable auto-keyword or auto-question generation to increase retrieval rates. This feature uses a chat model to produce a specified number of keywords and questions from each created chunk, generating an "additional layer of information" from the original content.

:::caution WARNING
Enabling this feature increases document indexing time and uses extra tokens, as all created chunks will be sent to the chat model for keyword or question generation.
:::

## What is Auto-keyword?

Auto-keyword refers to the auto-keyword generation feature of RAGFlow. It uses a chat model to generate a set of keywords or synonyms from each chunk to correct errors and enhance retrieval accuracy. This feature is implemented as a slider under **Page rank** on the **Configuration** page of your knowledge base.

**Values**:  

- 0: (Default) Disabled.  
- Between 3 and 5 (inclusive): Recommended if you have chunks of approximately 1,000 characters. 
- 30 (maximum) 

:::tip NOTE
- If your chunk size increases, you can increase the value accordingly. Please note, as the value increases, the marginal benefit decreases. 
- An Auto-keyword value must be an integer. If you set it to a non-integer, say 1.7, it will be rounded down to the nearest integer, which in this case is 1.
:::

## What is Auto-question?

Auto-question is a feature of RAGFlow that automatically generates questions from chunks of data using a chat model. These questions (e.g. who, what, and why) also help correct errors and improve the matching of user queries. The feature usually works with FAQ retrieval scenarios involving product manuals or policy documents. And you can find this feature as a slider under **Page rank** on the **Configuration** page of your knowledge base.

**Values**:

- 0: (Default) Disabled.  
- 1 or 2: Recommended if you have chunks of approximately 1,000 characters.  
- 10 (maximum)

:::tip NOTE
- If your chunk size increases, you can increase the value accordingly. Please note, as the value increases, the marginal benefit decreases. 
- An Auto-question value must be an integer. If you set it to a non-integer, say 1.7, it will be rounded down to the nearest integer, which in this case is 1.
:::

## Tips from the community

The Auto-keyword or Auto-question values relate closely to the chunking size in your knowledge base. However, if you are new to this feature and unsure which value(s) to start with, the following are some value settings we gathered from our community. While they may not be accurate, they provide a starting point at the very least.

```mdx-code-block
<APITable>
```

| Use cases or typical scenarios                                      | Document volume/length          | Auto_keyword (0–30)        | Auto_question (0–10)       |
|---------------------------------------------------------------------|---------------------------------|----------------------------|----------------------------|
| Internal process guidance for employee handbook                     | Small, under 10 pages           | 0                          | 0                          |
| Customer service FAQs                                               | Medium, 10–100 pages            | 3–7                        | 1–3                        |
| Technical whitepapers: Development standards, protocol details      | Large, over 100 pages           | 2–4                        | 1–2                        |
| Contracts / Regulations / Legal clause retrieval                    | Large, over 50 pages            | 2–5                        | 0–1                        |
| Multi-repository layered new documents + old archive                | Many                            | Adjust as appropriate      |Adjust as appropriate       |
| Social media comment pool: multilingual & mixed spelling            | Very large volume of short text | 8–12                       | 0                          |
| Operational logs for troubleshooting                                | Very large volume of short text | 3–6                        | 0                          |
| Marketing asset library: multilingual product descriptions          | Medium                          | 6–10                       | 1–2                        |
| Training courses / eBooks                                           | Large                           | 2–5                        | 1–2                        |
| Maintenance manual: equipment diagrams + steps                      | Medium                          | 3–7                        | 1–2                        |

```mdx-code-block
</APITable>
```
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
+								---
 								sidebar_position: 3
 								slug: /autokeyword_autoquestion
 								---
 								# Auto-keyword Auto-question
 								import APITable from '@site/src/components/APITable';
-												Docs: Miscellaneous editorial updates (#8237)

### What problem does this PR solve?


### Type of change


- [x] Documentation Update
											
										
										
											2025-06-13 09:46:24 +08:00
+								Use a chat model to generate keywords or questions from each chunk in the knowledge base.
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
 								---
-												Docs: Miscellaneous editorial updates (#8237)

### What problem does this PR solve?


### Type of change


- [x] Documentation Update
											
										
										
											2025-06-13 09:46:24 +08:00
+								When selecting a chunking method, you can also enable auto-keyword or auto-question generation to increase retrieval rates. This feature uses a chat model to produce a specified number of keywords and questions from each created chunk, generating an "additional layer of information" from the original content.
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								:::caution WARNING
 								Enabling this feature increases document indexing time and uses extra tokens, as all created chunks will be sent to the chat model for keyword or question generation.
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
+								:::
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
+								## What is Auto-keyword?
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								Auto-keyword refers to the auto-keyword generation feature of RAGFlow. It uses a chat model to generate a set of keywords or synonyms from each chunk to correct errors and enhance retrieval accuracy. This feature is implemented as a slider under **Page rank** on the **Configuration** page of your knowledge base.
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								**Values**:
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
+								- 0: (Default) Disabled.
-												Docs: Miscellaneous editorial updates (#8237)

### What problem does this PR solve?


### Type of change


- [x] Documentation Update
											
										
										
											2025-06-13 09:46:24 +08:00
+								- Between 3 and 5 (inclusive): Recommended if you have chunks of approximately 1,000 characters.
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								- 30 (maximum)
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
 								:::tip NOTE
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								- If your chunk size increases, you can increase the value accordingly. Please note, as the value increases, the marginal benefit decreases.
 								- An Auto-keyword value must be an integer. If you set it to a non-integer, say 1.7, it will be rounded down to the nearest integer, which in this case is 1.
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
+								:::
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
+								## What is Auto-question?
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								Auto-question is a feature of RAGFlow that automatically generates questions from chunks of data using a chat model. These questions (e.g. who, what, and why) also help correct errors and improve the matching of user queries. The feature usually works with FAQ retrieval scenarios involving product manuals or policy documents. And you can find this feature as a slider under **Page rank** on the **Configuration** page of your knowledge base.
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								**Values**:
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
 								- 0: (Default) Disabled.
 								- 1 or 2: Recommended if you have chunks of approximately 1,000 characters.
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								- 10 (maximum)
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
 								:::tip NOTE
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								- If your chunk size increases, you can increase the value accordingly. Please note, as the value increases, the marginal benefit decreases.
 								- An Auto-question value must be an integer. If you set it to a non-integer, say 1.7, it will be rounded down to the nearest integer, which in this case is 1.
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
+								:::
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								## Tips from the community
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								The Auto-keyword or Auto-question values relate closely to the chunking size in your knowledge base. However, if you are new to this feature and unsure which value(s) to start with, the following are some value settings we gathered from our community. While they may not be accurate, they provide a starting point at the very least.
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
 								```mdx-code-block
 								<APITable>
 								```
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
+								| Use cases or typical scenarios                                      | Document volume/length          | Auto_keyword (0–30)        | Auto_question (0–10)       |
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
+								|---------------------------------------------------------------------|---------------------------------|----------------------------|----------------------------|
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
+								| Internal process guidance for employee handbook                     | Small, under 10 pages           | 0                          | 0                          |
 								| Customer service FAQs                                               | Medium, 10–100 pages            | 3–7                        | 1–3                        |
 								| Technical whitepapers: Development standards, protocol details      | Large, over 100 pages           | 2–4                        | 1–2                        |
 								| Contracts / Regulations / Legal clause retrieval                    | Large, over 50 pages            | 2–5                        | 0–1                        |
 								| Multi-repository layered new documents + old archive                | Many                            | Adjust as appropriate      |Adjust as appropriate       |
 								| Social media comment pool: multilingual & mixed spelling            | Very large volume of short text | 8–12                       | 0                          |
 								| Operational logs for troubleshooting                                | Very large volume of short text | 3–6                        | 0                          |
-												Docs: Miscellaneous (#8198)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-12 09:42:07 +08:00
+								| Marketing asset library: multilingual product descriptions          | Medium                          | 6–10                       | 1–2                        |
 								| Training courses / eBooks                                           | Large                           | 2–5                        | 1–2                        |
-												Docs: Updated Auto-question Auto-keyword (#8168)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-06-10 19:38:28 +08:00
+								| Maintenance manual: equipment diagrams + steps                      | Medium                          | 3–7                        | 1–2                        |
-												Docs: Added auto-keyword auto-question guide (#8113)

### What problem does this PR solve?

### Type of change


- [x] Documentation Update
											
										
										
											2025-06-06 19:27:41 +08:00
 								```mdx-code-block
 								</APITable>
 								```