---
sidebar_position: 32
slug: /chunker_token_component
---

# Token chunker component

A component that splits text into chunks, respecting a maximum token limit and using delimiters to find optimal breakpoints.

---

The **Token chunker** component is a text splitter that creates chunks up to a recommended maximum token length, using delimiters to place chunk boundaries at logical breakpoints. It splits long texts into appropriately sized, semantically related chunks.


## Scenario

The **Token chunker** component is optional and is usually placed immediately after a **Parser** or **Title chunker** component.

## Configurations

### Recommended chunk size

The recommended maximum number of tokens in each chunk. The **Token chunker** component creates chunks at the specified delimiters. If the token limit is reached before a delimiter is found, the chunk is cut at that point instead.
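The behavior described above can be illustrated with a minimal Python sketch. This is not RAGFlow's implementation: the function name `chunk_by_tokens` is hypothetical, and whitespace-separated words stand in for tokens, whereas a real pipeline would count tokens with the model's tokenizer.

```python
import re


def chunk_by_tokens(text: str, chunk_size: int, delimiters=("\n",)) -> list[str]:
    """Greedy sketch: pack delimiter-separated segments into chunks of at
    most `chunk_size` tokens. Assumption: one whitespace-separated word
    counts as one token."""
    if delimiters:
        pattern = "|".join(re.escape(d) for d in delimiters)
        segments = re.split(pattern, text)
    else:
        segments = [text]

    chunks: list[str] = []
    current: list[str] = []  # words collected for the chunk being built
    for segment in segments:
        words = segment.split()
        if not words:
            continue
        # Close the current chunk at this delimiter if the segment would not fit.
        if current and len(current) + len(words) > chunk_size:
            chunks.append(" ".join(current))
            current = []
        # No delimiter occurs before the limit: cut at the limit itself.
        while len(words) > chunk_size:
            chunks.append(" ".join(words[:chunk_size]))
            words = words[chunk_size:]
        current.extend(words)
    if current:
        chunks.append(" ".join(current))
    return chunks


sample = "one two three\nfour five\nsix seven eight nine ten eleven"
print(chunk_by_tokens(sample, chunk_size=4))
# ['one two three', 'four five', 'six seven eight nine', 'ten eleven']
```

In this sketch, the first two chunks end at a `\n` delimiter, while the third is cut at the four-token limit because its line contains no delimiter before the limit is reached.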

### Overlapped percent (%)

The percentage of overlap between adjacent chunks. An appropriate degree of overlap preserves semantic coherence across chunk boundaries without passing excessive, redundant tokens to the LLM.

- Default: 0%
- Maximum: 30%
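As a rough illustration of what an overlap percentage means, the Python sketch below prepends the trailing portion of each chunk to the chunk that follows it. The function name `add_overlap` is hypothetical, words stand in for tokens, and the source does not state how RAGFlow computes or positions the overlap, so treat this as one plausible interpretation rather than the component's actual logic.

```python
def add_overlap(chunks: list[str], overlap_percent: int) -> list[str]:
    """Sketch: carry the last `overlap_percent` percent of each chunk's
    words over to the start of the next chunk.
    Assumption: one whitespace-separated word counts as one token."""
    if not 0 <= overlap_percent <= 30:
        raise ValueError("overlap_percent must be between 0 and 30")

    result: list[str] = []
    for i, chunk in enumerate(chunks):
        if i == 0 or overlap_percent == 0:
            result.append(chunk)
            continue
        prev_words = chunks[i - 1].split()
        # Number of trailing words to repeat, rounded down.
        n = len(prev_words) * overlap_percent // 100
        tail = prev_words[len(prev_words) - n:]
        result.append(" ".join(tail + chunk.split()))
    return result


print(add_overlap(["a b c d e f g h i j", "k l m n"], 20))
# ['a b c d e f g h i j', 'i j k l m n']
```

With a 20% overlap, the second chunk starts with the last two of the first chunk's ten words. A larger percentage repeats more context at the cost of more duplicated tokens.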


### Delimiters

The delimiters used as chunk breakpoints. Defaults to `\n`. To remove a delimiter, click the **Recycle bin** button to its right; to add one, click **+ Add**.


### Output

The global variable name for the output of the **Token chunker** component, which can be referenced by subsequent components in the ingestion pipeline.

- Default: `chunks`
- Type: `Array<Object>`