docs/guides/agent/agent_component_reference/chunker_title.md

---
sidebar_position: 31
slug: /chunker_title_component
sidebar_custom_props: {
  categoryIcon: LucideBlocks
}
---
# Title chunker component

A component that splits texts into chunks by heading level.

---

A **Token chunker** component is a text splitter that uses specified heading level as delimiter to define chunk boundaries and create chunks.

## Scenario

A **Title chunker** component is optional, usually placed immediately after **Parser**.

:::caution WARNING
Placing a **Title chunker** after a **Token chunker** is invalid and will cause an error. Please note that this restriction is not currently system-enforced and requires your attention.
:::

## Configurations

### Hierarchy or Group

Select how a document is split:

- Hierarchy: Construct a heading tree and produce self-contained chunks, each carrying its full ancestral path (e.g. Part 1 › Chapter 3 › Section 2 + body text). Best for highly structured texts — such as legal statutes, regulations, contracts, and technical specs — where each chunk must be identifiable by its position in the hierarchy.
- Group: Split the document flat at a chosen heading level, merging adjacent small sections to ensure semantic flow. Chunks exclude ancestral path. Best for documents with flowing, contextually connected content — such as books, manuals, reports, and articles — where narrative coherence depends on keeping adjacent paragraphs together.

#### Separate parent-heading content

:::tip NOTE
Available only when **Hierarchy** is selected.
:::

When enabled, chunks include only their heading path and content; content immediately following a parent heading is kept as a separate chunk.

#### Set first chunk as global context

:::tip NOTE
Available only when **Hierarchy** is selected.
:::

Treats the first split as a global heading to maintain consistent context across the document hierarchy. Ideal for resumes where the first section identifies the subject.

#### H3

Specifies the heading level to define chunk boundaries: 

- H1
- H2
- H3 (Default)
- H4
- H5

Click **+ Add regular expressions** to add heading levels here or update the corresponding **Regular Expressions** fields for custom heading patterns.

### Output

The global variable name for the output of the **Title chunker** component, which can be referenced by subsequent components in the ingestion pipeline.

- Default: `chunks`
- Type: `Array<Object>`
-												Docs: Added token chunker and title chunker components (#10711)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-21 20:11:23 +08:00
+								---
 								sidebar_position: 31
 								slug: /chunker_title_component
-												docs: update docs icons (#12465)

### What problem does this PR solve?

Update icons for docs.
Trailing spaces are auto truncated by the editor, does not affect real
content.

### Type of change

- [x] Documentation Update
											
										
										
											2026-01-07 10:00:09 +08:00
+								sidebar_custom_props: {
 								  categoryIcon: LucideBlocks
 								}
-												Docs: Added token chunker and title chunker components (#10711)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-21 20:11:23 +08:00
+								---
 								# Title chunker component
 								A component that splits texts into chunks by heading level.
 								---
 								A **Token chunker** component is a text splitter that uses specified heading level as delimiter to define chunk boundaries and create chunks.
 								## Scenario
 								A **Title chunker** component is optional, usually placed immediately after **Parser**.
 								:::caution WARNING
 								Placing a **Title chunker** after a **Token chunker** is invalid and will cause an error. Please note that this restriction is not currently system-enforced and requires your attention.
 								:::
 								## Configurations
-												Docs: Updated Title chunker references (#14483)

### What problem does this PR solve?

Updated Title chunker references

### Type of change

- [x] Documentation Update
											
										
										
											2026-04-29 19:37:24 +08:00
+								### Hierarchy or Group
 								Select how a document is split:
 								- Hierarchy: Construct a heading tree and produce self-contained chunks, each carrying its full ancestral path (e.g. Part 1 › Chapter 3 › Section 2 + body text). Best for highly structured texts — such as legal statutes, regulations, contracts, and technical specs — where each chunk must be identifiable by its position in the hierarchy.
 								- Group: Split the document flat at a chosen heading level, merging adjacent small sections to ensure semantic flow. Chunks exclude ancestral path. Best for documents with flowing, contextually connected content — such as books, manuals, reports, and articles — where narrative coherence depends on keeping adjacent paragraphs together.
 								#### Separate parent-heading content
 								:::tip NOTE
 								Available only when **Hierarchy** is selected.
 								:::
 								When enabled, chunks include only their heading path and content; content immediately following a parent heading is kept as a separate chunk.
 								#### Set first chunk as global context
 								:::tip NOTE
 								Available only when **Hierarchy** is selected.
 								:::
 								Treats the first split as a global heading to maintain consistent context across the document hierarchy. Ideal for resumes where the first section identifies the subject.
 								#### H3
-												Docs: Added token chunker and title chunker components (#10711)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-21 20:11:23 +08:00
-												revert white-space changes in docs (#12557)

### What problem does this PR solve?

Trailing white-spaces in commit 6814ace1aa1d449b792f2a87d5ee5686e41b3081
got automatically trimmed by code editor may causes documentation
typesetting broken.

Mostly for double spaces for soft line breaks.  

### Type of change

- [x] Documentation Update
											
										
										
											2026-01-13 09:41:02 +08:00
+								Specifies the heading level to define chunk boundaries:
-												Docs: Added token chunker and title chunker components (#10711)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-21 20:11:23 +08:00
 								- H1
 								- H2
 								- H3 (Default)
 								- H4
-												Docs: Updated Title chunker references (#14483)

### What problem does this PR solve?

Updated Title chunker references

### Type of change

- [x] Documentation Update
											
										
										
											2026-04-29 19:37:24 +08:00
+								- H5
-												Docs: Added token chunker and title chunker components (#10711)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-21 20:11:23 +08:00
-												Docs: Updated Title chunker references (#14483)

### What problem does this PR solve?

Updated Title chunker references

### Type of change

- [x] Documentation Update
											
										
										
											2026-04-29 19:37:24 +08:00
+								Click **+ Add regular expressions** to add heading levels here or update the corresponding **Regular Expressions** fields for custom heading patterns.
-												Docs: Added token chunker and title chunker components (#10711)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-21 20:11:23 +08:00
 								### Output
-												Fix typo (#10737)

### What problem does this PR solve?

Chunkder to Chunker

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-23 03:25:15 +02:00
+								The global variable name for the output of the **Title chunker** component, which can be referenced by subsequent components in the ingestion pipeline.
-												Docs: Added token chunker and title chunker components (#10711)

### What problem does this PR solve?

### Type of change

- [x] Documentation Update
											
										
										
											2025-10-21 20:11:23 +08:00
 								- Default: `chunks`
 								- Type: `Array<Object>`