From 1434f8ade851af34986b8adc3cf791f1ae15a3f2 Mon Sep 17 00:00:00 2001 From: writinwaters <93570324+writinwaters@users.noreply.github.com> Date: Wed, 22 Apr 2026 20:00:06 +0800 Subject: [PATCH] Doc: two PDF parser optimizers are supported as of v0.25.0. (#14261) ### What problem does this PR solve? Multi-column layout detection is supported in v0.25.0 ### Type of change - [x] Documentation Update --- docs/guides/agent/agent_component_reference/parser.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/docs/guides/agent/agent_component_reference/parser.md b/docs/guides/agent/agent_component_reference/parser.md index 75b6341cb2..e036432f9c 100644 --- a/docs/guides/agent/agent_component_reference/parser.md +++ b/docs/guides/agent/agent_component_reference/parser.md @@ -31,6 +31,14 @@ The **Parser** component supports parsing the following file types: | Audio | MP3, WAV | | Video | MP4, AVI, MKV | +### Detect multi-column layout + +Optimizes the parser to detect and reorder multi-column pages into a logical sequence. Ideal for PDF documents with two-column or newspaper-style layouts. + +### Remove original table of contents + +Strips the original table of contents from PDF files. Once enabled, the table of contents is not chunked or parsed for retrieval. + ### PDF parser The output of a PDF parser is `json`. In the PDF parser, you select the parsing method that works best with your PDFs.