828 B
828 B
name, description, metadata
| name | description | metadata | |||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| pdf-extract | 从PDF文件中提取文本以供LLM处理。 |
|
PDF Extract
Extract text from PDF files for LLM processing. Uses pdftotext from the poppler-utils package to convert PDF documents into plain text.
Commands
# Extract all text from a PDF
pdf-extract "document.pdf"
# Extract text from specific pages
pdf-extract "document.pdf" --pages 1-5
Install
sudo dnf install poppler-utils