Document Parser

Batch extract structured fields from PDFs, images, and text documents with annotated visual output and optional comparison against expected values.

About This Tool

Document Parser is a Claude Code skill that extracts structured field data from batches of documents: PDFs, images, Word files, and plain text. It produces annotated visual output showing exactly where each value was found in the source document.

The tool supports two modes: extracting a list of fields across documents, or comparing extracted values against a user-provided table of expected values for tick-and-tie verification. In comparison mode, extraction agents never see the expected values, so the extracted results cannot be biased toward them.
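
The separation between extraction and comparison can be pictured with a small Python sketch. This is an illustration only, not the skill's actual implementation: the function names (extract_fields, compare), the Comparison record, the "Field: value" line format, and the sample invoice are all hypothetical stand-ins. The point it shows is that the extractor is called with the document and the field list only, and the expected table is consulted afterwards.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Comparison:
    document: str
    field: str
    extracted: Optional[str]
    expected: str
    match: bool


def normalize(value):
    # Collapse whitespace and case so trivial formatting differences
    # are not reported as mismatches.
    return " ".join(str(value).split()).casefold()


def extract_fields(document_text, field_names):
    # Hypothetical stand-in for an extraction agent: reads "Field: value" lines.
    # Note that it receives no expected values at all.
    found = {}
    for line in document_text.splitlines():
        name, sep, value = line.partition(":")
        if sep and name.strip() in field_names:
            found[name.strip()] = value.strip()
    return found


def compare(documents, field_names, expected):
    # Extract first, then tick-and-tie against the expected table.
    results = []
    for doc_name, text in documents.items():
        extracted = extract_fields(text, field_names)  # expected is not passed in
        for field in field_names:
            want = expected[doc_name][field]
            got = extracted.get(field)
            ok = got is not None and normalize(got) == normalize(want)
            results.append(Comparison(doc_name, field, got, want, ok))
    return results


# Assumed sample data, invented for this example.
documents = {"invoice_001.txt": "Invoice Number: INV-1001\nTotal: 250.00"}
expected = {"invoice_001.txt": {"Invoice Number": "inv-1001", "Total": "275.00"}}

results = compare(documents, ["Invoice Number", "Total"], expected)
for r in results:
    status = "match" if r.match else "MISMATCH"
    print(f"{r.document} | {r.field}: {r.extracted!r} vs {r.expected!r} -> {status}")
```

In this sketch the invoice number ties out despite the case difference, while the total is flagged because the extracted value differs from the expected one. Keeping the expected table out of the extractor's arguments is what prevents the extraction step from being steered toward the answer.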

Key Features

Access the Tool

Open source under the MIT License. Free to use, modify, and distribute.