Submit a receipt image or PDF for asynchronous AI extraction of receipt data. Receive a job ID to poll for status, with optional webhook notification. Returns extracted data for further processing.
An MCP server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
Extract the table of contents from a PDF attachment as a hierarchical markdown list with page numbers, helping you quickly orient in a paper before fetching full text.
Converts PDF to reading-order Markdown for LLM consumption. Reconstructs up to 2 content columns, infers headings from font size, detects lists; tables rendered as plain text.
Extract structured data from documents using custom field definitions, combining information from multiple files into a single record with enhanced AI accuracy for complex layouts and low-quality scans.