parse_pdf_to_text
Download a PDF from a URL and extract all text content page by page. Use this to read full text of financial documents such as annual reports.
Instructions
Download a PDF from a URL and extract all text content, page by page.
Use this to read the full text of a specific document — for example, an annual report PDF linked from a search_filings result. Best combined with search_filings: use search_filings to locate the document, then parse_pdf_to_text for the full text. Do not use for PDFs that are already well-represented in the database — search_filings is faster and returns pre-ranked, relevant excerpts. Not suitable for scanned (image-only) PDFs without embedded text; those pages will be returned as "(no extractable text)".
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| pdf_url | Yes | Direct HTTPS URL to the PDF file |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |