find_text
Locate specific text within PDFs and retrieve their coordinates. Supports regex for advanced search patterns, enabling precise text extraction from any page range.
Instructions
Input Schema
Name | Required | Description | Default |
---|---|---|---|
api_key | No | PDF.co API key. If not provided, will use X_API_KEY environment variable. (Optional) | |
httppassword | No | HTTP auth password if required to access source url. (Optional) | |
httpusername | No | HTTP auth user name if required to access source url. (Optional) | |
pages | No | Comma-separated list of page indices (or ranges) to process. Leave empty for all pages. Example: '0,2-5,7-'. The first-page index is 0. (Optional) | |
password | No | Password of the PDF file. (Optional) | |
regexSearch | No | Set to True to enable regular expressions in the search string. (Optional) | |
searchString | Yes | Text to search. Can support regular expressions if regexSearch is set to True. | |
url | Yes | URL to the source PDF file. Supports publicly accessible links including Google Drive, Dropbox, PDF.co Built-In Files Storage. Use 'upload_file' tool to upload local files. | |
wordMatchingMode | No | Values can be either SmartMatch, ExactMatch, or None. (Optional) |