search_multiple_pdfs
Search for text patterns across multiple PDF files simultaneously using parallel processing to reduce search time and handle large batches efficiently.
Instructions
Search for text patterns across multiple PDF files in parallel. Processes files concurrently based on the parallelism factor for optimal performance. Increase parallelism (max: 50) to search more files simultaneously and reduce total search time. For large batches of files, prefer a single call with high parallelism rather than multiple smaller calls (e.g., search 100 files with parallelism=50 in one call instead of multiple calls with 20 files each). Returns matches and errors for each file separately.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| files | Yes | Array of PDF files to search. Each file must specify either absolute_path or relative_path. | |
| search_pattern | Yes | Search pattern: '/regex/flags' format or plain text. Applied to all files. | |
| parallelism | No | Number of files to process concurrently. Higher values = faster search. Default: 4, Max: 50 | |
| page_range | No | Page range to search in each file. Default: '1:' (all pages) | 1: |
| max_results_per_file | No | Max matches per file before stopping. Optional. | |
| max_pages_scanned_per_file | No | Max pages to scan per file. Optional. | |
| context_chars | No | Characters of context around matches. Default: 150 | |
| search_timeout | No | Timeout per file in milliseconds. Default: 10000 |