unpaywall_fetch_pdf_text
Extract text from open access research papers by DOI or PDF URL to analyze academic content without accessing paywalls.
Instructions
Download and extract text from best OA PDF for a DOI, or from a provided PDF URL.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| doi | No | DOI string or DOI URL. Used if pdf_url is not provided. | |
| pdf_url | No | Direct PDF URL to download and parse (takes precedence over DOI). | |
| No | Email to identify requests to Unpaywall (required when resolving via DOI). | ||
| truncate_chars | No | Max characters of extracted text to return (default 20000). |