paper_read
Extract text content from academic paper PDFs using identifiers from arXiv, PubMed, bioRxiv, medRxiv, IACR, Semantic Scholar, or CrossRef sources.
Instructions
Read and extract text content from academic paper PDFs from multiple sources.
Input Constraints:
searcher: Required, must be one of: arxiv, pubmed, biorxiv, medrxiv, iacr, semantic, crossref
paper_id: Required, 1-200 characters, cannot be empty
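The two constraints above can be checked on the client side before a call is made. The following is a minimal sketch, not the tool's actual validation logic: the function name `validate_paper_read_args` and the error messages are hypothetical, and only the allowed `searcher` values and the 1-200 character limit come from the constraints listed here.

```python
# Allowed values taken from the "Input Constraints" list above.
ALLOWED_SEARCHERS = {
    "arxiv", "pubmed", "biorxiv", "medrxiv", "iacr", "semantic", "crossref",
}
MAX_PAPER_ID_LENGTH = 200


def validate_paper_read_args(args: dict) -> list:
    """Hypothetical helper: return constraint violations (empty if args pass)."""
    errors = []

    # searcher is required and must be one of the documented source names.
    searcher = args.get("searcher")
    if not isinstance(searcher, str) or searcher not in ALLOWED_SEARCHERS:
        errors.append(f"searcher must be one of {sorted(ALLOWED_SEARCHERS)}")

    # paper_id is required, non-empty, and at most 200 characters long.
    paper_id = args.get("paper_id")
    if not isinstance(paper_id, str) or not 1 <= len(paper_id) <= MAX_PAPER_ID_LENGTH:
        errors.append("paper_id must be a string of 1-200 characters")

    return errors
```

An empty list means both constraints hold. The sketch only checks presence, membership, and length; it does not verify that `paper_id` has the right shape for the chosen searcher.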
Examples:
arXiv
paper_read({"searcher": "arxiv", "paper_id": "2106.12345", "save_path": "./downloads"}) # paper_id is arXiv ID.
PubMed
paper_read({"searcher": "pubmed", "paper_id": "32790614", "save_path": "./downloads"}) # paper_id is PubMed ID (PMID).
bioRxiv
paper_read({"searcher": "biorxiv", "paper_id": "10.1101/2020.01.01.123456", "save_path": "./downloads"}) # paper_id is bioRxiv DOI.
medRxiv
paper_read({"searcher": "medrxiv", "paper_id": "10.1101/2020.01.01.123456", "save_path": "./downloads"}) # paper_id is medRxiv DOI.
IACR
paper_read({"searcher": "iacr", "paper_id": "2009/101", "save_path": "./downloads"}) # paper_id is IACR paper ID.
Semantic Scholar
paper_read({"searcher": "semantic", "paper_id": "DOI:10.18653/v1/N18-3011", "save_path": "./downloads"}) # paper_id is a Semantic Scholar paper identifier in one of the following formats:
- Semantic Scholar ID (e.g., "649def34f8be52c8b66281af98ae884c09aef38b")
- DOI: (e.g., "DOI:10.18653/v1/N18-3011")
- ARXIV: (e.g., "ARXIV:2106.15928")
- MAG: (e.g., "MAG:112218234")
- ACL: (e.g., "ACL:W12-3903")
- PMID: (e.g., "PMID:19872477")
- PMCID: (e.g., "PMCID:2323736")
- URL: (e.g., "URL:https://arxiv.org/abs/2106.15928v1")
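Because the Semantic Scholar searcher accepts several identifier formats, it can help to tell them apart before calling the tool. The sketch below is illustrative only: `classify_semantic_id` and the `"S2"` label are hypothetical names, and the assumption that a bare Semantic Scholar ID is exactly 40 lowercase hexadecimal characters is inferred from the single example above, not stated by the tool.

```python
import re

# Prefixes taken from the list of accepted formats above.
KNOWN_PREFIXES = ("DOI", "ARXIV", "MAG", "ACL", "PMID", "PMCID", "URL")

# Assumption: a bare Semantic Scholar ID is 40 lowercase hex characters,
# as in the documented example.
_BARE_ID = re.compile(r"[0-9a-f]{40}")


def classify_semantic_id(paper_id: str) -> tuple:
    """Hypothetical helper: split a paper_id into (kind, value)."""
    # Split on the first colon only, so URLs keep their own "https:" part.
    prefix, sep, rest = paper_id.partition(":")
    if sep and prefix in KNOWN_PREFIXES and rest:
        return prefix, rest
    if _BARE_ID.fullmatch(paper_id):
        return "S2", paper_id
    raise ValueError(f"unrecognized Semantic Scholar paper_id: {paper_id!r}")
```

Splitting on the first colon only is the key design choice: a `URL:` identifier contains further colons that must stay in the value.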
CrossRef
paper_read({"searcher": "crossref", "paper_id": "10.1038/s41586-020-2649-2", "save_path": "./downloads"}) # paper_id is DOI.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
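| searcher | Yes | The source to read from; one of: arxiv, pubmed, biorxiv, medrxiv, iacr, semantic, crossref | |
| paper_id | Yes | The unique identifier of the paper to read (format depends on searcher); 1-200 characters | |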