fetch
Download files from research data repositories (Zenodo, SRA, GEO, Figshare, Dataverse, OSF) and open-access literature to local disk, with checksum verification and a sidecar metadata file.
Instructions
Download a resource's files to local disk and return the PATHS (never the file contents). Fetchable: Zenodo, SRA (ENA FASTQ), GEO supplementary files, and DataCite-discovered Figshare/Dataverse/OSF deposits (md5-verified), and open-access literature full text (EuropePMC XML / Unpaywall PDF, unverified); a DataCite Dryad id is manifest-only (resolve lists its files but fetch fails loud), and other DataCite repos plus paywalled/non-OA literature ids fail loud. Fails loud if selected files exceed max_bytes unless force=true. Verifies checksums; writes a .dataresource.json sidecar.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| id | Yes | Source-prefixed id or bare Zenodo id | |
| dest | No | Destination dir (default managed cache) | |
| files | No | Glob over file names (default all) | |
| max_bytes | No | Byte ceiling before failing loud | |
| force | No | Override max_bytes | |
| extract | No | Unpack downloaded zip/tar archives into the destination (default false). Path-traversal-guarded; counts against max_bytes. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| paths | No | ||
| bytes | No | ||
| skipped | No |