medrxiv-mcp
Allows retrieval of full text of medRxiv preprints from the official medRxiv AWS S3 archive (Requester-Pays) using the user's own AWS credentials.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@medrxiv-mcpSearch medRxiv for recent preprints on long-COVID and summarize abstracts."
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
medrxiv-mcp
MCP server that turns a medRxiv DOI into clean markdown full text β plus free, relevance-ranked preprint search.
π Full-text retrieval β give it a DOI, get the whole paper as clean markdown (sections, tables, figure captions)
π Free search β relevance-ranked medRxiv search via Europe PMC, returns DOI + title + abstract + date
π©Ί Source of truth β full text comes from the official medRxiv
.mecaText-and-Data-Mining archive, not scraped HTMLβ‘ Lazy local index β a DOIβfile index is built on demand and cached in sqlite, so repeat fetches in a month are instant
π Your data, your key β full text reads a Requester-Pays S3 bucket with your own AWS key; nothing is shipped or shared
bioRxiv has its own package:
biorxiv-mcp.
What is this?
This is a Model Context Protocol (MCP) server that gives AI assistants the full text of medRxiv preprints. Search is free (via the Europe PMC REST API). Full text is resolved from the official s3://medrxiv-src-monthly archive: the server maps a DOI to its month, scans that month's .meca archives once to build a local DOIβfile index, downloads the one matching archive, extracts its JATS XML, and converts it to GitHub-flavored markdown with pandoc.
Designed for AI assistants like Claude to read primary literature directly. Works with any MCP-compatible client (Claude Desktop, Claude Code, or custom integrations).
Related MCP server: medRxiv-MCP-Server
Installation
uvx --from git+https://github.com/yogsoth-ai/medrxiv-mcp medrxiv-mcpNo manual install needed β uvx fetches and runs it. The bundled pandoc binary ships with the package, so you do not need a system pandoc.
Quick Start
1. Add to your MCP client
Claude Code β .mcp.json in your project root
Claude Desktop β claude_desktop_config.json (~/Library/Application Support/Claude/ on macOS, %APPDATA%\Claude\ on Windows)
{
"mcpServers": {
"medrxiv": {
"command": "uvx",
"args": ["--from", "git+https://github.com/yogsoth-ai/medrxiv-mcp", "medrxiv-mcp"],
"env": {
"AWS_ACCESS_KEY_ID": "<your key id>",
"AWS_SECRET_ACCESS_KEY": "<your secret>",
"AWS_DEFAULT_REGION": "us-east-1"
}
}
}
}2. Supply your AWS key
search_preprints is free and needs no key. fetch_fulltext reads a Requester-Pays bucket and does β see below.
3. Restart your MCP client
The server starts automatically when the client needs it.
Requires your own AWS key (Requester-Pays)
medRxiv full text is only reliably reachable through the official s3://medrxiv-src-monthly Text-and-Data-Mining bucket, which is Requester-Pays: you supply an AWS key, and your account pays the (tiny) transfer cost. The package never ships a key β each user brings their own.
Costs are small:
Action | Cost |
| free (Europe PMC, no AWS) |
First | ~$0.03 one-time (indexes that month) |
Each | well under $0.01 |
Setup: in the AWS IAM console, create a user, attach the AmazonS3ReadOnlyAccess policy, create an access key, and put it in the env block above. Deactivate the key whenever you're done.
Optional env:
Variable | Default | Purpose |
|
| where the local DOIβfile sqlite cache lives |
|
| threads used when indexing a month |
Available Tools
Tool | Description |
| Search medRxiv by keyword (relevance-ranked, free via Europe PMC). Returns |
| Given a DOI, return the preprint's full text as markdown (reads the Requester-Pays S3 archive). |
The intended workflow: search_preprints to find a paper and get its DOI cheaply, then fetch_fulltext on that DOI when you want to read it.
Example Queries
Ask Claude things like:
"Search medRxiv for recent preprints on long-COVID cardiovascular outcomes and summarize the top 3 abstracts"
"Fetch the full text of a medRxiv DOI and walk me through its statistical methods"
"Find preprints on a vaccine-efficacy trial, then read the most relevant one in full"
For Developers
git clone https://github.com/yogsoth-ai/medrxiv-mcp.git
cd medrxiv-mcp
pip install -e .
python -m pytest -v # offline suite β no AWS, no network
python -m medrxiv_mcp.server # run the server locally (needs AWS env for fetch)The offline test suite covers the non-trivial logic (JATSβmarkdown conversion and the .meca DOI parser) against fixtures it builds itself β no AWS spend, no network. Live search and S3 fetch are smoke-tested manually.
Links
π GitHub repository
𧬠biorxiv-mcp β the bioRxiv twin
π Europe PMC REST API
π License
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/yogsoth-ai/medrxiv-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server