ArXiv MCP Server
The ArXiv MCP Server enables AI assistants to interact with arXiv papers through a programmatic interface. With this server, you can:
Search Papers: Query arXiv with filters for date ranges, categories, and result limits
Download Papers: Retrieve papers by arXiv ID, storing them locally for faster access
List Papers: View all previously downloaded papers stored locally
Read Papers: Access the full content of downloaded papers in markdown format
Research Tools: Utilize specialized prompts like "deep-paper-analysis" for comprehensive paper review
Development: Set up environments and run tests for development purposes
Provides a bridge between AI assistants and arXiv's research repository through the Message Control Protocol (MCP). Allows AI models to search for papers, download and read their content in a programmatic way.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@ArXiv MCP Serversearch for recent papers about large language models in the cs.AI category"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
ArXiv MCP Server
๐ Enable AI assistants to search and access arXiv papers through a simple MCP interface.
The ArXiv MCP Server provides a bridge between AI assistants and arXiv's research repository through the Model Context Protocol (MCP). It allows AI models to search for papers and access their content in a programmatic way.
๐ค Contribute โข ๐ Report Bug
โจ Core Features
๐ Paper Search: Query arXiv papers with filters for date ranges and categories
๐ Paper Access: Download and read paper content
๐ Paper Listing: View all downloaded papers
๐๏ธ Local Storage: Papers are saved locally for faster access
๐ Prompts: A set of research prompts for paper analysis
Related MCP server: bioRxiv-MCP-Server
๐ Security
Prompt Injection Risk
Paper content retrieved from arXiv is untrusted external input.
When an AI assistant downloads or reads a paper through this server, the paper's text is passed directly into the model's context. A maliciously crafted paper could embed adversarial instructions designed to hijack the AI's behavior โ for example, instructing it to exfiltrate data, invoke other tools with unintended arguments, or override system-level instructions. This is a known class of attack described by OWASP as LLM01: Prompt Injection and by the OWASP Agentic AI framework as AG01: Prompt Injection in LLM-Integrated Systems.
Recommended Mitigations
Use read-only MCP configurations โ where possible, configure the MCP client so that the arxiv-mcp-server cannot trigger write operations or invoke other tools on your behalf.
Review paper content before acting on AI summaries โ if an AI summary asks you to run commands or visit external URLs that were not part of your original request, treat that as a red flag.
Be cautious in multi-tool setups โ agentic pipelines that combine this server with filesystem, shell, or browser tools are higher risk; a prompt injection in a paper could chain tool calls unexpectedly.
Treat AI-generated summaries as data, not instructions โ always apply human judgment before executing any action the AI recommends after reading a paper.
References
๐ Quick Start
Installing via Smithery
To install ArXiv Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install arxiv-mcp-server --client claudeInstalling Manually
Important โ use
uv tool install, notuv pip installRunning
uv pip install arxiv-mcp-serverinstalls the package into the current virtual environment but does not place thearxiv-mcp-serverexecutable on yourPATH. You must useuv tool installso that uv creates an isolated environment and exposes the executable globally:
uv tool install arxiv-mcp-serverAfter this, the arxiv-mcp-server command will be available on your PATH.
PDF fallback (older papers): Most arXiv papers have an HTML version which the base install handles automatically. For older papers that only have a PDF, the server needs the
[pdf]extra (pymupdf4llm). Install it with:uv tool install 'arxiv-mcp-server[pdf]'
You can verify it with:
arxiv-mcp-server --helpIf you previously ran uv pip install arxiv-mcp-server and the command is
missing, uninstall it and re-install with uv tool install as shown above.
For development:
# Clone and set up development environment
git clone https://github.com/blazickjp/arxiv-mcp-server.git
cd arxiv-mcp-server
# Create and activate virtual environment
uv venv
source .venv/bin/activate
# Install with test dependencies (development only โ no global executable)
uv pip install -e ".[test]"๐ MCP Integration
Add this configuration to your MCP client config file:
{
"mcpServers": {
"arxiv-mcp-server": {
"command": "uv",
"args": [
"tool",
"run",
"arxiv-mcp-server",
"--storage-path", "/path/to/paper/storage"
]
}
}
}For Development:
{
"mcpServers": {
"arxiv-mcp-server": {
"command": "uv",
"args": [
"--directory",
"path/to/cloned/arxiv-mcp-server",
"run",
"arxiv-mcp-server",
"--storage-path", "/path/to/paper/storage"
]
}
}
}๐ Security Note
arXiv papers are user-generated, untrusted content. Paper text returned by this server may contain prompt injection attempts โ crafted text designed to manipulate an AI assistant's behavior. Treat all paper content as untrusted input.
In production environments, apply appropriate sandboxing and avoid feeding raw paper content into agentic pipelines that have access to sensitive tools or data without review. See SECURITY.md for the full security policy.
๐ก Available Tools
Core Workflow
The typical workflow for deep paper research is:
search_papers โ download_paper โ read_paperlist_papers shows what you have locally. semantic_search searches across your local collection.
1. Paper Search
Search arXiv with optional category, date, and boolean filters. Enforces arXiv's 3-second rate limit automatically. If rate limited, wait 60 seconds before retrying.
result = await call_tool("search_papers", {
"query": "\"KAN\" OR \"Kolmogorov-Arnold Networks\"",
"max_results": 10,
"date_from": "2024-01-01",
"categories": ["cs.LG", "cs.AI"],
"sort_by": "date" # or "relevance" (default)
})Supported categories include cs.AI, cs.LG, cs.CL, cs.CV, cs.NE, stat.ML, math.OC, quant-ph, eess.SP, and more. See tool description for the full list.
2. Paper Download
Download a paper by its arXiv ID. Tries HTML first, falls back to PDF. Stores the paper locally for read_paper and semantic_search.
result = await call_tool("download_paper", {
"paper_id": "2401.12345"
})For older papers that only have a PDF, install the
[pdf]extra:uv tool install 'arxiv-mcp-server[pdf]'
3. List Papers
List all papers downloaded locally. Returns arXiv IDs only โ use read_paper to access content.
result = await call_tool("list_papers", {})4. Read Paper
Read the full text of a locally downloaded paper in markdown. Requires download_paper to be called first.
result = await call_tool("read_paper", {
"paper_id": "2401.12345"
})๐ Research Prompts
The server offers specialized prompts to help analyze academic papers:
Paper Analysis Prompt
A comprehensive workflow for analyzing academic papers that only requires a paper ID:
result = await call_prompt("deep-paper-analysis", {
"paper_id": "2401.12345"
})This prompt includes:
Detailed instructions for using available tools (list_papers, download_paper, read_paper, search_papers)
A systematic workflow for paper analysis
Comprehensive analysis structure covering:
Executive summary
Research context
Methodology analysis
Results evaluation
Practical and theoretical implications
Future research directions
Broader impacts
Pro Prompt Pack
summarize_paper: concise structured summary for one paper.compare_papers: side-by-side technical comparison across paper IDs.literature_review: thematic synthesis across a topic and optional paper set.
โ๏ธ Configuration
Configure through environment variables:
Variable | Purpose | Default |
| Paper storage location | ~/.arxiv-mcp-server/papers |
๐งช Testing
Run the test suite:
python -m pytest๐งช Experimental Features
These features are not yet fully tested and may behave unexpectedly. Use with caution.
The following tools require additional dependencies and are under active development:
uv pip install -e ".[pro]"Semantic Search
Semantic similarity search over your locally downloaded papers only. Returns empty results if no papers have been downloaded yet. Requires [pro] dependencies.
result = await call_tool("semantic_search", {
"query": "test-time adaptation in multimodal transformers",
"max_results": 5
})
# or find papers similar to a known paper:
result = await call_tool("semantic_search", {
"paper_id": "2404.19756",
"max_results": 5
})Citation Graph
Fetch references and citing papers via Semantic Scholar. Works on any arXiv ID โ no local download required.
result = await call_tool("citation_graph", {
"paper_id": "2401.12345"
})Research Alerts
Save topic watches and poll for newly published papers since the last check. Uses the same query syntax as search_papers.
# Register a watch (idempotent โ calling again updates the existing watch)
await call_tool("watch_topic", {
"topic": "\"multi-agent reinforcement learning\"",
"categories": ["cs.AI", "cs.LG"],
"max_results": 10
})
# Check all watches โ returns only papers published since last check
result = await call_tool("check_alerts", {})
# Check a single watch
result = await call_tool("check_alerts", {"topic": "\"multi-agent reinforcement learning\""})Advanced Prompts
summarize_paper, compare_papers, and literature_review for deeper research workflows. Requires [pro] dependencies.
๐ License
Released under the MIT License. See the LICENSE file for details.
Made with โค๏ธ by the Pearl Labs Team
Appeared in Searches
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/blazickjp/arxiv-mcp-server'
If you have feedback or need assistance with the MCP directory API, please join our Discord server