Provides tools for searching arXiv papers, retrieving paper metadata (titles, abstracts, authors, categories), accessing full paper content, and browsing recent publications by category.
arXiv MCP Server
A Model Context Protocol (MCP) server that provides arXiv paper search and retrieval capabilities. This server enables LLMs to search for academic papers on arXiv and get cleaned titles, abstracts, authors, and content without dealing with complex HTML parsing.
Features
Search papers by query, author, category, and date
Advanced search with specific field filters
Get detailed paper metadata (title, abstract, authors, categories)
Retrieve full paper content via Jina Reader
Browse recent papers by category
List all arXiv categories
Pagination support for search results
Available Tools
search
Search arXiv for papers matching a query.
Argument | Type | Required | Description |
| string | Yes | Search query (e.g., 'LLM', 'transformer') |
| string | No | Filter by category (e.g., 'cs.AI', 'cs.LG') |
| string | No | Filter by author name |
| string | No | Sort order: 'relevance', 'date_desc', 'date_asc' |
| int | No | Page number (default: 1) |
| int | No | Results per page, max 50 (default: 25) |
searchAdvanced
Advanced search with specific field filters.
Argument | Type | Required | Description |
| string | No | Search in paper titles |
| string | No | Search in abstracts |
| string | No | Search by author name |
| string | No | Filter by category |
| string | No | Search by arXiv ID pattern |
| string | No | Start date (YYYY-MM-DD) |
| string | No | End date (YYYY-MM-DD) |
| string | No | Sort order |
| int | No | Page number |
| int | No | Results per page |
getPaper
Get detailed information about a specific arXiv paper.
Argument | Type | Required | Description |
| string | Yes | arXiv ID (e.g., '2301.00001') or full URL |
getContent
Get the full text content of an arXiv paper using Jina Reader.
Argument | Type | Required | Description |
| string | Yes | arXiv ID or full URL |
getRecent
Get recent papers from a specific arXiv category.
Argument | Type | Required | Description |
| string | No | Category code (default: 'cs.AI') |
| int | No | Number of papers, max 50 (default: 10) |
listCategories
List all common arXiv categories with their codes and names.
Installation
Using uv (Recommended)
Using pip
Configuration
Claude Desktop
Add to your Claude Desktop configuration (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):
Claude Code
Add to your Claude Code MCP settings:
Usage Examples
Search for papers about LLMs
Find papers by a specific author
Get paper details
Browse recent papers
Development
Run tests
Run the server locally
Common arXiv Categories
Code | Name |
cs.AI | Artificial Intelligence |
cs.CL | Computation and Language |
cs.CV | Computer Vision and Pattern Recognition |
cs.LG | Machine Learning |
cs.NE | Neural and Evolutionary Computing |
stat.ML | Machine Learning (Statistics) |
Use listCategories tool to get the full list.
Changelog
v0.2.0
Breaking Changes:
Renamed entry point from
arxiv-server.pytoarxiv-mcpcommandRenamed
gettool togetContentfor clarity
New Features:
searchAdvanced- Advanced search with title, abstract, date range filtersgetPaper- Get detailed paper metadata (authors, categories, dates, PDF URL)getRecent- Browse recent papers by categorylistCategories- List 33 common arXiv categoriesPagination support (
page,page_sizeparameters)Sort options (
relevance,date_desc,date_asc)Filter by author and category in basic search
Improvements:
Migrated to
pyproject.tomlwith uv for dependency managementReplaced
requestswithhttpx(async-ready)Added Pydantic models for type-safe data structures
Reduced dependencies from 33 to 4 core packages
Added proper timeout handling (30s)
Modular project structure (
src/arxiv_mcp/)
v0.1.0
Initial release
Basic
searchandgettools
License
MIT License - see LICENSE for details.