arxiv-personalized
Provides tools for searching arXiv papers with Chinese query support, including translation, keyword extraction, and relevance ranking.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@arxiv-personalized找关于旋转位置编码的论文"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Personalized arXiv Research Paper MCP Service
A full-pipeline service that transforms Chinese queries into ranked arXiv paper recommendations, exposed as an MCP (Model Context Protocol) endpoint for AI agent integration.
Chinese Query → EN Translation → Keyword Extraction → arXiv Search → Semantic Reranking → MCP EndpointProject Structure
arxiv_mcp/
├── big_model.py # Talker class: translation + keyword extraction (DeepSeek API)
├── arxiv_server.py # FastAPI service (/score + /query, port 5216)
├── mcp_wrapper.py # fastapi-mcp wrapper exposing API as MCP SSE endpoint (port 8000)
├── quick_test.py # Quick functional test (no GPU / reranker required)
├── requirements.txt
└── README.mdRelated MCP server: kivv
Quick Start
1. Install Dependencies
pip install -r requirements.txtGPU users: install a CUDA-compatible PyTorch from pytorch.org for faster reranking.
2. Configure API Key
This project uses the DeepSeek API (OpenAI-compatible) for translation and keyword extraction. Set your API key via environment variable:
export DEEPSEEK_API_KEY="sk-xxxxxxxxxxxxxxxx"Alternatively, you can pass the key directly when creating a Talker instance:
from big_model import Talker
talker = Talker(api_key="sk-xxxxxxxxxxxxxxxx")3. Quick Functional Test (No GPU)
python quick_test.pyValidates translation and arXiv search without loading the reranker model.
4. Start the API Service
python arxiv_server.py
# or
uvicorn arxiv_server:app --host 127.0.0.1 --port 5216Swagger docs: http://127.0.0.1:5216/docs
5. Start the MCP Service
python mcp_wrapper.pyMCP SSE endpoint: http://127.0.0.1:8000/mcp
API Reference
POST /score — Relevance Scoring
Computes normalized relevance scores (0–1) for a query against multiple passages using BAAI/bge-reranker-v2-m3.
Request:
{
"query": "position embedding in transformer",
"passages": [
"We propose a novel rotary position embedding...",
"This paper introduces a new attention mechanism..."
]
}Response:
{
"scores": [0.9821, 0.6712]
}POST /query — Personalized Paper Search
End-to-end pipeline: Chinese query → translation → keyword extraction → arXiv search → reranking.
Request:
{
"query": "注意力机制中的旋转位置编码",
"max_results": 5
}Response:
{
"query_original": "注意力机制中的旋转位置编码",
"query_english": "Rotary Position Embedding in Attention Mechanisms",
"query_terms": "Rotary Position Embedding, Attention Mechanism, Transformer",
"papers_sorted": [
{
"paper_id": "2410.12345",
"title": "RoPE: Rotary Position Embedding for Transformers",
"summary_english": "We propose a novel method...",
"summary_chinese": "我们提出了一种新颖的方法...",
"authors": ["Author A", "Author B"],
"pdf_url": "https://arxiv.org/pdf/2410.12345",
"links": ["https://arxiv.org/abs/2410.12345"],
"relevance_score": 0.9821
}
]
}Claude Desktop Integration
Add to ~/.claude/claude_desktop_config.json:
{
"mcpServers": {
"arxiv-personalized": {
"url": "http://127.0.0.1:8000/mcp"
}
}
}Restart Claude Desktop to use query_papers and compute_similarity tools directly in conversations.
Architecture
User Chinese Query
│
▼
[big_model.Talker]
├─ trans_cn2en() CN → EN translation
└─ extract_key_word() Academic keyword extraction (JSON)
│
▼
[arxiv.Client] arXiv paper search
│
▼
[big_model.Talker]
└─ trans_en2cn() Abstract EN → CN translation
│
▼
[FlagReranker] bge-reranker-v2-m3 semantic reranking
│
▼
[FastAPI /query] Returns ranked paper list
│
▼
[fastapi-mcp /mcp] MCP SSE endpoint for AI agent consumptionDependencies
Package | Purpose |
| DeepSeek API client (OpenAI-compatible) |
| arXiv API Python client |
| BGE reranker model for semantic scoring |
| REST API server |
| MCP protocol adapter |
| Deep learning runtime for reranker |
| Numerical computation |
Notes
The reranker model
BAAI/bge-reranker-v2-m3(~1.1 GB) is auto-downloaded from HuggingFace on first run. SetHF_TOKENor configure a mirror for faster downloads.Without a GPU, the reranker falls back to CPU (3–10 seconds per scoring batch).
arXiv API imposes rate limits; keep
max_results≤ 10 for reliable operation.
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/returnSGD/arxiv_MCP'
If you have feedback or need assistance with the MCP directory API, please join our Discord server