Enables searching, retrieving metadata, and downloading PDFs from arXiv's repository of physics, mathematics, computer science, and other scientific preprints
Provides access to PubMed's biomedical and life sciences literature database, enabling search, metadata retrieval, and download of open access papers
Integrates with Semantic Scholar's AI-powered academic search engine to find papers across disciplines, analyze citations, evaluate paper impact, and recommend related research
Academic MCP Server
π A unified Model Context Protocol (MCP) server that provides AI assistants access to multiple academic databases through a single, consistent interface.
π Features
Supported Databases
PubMed π₯ - Biomedical and life sciences literature (NCBI)
bioRxiv 𧬠- Biology preprints
medRxiv π - Medical preprints
arXiv π¬ - Physics, mathematics, computer science, and more
Semantic Scholar π€ - AI-powered academic search across disciplines
Sci-Hub π - Comprehensive academic paper access and download
Core Capabilities
β Unified Search: Search across all databases with a single query
β Advanced Filtering: Filter by title, author, date, journal, and more
β Metadata Access: Retrieve detailed paper information
β PDF Download: Download open access papers when available
β Deep Analysis: Generate comprehensive paper analysis prompts
β Local PDF Analysis: Support for both local and online PDF file analysis
β Citation Network Analysis: Analyze paper citation relationships and impact
β Complete Research Workflow: One-click retrieveβanalyzeβreadβsummarize
β Standardized Output: Consistent data format across all sources
π Quick Start
Prerequisites
Python 3.10+
MCP library
Internet connection
Installation
β Already Installed! Your Academic MCP Server is fully configured and ready to use.
If you need to set it up on another machine:
Clone or download this repository:
cd Academic-MCP-ServerCreate a virtual environment:
python -m venv venvActivate the virtual environment:
Windows:
venv\Scripts\activateMac/Linux:
source venv/bin/activate
Install dependencies:
pip install -r requirements.txt
Note: All PubMed functionality is integrated locally. No external dependencies required!
Configuration for Cursor
This project provides TWO MCP servers with complementary features:
academic- Basic search, metadata retrieval, and PDF downloads across 6 databases (PubMed, bioRxiv, medRxiv, arXiv, Semantic Scholar, Sci-Hub)academic-research- Advanced features including citation analysis, paper impact evaluation, local PDF analysis, and complete research workflows
Add this configuration to your MCP settings file (~/.cursor/mcp.json or C:\Users\YOUR_USERNAME\.cursor\mcp.json):
Windows:
Mac/Linux:
Note: Replace YOUR_USERNAME and path/to with your actual paths.
π Usage
Search Papers
Search across all databases:
Search specific database:
Advanced Search
PubMed-specific advanced search:
Get Paper Metadata
Download PDF
List Available Sources
Deep Paper Analysis
π MCP Tools Reference
Server: academic (Basic Search & Retrieval)
1. search_papers
Search for papers using keywords.
Parameters:
keywords(str): Search querysource(str): "all", "pubmed", "biorxiv", "medrxiv", "arxiv", "semantic_scholar", or "scihub"num_results(int): Number of results per source (default: 10)
2. search_papers_advanced
Advanced search with multiple filters.
Parameters:
title(str, optional): Search in titlesauthor(str, optional): Author namejournal(str, optional): Journal namestart_date(str, optional): Start dateend_date(str, optional): End dateterm(str, optional): General search termsource(str): Database sourcenum_results(int): Number of results
3. get_paper_metadata
Get detailed metadata for a specific paper.
Parameters:
identifier(str): Paper ID (PMID, DOI, arXiv ID, etc.)source(str): Database source
4. download_paper_pdf
Download PDF for a paper.
Parameters:
identifier(str): Paper IDsource(str): Database source
5. list_available_sources
List all available databases.
6. deep_paper_analysis
Generate comprehensive analysis prompt.
Parameters:
identifier(str): Paper IDsource(str): Database source
Server: academic-research (Advanced Analysis & Research)
1. analyze_citation_network
Analyze paper's citation network.
Parameters:
paper_id(str): Paper identifier (DOI, PMID, etc.)source(str): Data source (default: "semantic_scholar")max_depth(int): Network depth 1-3 layers (default: 2)
2. evaluate_paper_impact
Evaluate academic impact of a paper.
Parameters:
paper_id(str): Paper identifiersource(str): Data source (default: "semantic_scholar")
3. recommend_related_papers
Recommend related papers using multiple strategies.
Parameters:
paper_id(str): Source paper identifiersource(str): Data source (default: "semantic_scholar")num_recommendations(int): Number of recommendations (default: 10)strategy(str): "comprehensive", "citations", "similar", or "influential"
4. research_workflow_complete
β Recommended Core Feature - Complete research workflow: retrieve β analyze β read β summarize
Parameters:
topic(str): Research topic (e.g., "CRISPR gene editing")num_papers(int): Number of papers to retrieve (default: 5)include_analysis(bool): Include deep analysis (default: true)include_summary(bool): Include auto-summary (default: true)
5. analyze_local_paper
Comprehensively analyze local or online PDF papers.
Parameters:
pdf_path(str): PDF file path (local or URL)include_figures(bool): Analyze figures (default: true)include_summary(bool): Generate summary (default: true)
6. list_all_figures
List all figures from a PDF paper.
Parameters:
pdf_path(str): PDF file path (local or URL)
7. explain_specific_figure
Explain a specific figure from a PDF.
Parameters:
pdf_path(str): PDF file path (local or URL)figure_number(int): Figure number (e.g., 1, 2, 3)provide_context(bool): Include context paragraphs (default: true)
8. extract_text_from_pdf
Extract text content from PDF (supports both local and online URLs).
Parameters:
pdf_path(str): PDF path (local or URL)extract_sections(bool): Whether to extract by sectionspage_range(tuple, optional): Page range, e.g., (1, 10) for pages 1-10
9. batch_analyze_local_papers
Batch analyze all PDF papers in a folder (local folders only).
Parameters:
folder_path(str): Folder pathmax_papers(int): Maximum number of papers to analyze (default: 10)file_pattern(str): File matching pattern (default: "*.pdf")
10. compare_papers
Compare multiple papers.
Parameters:
paper_ids(list): List of paper IDs to compare (2-5 papers)comparison_aspects(list, optional): Comparison dimensions - "methodology", "findings", "impact", "timeline"
11. extract_key_information
Extract key information from papers.
Parameters:
paper_id(str): Paper identifiersource(str): Data source (default: "semantic_scholar")info_types(list, optional): List of information types to extract"methodology": Research methods
"findings": Main findings
"limitations": Study limitations
"datasets": Used datasets
"metrics": Evaluation metrics
"contributions": Main contributions
12. generate_paper_summary
Automatically generate paper summaries.
Parameters:
paper_id(str): Paper identifiersource(str): Data source (default: "semantic_scholar")summary_type(str): Summary type"brief": Brief summary (100-200 words)
"comprehensive": Comprehensive summary (500-800 words)
"technical": Technical details summary
"layman": Easy-to-understand version
13. extract_pdf_fulltext
Extract full text content from PDF.
Parameters:
pdf_url(str): PDF file URLextract_sections(bool): Whether to identify and extract sections (default: true)
π Standardized Output Format
All search results return papers in this standardized format:
Semantic Scholar results include additional fields:
citation_count: Number of citationsreference_count: Number of referencesfields_of_study: Research areas
π§ Architecture
Dual Server Design
This project provides two complementary MCP servers:
academic_server.py- Core search and retrieval functionalityacademic_research_advanced.py- Advanced analysis and research workflows
Project Structure
Adapter Pattern
Each database is wrapped in an adapter that implements a common interface:
Adding New Databases
To add a new database:
Create a new adapter in
adapters/Inherit from
BaseAdapterImplement all required methods
Register in
academic_server.py
Example:
π― Use Cases
For Researchers
Search across multiple preprint servers simultaneously
Find papers by specific authors or topics
Download open access papers automatically
Generate literature review materials
Analyze local PDF collections
Perform comprehensive citation network analysis
Generate automated paper summaries
For AI Assistants
Access comprehensive academic knowledge
Provide up-to-date research information
Help with citation and reference management
Analyze research trends and findings
Process and explain figures from academic papers
Conduct complete research workflows automatically
β οΈ Limitations & Notes
API Rate Limits
PubMed: No API key required, but rate-limited
bioRxiv/medRxiv: No authentication required
arXiv: Rate-limited (1 request per 3 seconds recommended)
Semantic Scholar: Free tier has rate limits; get API key for higher limits at https://www.semanticscholar.org/product/api
Sci-Hub: No authentication required; use responsibly
PDF Availability
PubMed: Only PMC open access articles
bioRxiv/medRxiv: All articles are open access
arXiv: All articles are open access
Semantic Scholar: Depends on publisher policies
Sci-Hub: Wide coverage of academic papers (use for research purposes only)
Local PDF Support
Full text extraction: Extract complete text from local or online PDFs
Figure analysis: List and explain figures from PDF papers
Section parsing: Automatically identify and extract paper sections
Batch processing: Analyze multiple PDFs in a folder simultaneously
Date Formats
PubMed:
YYYY/MM/DDOthers:
YYYY-MM-DD
π€ Contributing
Contributions are welcome! Feel free to:
Add new database adapters
Improve existing functionality
Fix bugs
Enhance documentation
π License
This project builds upon the PubMed-MCP-Server and follows similar open-source principles.
π Acknowledgments
PubMed-MCP-Server for the original PubMed integration
NCBI E-utilities
bioRxiv/medRxiv API
arXiv API
Semantic Scholar API
Sci-Hub MCP Server (JackKuo666/Sci-Hub-MCP-Server)
FastMCP framework
β οΈ Disclaimer
The Sci-Hub integration is provided for research and educational purposes only. Users are responsible for complying with copyright laws and institutional policies in their jurisdiction. The authors do not endorse or encourage copyright infringement. Please support publishers and authors by obtaining papers through legitimate channels when possible.
π Project Statistics
Supported Databases: 6 (PubMed, bioRxiv, medRxiv, arXiv, Semantic Scholar, Sci-Hub)
MCP Servers: 2 (academic, academic-research)
Basic MCP Tools: 6
Advanced Research Tools: 15+
Lines of Code: ~3,000
Supported Formats: PDF, metadata, citations, full-text analysis
PDF Support: Both local files and online URLs
π Enhanced Features
Advanced Research Capabilities
Citation Network Analysis: Understand paper relationships and impact
Automated Summarization: Generate summaries in multiple styles
Key Information Extraction: Extract methodology, findings, limitations
Complete Research Workflows: One-click research from topic to summary
PDF Processing
Local and Online Support: Process PDFs from local storage or URLs
Figure Explanation: AI-powered figure analysis and explanation
Section Recognition: Automatic identification of paper sections
Batch Analysis: Process multiple papers simultaneously
Smart Search Features
Concurrent Database Search: Search all databases simultaneously
Intelligent Result Merging: Deduplicate and rank results
Advanced Filtering: Multi-parameter search with date ranges
Source-Specific Optimization: Tailored search for each database
π Support
For issues or questions:
Check the documentation above
Review error messages in logs
Ensure all dependencies are installed
Verify your MCP configuration
Happy researching! ππ¬