McpDeepResearch

by Pb-207

🔍 McpDeepResearch

An MCP (Model Context Protocol) Server for Deep Academic Research

一个用于深度学术研究的 MCP 服务器


English | 简体中文


English

McpDeepResearch is a lightweight MCP (Model Context Protocol) server that helps you discover, retrieve, and read academic papers from the web through Google Scholar search.

✨ Features

  • search_scholar_papers – Google Scholar search with an optional year filter and sort-by-date

  • fetch_md – Convert any public web page to clean Markdown

  • fetch_paper – Auto-detect the paper content (title, abstract, body, references) and strip the rest

🛠️ Prerequisites

  • Python ≥ 3.10

  • Google Chrome/Chromium (for headless fetching via Chrome DevTools Protocol)

  • Environment variables

    export CDP_ENDPOINT="http://localhost:9222"   # Chrome debugging port
    export GOOGLE_PROXY="http://proxy:port"        # (optional) HTTP(S) proxy
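Before starting the server, it can help to confirm that Chrome is actually listening on the address in CDP_ENDPOINT. The sketch below is not part of McpDeepResearch; it only assumes that Chrome was started with remote debugging enabled, in which case it serves version information at the /json/version path. The function names are illustrative.

```python
import json
import os
import urllib.request


def cdp_version_url(endpoint: str) -> str:
    """Build the URL of Chrome's /json/version discovery endpoint."""
    return endpoint.rstrip("/") + "/json/version"


def check_cdp(endpoint: str, timeout: float = 3.0) -> dict:
    """Return Chrome's version info; raises OSError if Chrome is unreachable."""
    with urllib.request.urlopen(cdp_version_url(endpoint), timeout=timeout) as resp:
        return json.load(resp)


# Same default as the export line above; override via the environment.
ENDPOINT = os.environ.get("CDP_ENDPOINT", "http://localhost:9222")
print("Will probe:", cdp_version_url(ENDPOINT))
```

Calling check_cdp(ENDPOINT) with Chrome running should return a dictionary that includes a "Browser" entry; a connection error suggests Chrome is not running or is listening on a different port.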

⚙️ Quick Start

  1. Install via pip

    pip install 
  2. (Optional) Launch Chrome manually if it isn’t already running
    The server launches a Chrome instance automatically at startup. If that launch fails, start Chrome yourself with:

    google-chrome --remote-debugging-port=9222 --user-data-dir=/tmp/chrome-profile &
    
  3. Run the MCP server

    # Launch with SSE
    mcp-deep-research --transport sse --port 8000
    # Launch with STDIO
    mcp-deep-research --transport stdio

    The server exposes 3 read-only tools to any MCP-capable client.
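Many MCP clients register STDIO servers through a JSON file with an "mcpServers" section. The sketch below generates such an entry for the STDIO command shown above. The "mcpServers" layout is a convention used by several clients, not something this README specifies, and the entry name "deep-research" is hypothetical; check your client's documentation for the exact file location and supported keys.

```python
import json


def stdio_server_entry(name: str = "deep-research") -> dict:
    """Build a client config entry that launches the server over STDIO."""
    return {
        "mcpServers": {
            name: {  # hypothetical entry name; any label works
                "command": "mcp-deep-research",
                "args": ["--transport", "stdio"],
                # Passed through to the server process (see Prerequisites).
                "env": {"CDP_ENDPOINT": "http://localhost:9222"},
            }
        }
    }


print(json.dumps(stdio_server_entry(), indent=2))
```

The printed JSON can be merged into the client's configuration file; for SSE, clients usually take a URL instead of a command.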

🧑‍🎓 Example Workflow in a Chat-UI

  1. Search
    “Find recent papers on diffusion models after 2022.”
    search_scholar_papers("diffusion models", year=2022, sort_bd=True)

  2. Fetch
    Pick an interesting paper link from the results and call
    fetch_paper("https://arxiv.org/abs/2304.12345")

  3. Read
    The cleaned Markdown (title, abstract, full text) appears directly in the chat.
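Outside a chat UI, the same search-then-fetch workflow can be scripted with the official MCP Python SDK (pip install mcp). This is a rough sketch under several assumptions: the server is running with --transport sse --port 8000, the SSE endpoint lives at /sse (the SDK's usual default, not confirmed for this server), and the argument keys "query" and "url" are guesses. The script prints each tool's input schema first so the real parameter names can be checked.

```python
def sse_url(host: str = "localhost", port: int = 8000) -> str:
    """Assumed SSE endpoint; the /sse path is not confirmed for this server."""
    return f"http://{host}:{port}/sse"


# Assumed argument keys: "query" and "url" are guesses, while "year" and
# "sort_bd" come from the example calls above.
SEARCH_ARGS = {"query": "diffusion models", "year": 2022, "sort_bd": True}
PAPER_ARGS = {"url": "https://arxiv.org/abs/2304.12345"}


async def run_workflow(url: str, search_args: dict, paper_args: dict) -> str:
    """Search Google Scholar, then fetch one paper as Markdown."""
    # Imported lazily so the helpers above work without the SDK installed.
    from mcp import ClientSession
    from mcp.client.sse import sse_client

    async with sse_client(url) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()

            # Show the real parameter names before relying on the guesses.
            listing = await session.list_tools()
            for tool in listing.tools:
                print(tool.name, tool.inputSchema)

            search = await session.call_tool("search_scholar_papers", search_args)
            for item in search.content:
                print(getattr(item, "text", item))

            paper = await session.call_tool("fetch_paper", paper_args)
            return "\n".join(getattr(item, "text", "") for item in paper.content)
```

To run it, call asyncio.run(run_workflow(sse_url(), SEARCH_ARGS, PAPER_ARGS)) after importing asyncio. If a call fails with a validation error, adjust the argument keys to match the printed schemas.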

🔒 Security

  • 100% read-only; no writes, no uploads, no local file access.

  • All traffic respects the original site’s robots.txt.

  • Proxies can be configured to stay within institutional or regional firewalls.
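For readers unfamiliar with what respecting robots.txt involves, the check can be illustrated with Python's standard library. This is a generic sketch, not the server's actual implementation, which the README does not describe; the sample rules and URLs are made up.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text permits fetching the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Made-up rules: everything is allowed except paths under /private/.
SAMPLE = """\
User-agent: *
Disallow: /private/
"""

print(is_allowed(SAMPLE, "https://example.org/papers/1"))   # True
print(is_allowed(SAMPLE, "https://example.org/private/1"))  # False
```

A fetcher would normally download the site's /robots.txt once, cache the parsed rules, and skip any URL for which the check returns False.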

🤝 Contributing

PRs are welcome!


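简体中文 (Simplified Chinese)

McpDeepResearch is a lightweight yet full-featured MCP (Model Context Protocol) server that helps you quickly discover, fetch, and read academic literature on Google Scholar.

✨ Features

  • search_scholar_papers – Keyword search on Google Scholar, with an optional year filter and sort-by-date

  • fetch_md – Render any public web page as clean Markdown

  • fetch_paper – Extract the main paper content from a web page, removing noise such as ads and navigation bars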

🛠️ Prerequisites

  • Python ≥ 3.10

  • Google Chrome / Chromium (for headless fetching via CDP)

  • Environment variables

    export CDP_ENDPOINT="http://localhost:9222"   # Chrome debugging port
    export GOOGLE_PROXY="http://proxy:port"        # optional: HTTP(S) proxy

⚙️ Quick Start

  1. Install via pip

    pip install 
  2. (Optional) Launch Chrome manually if it is not already running
    The server starts a Chrome instance automatically at startup. If that fails, launch one manually:

    google-chrome --remote-debugging-port=9222 --user-data-dir=/tmp/chrome-profile &
  3. Start the MCP server

    # Start with SSE
    mcp-deep-research --transport sse --port 8000
    # Start with STDIO
    mcp-deep-research --transport stdio

    The server exposes 3 read-only tools.

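🧑‍🎓 Typical Workflow in a Chat UI

  1. Search
    “Find papers on diffusion models from after 2022.”
    search_scholar_papers("diffusion models", year=2022, sort_bd=True)

  2. Fetch
    Pick a paper link from the results and call
    fetch_paper("https://arxiv.org/abs/2304.12345")

  3. Read
    The cleaned Markdown (title, abstract, full text) is displayed directly in the chat window.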

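🔒 Security

  • Fully read-only: no modifications, no uploads, no writes to local files.

  • All requests respect the target site's robots.txt.

  • A proxy can be configured to comply with campus or corporate network security policies.

🤝 How to Contribute

PRs are welcome!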


