Deep Research MCP Server

by Hajime-Y
Apache 2.0
4
  • Linux
  • Apple

Integrations

  • Uses .env files for managing API keys and configuration variables

  • Integrates with GitHub repositories, including cloning from GitHub URLs as demonstrated in the installation instructions

  • Converts various file formats to Markdown through the mdconvert.py utility

Deep Research MCP Server

Deep Research is an agent-based tool that provides web search and advanced research capabilities. It leverages HuggingFace's smolagents and is implemented as an MCP server.

This project is based on HuggingFace's open_deep_research example.

Features

  • Web search and information gathering
  • PDF and document analysis
  • Image analysis and description
  • YouTube transcript retrieval
  • Archive site search

Requirements

  • Python 3.11 or higher
  • uv package manager
  • The following API keys:
    • OpenAI API key
    • HuggingFace token
    • SerpAPI key

Installation

  1. Clone the repository:
git clone https://github.com/Hajime-Y/deep-research-mcp.git cd deep-research-mcp
  1. Create a virtual environment and install dependencies:
uv venv source .venv/bin/activate # For Linux or Mac # .venv\Scripts\activate # For Windows uv sync

Environment Variables

Create a .env file in the root directory of the project and set the following environment variables:

OPENAI_API_KEY=your_openai_api_key HF_TOKEN=your_huggingface_token SERPER_API_KEY=your_serper_api_key

You can obtain a SERPER_API_KEY by signing up at Serper.dev.

Usage

Start the MCP server:

uv run deep_research.py

This will launch the deep_research agent as an MCP server.

Key Components

  • deep_research.py: Entry point for the MCP server
  • create_agent.py: Agent creation and configuration
  • scripts/: Various tools and utilities
    • text_web_browser.py: Text-based web browser
    • text_inspector_tool.py: File inspection tool
    • visual_qa.py: Image analysis tool
    • mdconvert.py: Converts various file formats to Markdown

License

This project is provided under the [License Name].

Acknowledgements

This project uses code from HuggingFace's smolagents and Microsoft's autogen projects.

-
security - not tested
A
license - permissive license
-
quality - not tested

An agent-based tool that provides web search and advanced research capabilities including document analysis, image description, and YouTube transcript retrieval.

  1. Features
    1. Requirements
      1. Installation
        1. Environment Variables
          1. Usage
            1. Key Components
              1. License
                1. Acknowledgements
                  ID: c18f8j7nfb