Deep Research Agent MCP Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Deep Research Agent MCP ServerResearch the latest breakthroughs in quantum computing"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
๐ Deep Research Agent MCP Server
An AI-powered deep research agent built with Python, FastMCP, and Streamlit.
Search โ Fetch โ Cluster โ Report. Fully automated. Fully open source.
๐ Overview
Deep Research Agent is a Python MCP server that automates the full research pipeline from web search to a structured executive report in under 10 seconds. It uses a 4-tool pipeline, powered by the Tavily Search API for retrieval and a pure-Python TF-IDF + K-Means engine for semantic clustering. The frontend is a sleek Streamlit chat interface that shows every step live.
๐ก Built as a portfolio project demonstrating: FastMCP tool design, async web scraping, NLP clustering without heavy ML dependencies, and full-stack Python app architecture.
โจ Features
๐ Multi-angle web search: 3 query variations per topic for broader coverage
๐ Async URL fetching: parallel page retrieval with smart HTML cleaning
๐งน Text denoising: strips HTML entities, SVG labels, nav boilerplate, repeated patterns
๐ง Semantic clustering: pure-Python TF-IDF + K-Means, zero ML framework required
๐ท๏ธ Auto cluster labeling: 13 topic categories (Quantum, Biotech, Climate, AI, Policy...)
๐ Structured reports: markdown or JSON output with sources, keywords, evidence
โฌ๏ธ One-click download: export
.mdreport directly from the UIโก Fast: full pipeline typically completes in 6โ12 seconds
๐๏ธ Architecture
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ Streamlit Frontend โ
โ app.py โ Chat UI โ
โ Live step cards ยท Download button โ
โโโโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ FastMCP Client (protocol-aware)
โโโโโโโโโโโโโโโโโโโโโผโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ FastMCP Server (server.py) โ
โ โ
โ ๐ search_web ๐ fetch_and_chunk โ
โ Tavily API httpx + HTML parser โ
โ โ
โ ๐ง cluster_findings ๐ generate_report โ
โ TF-IDF + K-Means Markdown / JSON โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ๐ ๏ธ Tech Stack
Layer | Technology |
๐ Language | Python 3.12+ |
๐ MCP Framework | FastMCP 3.x |
๐ฅ๏ธ Frontend | Streamlit |
๐ Search | Tavily Search API |
๐ HTTP Client | httpx (async) |
๐ง Clustering | Pure-Python TF-IDF + K-Means |
โ๏ธ Config | python-dotenv |
๐ Quick Start
1๏ธโฃ Clone the repository
git clone https://github.com/yourusername/deep-research-agent.git
cd deep-research-agent2๏ธโฃ Install dependencies
pip install -r requirements.txt3๏ธโฃ Configure environment
cp .env.example .envEdit .env and add your Tavily API key:
TAVILY_API_KEY=tvly-your-key-here
MCP_HOST=localhost
MCP_PORT=8000
TAVILY_SEARCH_DEPTH=advanced๐ Get a free Tavily API key at app.tavily.com
4๏ธโฃ Start the MCP server
# Terminal 1
python server.pyYou should see:
๐ Deep Research Agent MCP Server
Host : localhost
Port : 8000
Tools: search_web, fetch_and_chunk, cluster_findings, generate_report
Tavily key: โ set5๏ธโฃ Launch the Streamlit UI
# Terminal 2
streamlit run app.pyOpen http://localhost:8501 and start researching! ๐
๐ง MCP Tools Reference
๐ search_web(query, max_results)
Searches the web via Tavily API and returns ranked results with scores.
search_web(
query="AI energy crisis 2026",
max_results=8, # 1โ15
include_domains=None, # e.g. [".edu", ".gov"]
exclude_domains=None
)๐ fetch_and_chunk(urls, chunk_size)
Fetches pages asynchronously and splits content into overlapping text chunks.
fetch_and_chunk(
urls=["https://example.com/article"],
chunk_size=400, # words per chunk
chunk_overlap=50, # overlap between chunks
max_chunks_per_url=6
)๐ง cluster_findings(chunks, n_clusters)
Groups chunks into semantic themes using TF-IDF vectorization + K-Means clustering.
cluster_findings(
chunks=[...], # from fetch_and_chunk
n_clusters=4, # 2โ6 themes
top_terms_per_cluster=8
)๐ generate_report(topic, clusters)
Synthesizes all clusters into a structured research report.
generate_report(
topic="AI energy crisis 2026",
clusters=[...], # from cluster_findings
format="markdown", # or "json"
include_sources=True
)๐ Research Pipeline
๐ User enters topic
โ
โผ
๐ search_web() ร 3 queries โโโโโโโโโโโบ 24 ranked sources
โ
โผ
๐ fetch_and_chunk() on top 5 URLs โโโโบ 20โ30 text chunks
โ
โผ
๐ง cluster_findings() โโโโโโโโโโโโโโโโโบ 4 semantic themes
โ
โผ
๐ generate_report() โโโโโโโโโโโโโโโโโโบ Structured .md report
โ
โผ
โฌ๏ธ Download / Display in UI๐ Project Structure
deep-research-agent/
โ
โโโ ๐ server.py # FastMCP server โ all 4 tools
โโโ ๐ฅ๏ธ app.py # Streamlit frontend
โโโ ๐ requirements.txt # Python dependencies
โโโ ๐ .env.example # Environment variable template
โโโ ๐ README.md # This fileโ๏ธ Configuration
Variable | Default | Description |
| (required) | Your Tavily search API key |
|
| MCP server host |
|
| MCP server port |
|
|
|
๐ค Contributing
Contributions are welcome and appreciated! Here's how to get involved:
๐ Reporting Bugs
Check the Issues page to see if it's already reported
Open a new issue with:
A clear title and description
Steps to reproduce
Expected vs actual behaviour
Your Python version and OS
๐ก Suggesting Features
Open an issue with the enhancement label and describe:
The problem you're trying to solve
Your proposed solution
Why would it benefit other users
๐ง Submitting Pull Requests
Fork the repository
Create a feature branch
git checkout -b feature/your-feature-nameMake your changes with clear, descriptive commits
git commit -m "feat: add BM25 ranking to cluster_findings"Test your changes thoroughly
Push to your fork
git push origin feature/your-feature-nameOpen a Pull Request with a clear description of what you changed and why
๐ Code Style
Follow PEP 8 for Python code
Use type hints wherever possible
Add docstrings to all new functions
Keep functions focused โ one responsibility per function
๐ฑ Good First Issues
Looking for a place to start? Check issues tagged
good first issue:Adding more cluster label categories to
_infer_cluster_label()Improving the HTML cleaning regex patterns
Adding a progress bar to the Streamlit UI
Supporting additional output formats (PDF, DOCX)
Writing unit tests for the clustering functions
๐ License
MIT License
Copyright (c) 2026 Deep Research Agent Contributors
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including, without limitation, the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES, OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT, OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.๐ Acknowledgements
FastMCP โ Python MCP server framework
Tavily โ AI-optimised search API
Streamlit โ Python web app framework
httpx โ Async HTTP client
Made with โค๏ธ and Python ยท Star โญ this repo if you found it useful!
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/melisasvr/Deep-Research-Agent-MCP-Server'
If you have feedback or need assistance with the MCP directory API, please join our Discord server