Converts HTML documentation to markdown format and saves the converted documentation to a specified output path
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Doc Scraper MCP Serverscrape the Python docs from docs.python.org/3/tutorial/index.html and save as tutorial.md"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Doc Scraper MCP Server
A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai's conversion service.
Features
Scrapes documentation from any web URL
Converts HTML documentation to markdown format
Saves the converted documentation to a specified output path
Integrates with the Model Context Protocol (MCP)
Related MCP server: Markdownify MCP Server - UTF-8 Enhanced
Installation
Installing via Smithery
To install Doc Scraper for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claudeClone the repository:
git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraperCreate and activate a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows, use: venv\Scripts\activateInstall the dependencies:
pip install -e .Usage
The server can be run using Python:
python -m mcp_doc_scraperTool Description
The server provides a single tool:
Name:
scrape_docsDescription: Scrape documentation from a URL and save as markdown
Input Parameters:
url: The URL of the documentation to scrapeoutput_path: The path where the markdown file should be saved
Project Structure
doc_scraper/
├── __init__.py
├── __main__.py
└── server.pyDependencies
aiohttp
mcp
pydantic
Development
To set up the development environment:
Install development dependencies:
pip install -r requirements.txtThe server uses the Model Context Protocol. Make sure to familiarize yourself with MCP documentation.
License
MIT License
This server cannot be installed
Resources
Looking for Admin?
Admins can modify the Dockerfile, update the server description, and track usage metrics. If you are the server author, to access the admin panel.