Skip to main content
Glama

Doc Scraper MCP Server

Doc Scraper MCP Server

A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai's conversion service.

Features

  • Scrapes documentation from any web URL
  • Converts HTML documentation to markdown format
  • Saves the converted documentation to a specified output path
  • Integrates with the Model Context Protocol (MCP)

Installation

Installing via Smithery

To install Doc Scraper for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude
  1. Clone the repository:
git clone https://github.com/askjohngeorge/mcp-doc-scraper.git cd mcp-doc-scraper
  1. Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # On Windows, use: venv\Scripts\activate
  1. Install the dependencies:
pip install -e .

Usage

The server can be run using Python:

python -m mcp_doc_scraper

Tool Description

The server provides a single tool:

  • Name: scrape_docs
  • Description: Scrape documentation from a URL and save as markdown
  • Input Parameters:
    • url: The URL of the documentation to scrape
    • output_path: The path where the markdown file should be saved

Project Structure

doc_scraper/ ├── __init__.py ├── __main__.py └── server.py

Dependencies

  • aiohttp
  • mcp
  • pydantic

Development

To set up the development environment:

  1. Install development dependencies:
pip install -r requirements.txt
  1. The server uses the Model Context Protocol. Make sure to familiarize yourself with MCP documentation.

License

MIT License

-
security - not tested
F
license - not found
-
quality - not tested

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

Converts web-based documentation into markdown format using jina.ai's conversion service, allowing users to scrape documentation from any URL and save it as markdown files.

  1. Features
    1. Installation
      1. Installing via Smithery
    2. Usage
      1. Tool Description
    3. Project Structure
      1. Dependencies
        1. Development
          1. License

            Related MCP Servers

            • A
              security
              A
              license
              A
              quality
              Converts various file types and web content to Markdown format. It provides a set of tools to transform PDFs, images, audio files, web pages, and more into easily readable and shareable Markdown text.
              Last updated -
              10
              11
              1,966
              TypeScript
              MIT License
              • Apple
            • A
              security
              A
              license
              A
              quality
              This server converts webpages into clean, structured Markdown optimized for language model consumption, removing unnecessary content and supporting JavaScript rendering.
              Last updated -
              1
              10
              JavaScript
              MIT License
              • Apple
            • A
              security
              A
              license
              A
              quality
              A document conversion server that transforms various file formats (PDFs, documents, images, audio, web content) to Markdown with improved multilingual and UTF-8 support.
              Last updated -
              10
              2
              9
              TypeScript
              MIT License
              • Linux
              • Apple
            • A
              security
              F
              license
              A
              quality
              A server that converts various file formats (PDF, images, Office documents, etc.) to Markdown descriptions using Cloudflare AI services.
              Last updated -
              1
              11
              33
              JavaScript
              • Linux
              • Apple

            View all related MCP servers

            MCP directory API

            We provide all the information about MCP servers via our MCP API.

            curl -X GET 'https://glama.ai/api/mcp/v1/servers/askjohngeorge/mcp-doc-scraper'

            If you have feedback or need assistance with the MCP directory API, please join our Discord server