Search for:

Methods to Scrape a Website and Fetch Markdown Files from GitHub

  • Why this server?

    This server can retrieve web page content and convert HTML to markdown, which aligns with the user's need to '抓取整个文档网站'.

    A
    security
    A
    license
    A
    quality
    This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.
    1
    37,047
    JavaScript
    MIT License
  • Why this server?

    This server allows access to GitHub repositories, which can be used to retrieve markdown files as requested by the user.

    -
    security
    A
    license
    -
    quality
    A server that allows AI assistants to browse and read files from specified GitHub repositories, providing access to repository contents via the Model Context Protocol.
    3
    JavaScript
    MIT License
    • Apple
  • Why this server?

    This provides GitHub data analysis, useful if the user wants to grab markdown from a github repo, it may give some useful context for analysis

    A
    security
    F
    license
    A
    quality
    Provides GitHub data analysis for repositories, developers, and organizations, enabling insights into open source ecosystems through API calls and natural language queries.
    5
    2
    JavaScript
  • Why this server?

    This is a scraper tool which would allow for web scraping of documentation sites. It allows flexible options for parsing and rendering.

    A
    security
    A
    license
    A
    quality
    A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites.
    2
    14
    Python
    MIT License
    • Apple
    • Linux
  • Why this server?

    This provides RAG capabilities which are needed to retrieve documents, and provides the semantic document search that is helpful.

    -
    security
    A
    license
    -
    quality
    Provides RAG capabilities for semantic document search using Qdrant vector database and Ollama/OpenAI embeddings, allowing users to add, search, list, and delete documentation with metadata support.
    5
    4
    TypeScript
    Apache 2.0
  • Why this server?

    This will be useful to fetch web content. The MCP provides content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection.

    -
    security
    F
    license
    -
    quality
    A Model Context Protocol server that enables LLMs to fetch and process web content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection.
    TypeScript
    • Apple
  • Why this server?

    For more complete and in depth Git Repository operations, this server provides read, search, and manipulation Git repositories.

    -
    security
    A
    license
    -
    quality
    A Model Context Protocol server for Git repository interaction and automation. This server provides tools to read, search, and manipulate Git repositories via Large Language Models.
    12
    37,047
    JavaScript
    MIT License
  • Why this server?

    This will be able to access repository information and manage workflows.

    A
    security
    F
    license
    A
    quality
    An MCP server that enables integration with GitHub Enterprise API, allowing users to access repository information, manage issues, pull requests, workflows, and other GitHub features through Cursor.
    16
    139
    10
    TypeScript
    • Linux
    • Apple
  • Why this server?

    If the documentation site is on Google drive, this server enables listing, reading, and searching over files

    -
    security
    A
    license
    -
    quality
    Enables integration with Google Drive for listing, reading, and searching over files, supporting various file types with automatic export for Google Workspace files.
    1,327
    9
    JavaScript
    MIT License
  • Why this server?

    This allows interacting with web pages through accessibility snapshots, useful for documentation sites.

    A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
    21
    24,368
    8,158
    TypeScript
    Apache 2.0
    • Linux
    • Apple