Search for: Information on Web Scraping

  • Why this server?

    This server scans and analyzes web content by fetching pages and extracting information from them, which is directly relevant to scraping.

    Security: A, License: A, Quality: A
    Enables web content scanning and analysis by fetching, analyzing, and extracting information from web pages, with tools for page fetching, link extraction, site crawling, and more.
    Last updated -
    6
    3
    TypeScript
    MIT License
  • Why this server?

    This server uses the UseScraper API to provide web scraping capabilities, allowing users to extract content from webpages in various formats.

    Security: A, License: A, Quality: A
    A TypeScript-based MCP server utilizing the UseScraper API to provide web scraping capabilities, allowing users to extract content from webpages in various formats.
    Last updated -
    1
    1
    JavaScript
    MIT License
    • Apple
  • Why this server?

    This server integrates with FireCrawl for advanced web scraping capabilities.

    Security: A, License: A, Quality: A
    A Model Context Protocol (MCP) server implementation that integrates with FireCrawl for advanced web scraping capabilities.
    Last updated -
    9
    15,275
    2,745
    JavaScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    This server extracts and transforms webpage content into clean, LLM-optimized Markdown, which is often needed after scraping raw HTML.

    Security: A, License: A, Quality: A
    Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns the article title, main content, excerpt, byline, and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers, and other non-essential elements while preserving the core content structure.
    Last updated -
    1
    4
    11
    MIT License
  • Why this server?

    This server enables LLMs to retrieve and process content from web pages, converting HTML to Markdown, which is useful for scraping and post-processing.

    Security: A, License: A, Quality: A
    This server enables LLMs to retrieve and process content from web pages, converting HTML to Markdown for easier consumption.
    Last updated -
    1
    43,205
    JavaScript
    MIT License
    • Linux
    • Apple
  • Why this server?

    This is a powerful MCP server for fetching and transforming web content into various formats (HTML, JSON, Markdown, Plain Text), which makes it useful for different scraping scenarios.

    Security: A, License: A, Quality: A
    A powerful MCP server for fetching and transforming web content into various formats (HTML, JSON, Markdown, Plain Text) with ease.
    Last updated -
    4
    146
    12
    TypeScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    This server enables web browsing capabilities using BeautifulSoup4, which is a common tool for web scraping.

  • Why this server?

    This server acts as a web browser for LLMs and RAG pipelines, similar to web search in ChatGPT, meaning it can be used to scrape web content.

    Security: A, License: A, Quality: A
    Implementation of an MCP server for the RAG Web Browser Actor. This Actor serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT.
    Last updated -
    1
    330
    77
    JavaScript
    Apache 2.0
    • Apple
  • Why this server?

    This tool downloads entire websites using wget, preserving the site structure and converting links to work locally, which is useful for scraping an entire site.

    Security: A, License: F, Quality: A
    Provides a tool to download entire websites using wget. It preserves the website structure and converts links to work locally.
    Last updated -
    1
    40
    JavaScript
    • Apple
    • Linux
  • Why this server?

    This server enables real-time web research by integrating Google search and capturing webpage content, making it useful for various scraping scenarios that require real-time data.

    Security: A, License: A, Quality: A
    The MCP Web Research Server enables real-time web research with Claude by integrating Google search, capturing webpage content and screenshots, and tracking research sessions.
    Last updated -
    3
    53
    46
    TypeScript
    MIT License
    • Apple
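
Several of the servers listed above describe link extraction as one of their scraping tools. As a rough illustration of that step only, the following sketch pulls absolute URLs out of an HTML string using Python's standard library. It is not taken from any of the listed servers; the names `LinkExtractor`, `extract_links`, and `SAMPLE_HTML`, and the example URLs, are hypothetical and chosen for this example.

```python
from html.parser import HTMLParser
from typing import List
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects absolute URLs from <a href="..."> tags (hypothetical helper)."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.links: List[str] = []

    def handle_starttag(self, tag, attrs):
        # Only anchor tags carry the links we care about here.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative hrefs against the page's base URL.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html: str, base_url: str) -> List[str]:
    """Return every link found in `html`, in document order."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


# Inline sample page, so the sketch runs without network access.
SAMPLE_HTML = '<p><a href="/docs">Docs</a> <a href="https://example.org/x">X</a></p>'
print(extract_links(SAMPLE_HTML, "https://example.com/"))
```

In a real scraper the HTML would come from an HTTP fetch, and a production tool would also need to handle rate limits, robots.txt, and malformed markup, none of which this sketch attempts.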