Developing a web scraper

  • Why this server?

    Enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations, which is essential for a web scraper.

    Security: - · License: A · Quality: -
    An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.
    66
    8
    TypeScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    Provides unified access to multiple search engines and content processing services, useful for a comprehensive web scraping project.

    Security: A · License: A · Quality: A
    🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi), AI tools (Perplexity, FastGPT), and content processing services (Jina AI, Kagi). Combines search, AI responses, content processing, and enhancement features through a single interface.
    15
    47
    30
    TypeScript
    MIT License
    • Linux
  • Why this server?

    A server that provides AgentQL's data extraction capabilities, enabling AI agents to get structured data from the unstructured web, which helps in extracting data for a web scraper.

    Security: A · License: A · Quality: A
    A server that provides AgentQL's data extraction capabilities, enabling AI agents to get structured data from the unstructured web.
    1
    183
    28
    JavaScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    Enables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript, which can be part of a scraper.

    Security: A · License: F · Quality: A
    Enables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript.
    4
    15
    Python
    • Linux
  • Why this server?

    Implementation of an MCP server for the RAG Web Browser Actor, which serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT. Useful for understanding how a browser is used in context.

    Security: A · License: A · Quality: A
    Implementation of an MCP server for the RAG Web Browser Actor. This Actor serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT.
    1
    330
    77
    JavaScript
    Apache 2.0
    • Apple
  • Why this server?

    Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination, important for preparing web content.

    Security: - · License: A · Quality: -
    Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.
    1
    1
    Python
    MIT License
  • Why this server?

    A server that allows fetching Instagram posts using Chrome's existing login session, which may be helpful for specific web scraping tasks.

    Security: A · License: F · Quality: A
    A server that allows fetching Instagram posts using Chrome's existing login session via Model Context Protocol (MCP).
    1
    4
    TypeScript
  • Why this server?

    Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown, which are all needed for scraping data.

    Security: A · License: F · Quality: A
    Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
    4
    137,083
    150
    TypeScript
  • Why this server?

    It crawls websites.

  • Why this server?

    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown, which is essential for focused data extraction from scraped pages.

    Security: - · License: A · Quality: -
    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
    1
    Python
    MIT License
    • Linux
    • Apple