Why this server?
This server fetches web content, which is the first step in extracting text from a webpage. It supports various HTTP methods and content formats.
A license | B quality | C maintenance
An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.

Why this server?
This server acts as a web browser for LLMs, crawling webpages much like web search in ChatGPT, which makes it suitable for the crawling step.

mcp-server-rag-web-browser (official)
A license | - quality | B maintenance
Implementation of an MCP server for the RAG Web Browser Actor. This Actor serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT.
License: Apache 2.0

Why this server?
This MCP server offers unified access to multiple search engines and content processing services, making it useful for both crawling webpages and processing their content.
A license | B quality | B maintenance
🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi), AI tools (Perplexity, FastGPT), and content processing services (Jina AI, Kagi). Combines search, AI responses, content processing, and enhancement features through a single interface.

Why this server?
Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination, making it suitable for extracting text from web pages after crawling.
A license | B quality | C maintenance
Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.

Why this server?
Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown. Useful for both crawling and initial text extraction.
A license | A quality | C maintenance
Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
License: MIT

Why this server?
This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption. Useful for text extraction.
- license | B quality | - maintenance
This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.
License: MIT

Why this server?
Extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown, helping with the extraction of text after crawling.
A license | - quality | C maintenance
A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.

Why this server?
This server enables users to download entire websites and their assets for offline access, which is effectively crawling. The downloaded pages can then be passed to a text extraction tool.
A license | B quality | C maintenance
This server enables users to download entire websites and their assets for offline access, supporting configurable depth and concurrency settings.
License: MIT

Why this server?
It crawls websites.