Why this server?
This server fetches web content, which is the first step in extracting text from a webpage. It supports various HTTP methods and content formats.
AsecurityAlicense-qualityAn MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.Last updated a year ago311111MITWhy this server?
This server acts as a web browser for LLMs, crawling webpages similar to web search in ChatGPT, making it suitable for the crawling aspect.

mcp-server-rag-web-browserofficial
AsecurityAlicense-qualityImplementation of an MCP server for the RAG Web Browser Actor. This Actor serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT.Last updated 8 months ago40202Apache 2.0Why this server?
This MCP server offers a unified access to multiple search engines and content processing services, useful for both crawling and processing of webpage content.
AsecurityAlicense-quality🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi), AI tools (Perplexity, FastGPT), and content processing services (Jina AI, Kagi). Combines search, AI responses, content processing, and enhancement features through a single interface.Last updated 3 days ago7600290MITWhy this server?
Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination, making it suitable for extracting text from web pages after crawling.
AsecurityAlicense-qualityEnables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.Last updated a year ago13MITWhy this server?
Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown. Useful for both crawling and initial text extraction.
AsecurityAlicense-qualityProvides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.Last updated a month ago43,457739MITWhy this server?
This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption. Useful for text extraction.
Asecurity-license-qualityThis server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.Last updated 15 days ago203183,086Why this server?
Extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown, helping with the extraction of text after crawling.
-securityAlicense-qualityA Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.Last updated a year ago4MITWhy this server?
This server enables users to download entire websites and their assets for offline access, which is effectively crawling, then the user can use text extraction tools.
AsecurityAlicense-qualityThis server enables users to download entire websites and their assets for offline access, supporting configurable depth and concurrency settings.Last updated a year ago15MITWhy this server?
It crawls website.