Why this server?
This server directly addresses both parts of the request by integrating multiple search engines, including SearXNG, and providing scraping, crawling, and content extraction capabilities to process the resulting web links.
AsecurityAlicense-qualityA Model Context Protocol server that enables web search, scraping, crawling, and content extraction through multiple engines including SearXNG, Firecrawl, and Tavily.Last updated4697102MITWhy this server?
This solution explicitly links SearXNG search results with a tool (Puppeteer-scraper) capable of navigating and extracting live content from the identified web links, achieving the required workflow.
AsecurityAlicense-qualityAn MCP server implementation that integrates the SearXNG API for powerful web search capabilities and uses @missionsquad/puppeteer-scraper to read and process live web content.Last updated726MITWhy this server?
This server combines SearXNG search functionality with explicit capabilities for 'website content scraping,' making it suitable for finding links via SearXNG and then extracting the page content.
-securityAlicense-qualityA Model Context Protocol server that enables web search with category support, website content scraping with citation metadata, and timezone-aware date/time tools.Last updated36MITWhy this server?
This general-purpose tool is ideal for the second step: crawling and extracting data from web pages given the links identified by SearXNG, outputting structured data in formats like Markdown.
-securityFlicense-qualityEnables web scraping and crawling capabilities for LLM clients, supporting single-page scraping, multi-page website crawling, and web search with multiple engines (Playwright, Cheerio, Puppeteer) and flexible output formats including markdown, HTML, text, and screenshots.Last updated55Why this server?
This tool explicitly supports both web search (finding the links) and subsequent crawling and content extraction (scraping the content from those links).
AsecurityFlicense-qualityBuilt as a Model Context Protocol (MCP) server that provides advanced web search, content extraction, web crawling, and scraping capabilities using the Firecrawl API.Last updated41Why this server?
Specializes in high-quality scraping and data extraction from any website globally, making it a robust option for accessing and extracting content from the pages linked by SearXNG.
-security-license-qualityEnables AI models to scrape and extract data from any website globally using Thordata's 195+ country proxy network. Bypasses anti-bot systems and renders JavaScript content, outputting structured data in Markdown, HTML, or Links format.Last updatedMITWhy this server?
This server provides the first necessary step: access to the SearXNG metasearch engine to identify the relevant web links.
AsecurityAlicense-qualityAn MCP server implementation that integrates the SearxNG API, providing web search capabilities.Last updated210,171621MITWhy this server?
This tool is essential for the second step, enabling the fetching of content (HTML, text, etc.) from any given URL, such as the links returned by the SearXNG servers.
AsecurityAlicense-qualityA Model Context Protocol (MCP) server that enables Claude or other LLMs to fetch content from URLs, supporting HTML, JSON, text, and images with configurable request parameters.Last updated32MITWhy this server?
Useful after fetching the content, this server extracts and cleans the main webpage content, transforming raw HTML into clean, organized Markdown for easy analysis.
AsecurityAlicense-qualityExtracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.Last updated13616MIT