Web scraping tool for extracting content from SearXNG search results

Glama

Search for:

Web scraping tool for extracting content from SearXNG search results

View all MCP Servers

Why this server?
This server directly addresses both parts of the request by integrating multiple search engines, including SearXNG, and providing scraping, crawling, and content extraction capabilities to process the resulting web links.
OneSearch MCP Server
yokingma
A
security
A
license
A
quality
A Model Context Protocol server that enables web search, scraping, crawling, and content extraction through multiple engines including SearXNG, Firecrawl, and Tavily.
Last updated -
4
59
59
MIT License
Why this server?
This solution explicitly links SearXNG search results with a tool (Puppeteer-scraper) capable of navigating and extracting live content from the identified web links, achieving the required workflow.
@missionsquad/mcp-searxng-puppeteerofficial
MissionSquad
A
security
A
license
A
quality
An MCP server implementation that integrates the SearXNG API for powerful web search capabilities and uses @missionsquad/puppeteer-scraper to read and process live web content.
Last updated -
8
2
22
MIT License
Why this server?
This server combines SearXNG search functionality with explicit capabilities for 'website content scraping,' making it suitable for finding links via SearXNG and then extracting the page content.
MCP SearXNG Enhanced
OvertliDS
-
security
A
license
-
quality
A Model Context Protocol server that enables web search with category support, website content scraping with citation metadata, and timezone-aware date/time tools.
Last updated -
25
MIT License
Why this server?
This general-purpose tool is ideal for the second step: crawling and extracting data from web pages given the links identified by SearXNG, outputting structured data in formats like Markdown.
AnyCrawl MCP Server
any4ai
-
security
F
license
-
quality
Enables web scraping and crawling capabilities for LLM clients, supporting single-page scraping, multi-page website crawling, and web search with multiple engines (Playwright, Cheerio, Puppeteer) and flexible output formats including markdown, HTML, text, and screenshots.
Last updated -
1
4
Why this server?
This tool explicitly supports both web search (finding the links) and subsequent crawling and content extraction (scraping the content from those links).
WebSearch
josemartinrodriguezmortaloni
A
security
F
license
A
quality
Built as a Model Context Protocol (MCP) server that provides advanced web search, content extraction, web crawling, and scraping capabilities using the Firecrawl API.
Last updated -
1
Why this server?
Specializes in high-quality scraping and data extraction from any website globally, making it a robust option for accessing and extracting content from the pages linked by SearXNG.
Thordata MCP Server
xja1023789-collab
-
security
-
license
-
quality
Enables AI models to scrape and extract data from any website globally using Thordata's 195+ country proxy network. Bypasses anti-bot systems and renders JavaScript content, outputting structured data in Markdown, HTML, or Links format.
Last updated -
MIT License
Why this server?
This server provides the first necessary step: access to the SearXNG metasearch engine to identify the relevant web links.
SearXNG Server
ihor-sokoliuk
A
security
A
license
A
quality
An MCP server implementation that integrates the SearxNG API, providing web search capabilities.
Last updated -
2
2,070
319
MIT License
Why this server?
This tool is essential for the second step, enabling the fetching of content (HTML, text, etc.) from any given URL, such as the links returned by the SearXNG servers.
URL Fetch MCP
aelaguiz
A
security
A
license
A
quality
A Model Context Protocol (MCP) server that enables Claude or other LLMs to fetch content from URLs, supporting HTML, JSON, text, and images with configurable request parameters.
Last updated -
3
2
MIT License
Why this server?
Useful after fetching the content, this server extracts and cleans the main webpage content, transforming raw HTML into clean, organized Markdown for easy analysis.
Mozilla Readability Parser MCP Server
emzimmer
A
security
A
license
A
quality
Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.
Last updated -
1
14
14
MIT License
Why this server?
A specialized tool that converts scraped web content into clean, structured Markdown, solving the post-scraping data cleaning requirement of the user's task.
Skrape MCP Serverofficial
skrapeai
A
security
A
license
A
quality
This server converts webpages into clean, structured Markdown optimized for language model consumption, removing unnecessary content and supporting JavaScript rendering.
Last updated -
11
MIT License

@missionsquad/mcp-searxng-puppeteerofficial

Skrape MCP Serverofficial