Retrieve relevance-ranked web content, including text chunks, tables, and code blocks, for AI grounding and RAG pipelines. Enables direct reasoning over page substance without manual fetching.
127,227 tools. Last updated 2026-05-05 10:29
"A tool for fetching and retrieving web pages or web content" matching MCP tools:
- Retrieve web search results with rich metadata, including news, videos, and local listings, using Brave Search API with filters for country, language, freshness, and safety.
- Convert webpages into clean, formatted markdown for extracting content from documentation, articles, and web pages. Fetches any webpage and transforms HTML content into readable markdown format.MIT
- Extract clean content from multiple web pages simultaneously to compare information across sources or gather data from several pages at once.Apache 2.0
- Extract web page content and convert it to clean, readable markdown format for analysis, bypassing paywalls and obtaining structured text data from websites.Apache 2.0
- Capture web page screenshots in JPEG format for visual inspection, analysis, or demonstration. Specify URL and choose between single-screen or full-page capture.Apache 2.0
Matching MCP Servers
- AlicenseAqualityCmaintenanceA Model Context Protocol server that provides real-time web search capabilities to AI assistants through pluggable search providers, currently integrated with the Brave Search API.Last updated114MIT
- Flicense-qualityCmaintenanceEnables performing Google searches and retrieving content from the top 5 non-social media results. Returns crawled web page content as a single consolidated string for analysis.Last updated5
Matching MCP Connectors
AI web extraction: send URLs + a JSON Schema, get clean structured data. Pay-per-use via x402.
Search the web and extract clean, readable text from webpages. Process multiple URLs at once to sp…
- Perform web searches using SearXNG API to gather information, find news and articles, or explore diverse online sources for general queries and recent events.MIT
- Perform web searches to gather real-time information, news, and detailed content analysis with customizable parameters for results, domains, and timeframes.
- Extract and process web content from URLs for data collection, content analysis, and research tasks, supporting multiple formats and extraction depths.
- Analyze web page quality using Google's Lighthouse metrics to measure performance, accessibility, and SEO factors for optimization.Apache 2.0
- Extract content matching regex patterns from web pages while avoiding bot detection. Retrieve specific website data with configurable modes for different security levels.Apache 2.0
- Search previously fetched pages stored in the local SQLite FTS5 index. Use to recall content already cached, avoiding re-fetching. Supports keyword queries, phrases, and boolean operators. Returns highlighted snippets with titles and URLs. Requires populated cache—use the search tool first to add pages.
- Extract structured data from web pages including markdown content, links, tables, images, and metadata. Supports JavaScript-rendered pages with optional CSS selector waiting for dynamic content.
- Fetch dynamic web page content after JavaScript rendering and convert it to Markdown format using chunked streaming for large pages.MIT
- Extract raw text content from PDFs, web pages, pasted text, or YouTube transcripts for content export without AI processing.MIT
- Convert web pages to clean markdown format by extracting content, removing unnecessary elements, and ranking information for RAG applications.MIT
- Extract raw text content from PDFs, web pages, pasted text, or YouTube transcripts for content export without AI processing.MIT
- Retrieve raw text content from PDFs, web pages, pasted text, or YouTube transcripts for content export without AI processing.MIT
- Extract clean Markdown content from web pages by removing boilerplate and executing JavaScript for dynamic sites, optimized for AI processing.
- Capture web page screenshots for verifying updates, with automatic tiling for full pages and optimized processing for CLI tools.MIT