MCP server for web scraping and content extraction

Search for:

MCP server for web scraping and content extraction

View all MCP Servers

Why this server?
This server is a strong fit as it explicitly states its ability to 'scrape and extract data from any website globally,' bypassing anti-bot systems and rendering JavaScript content.
Thordata MCP Server
Web Scraping Browser Automation
xja1023789-collab
-
license
-
quality
-
maintenance
Enables AI models to scrape and extract data from any website globally using Thordata's 195+ country proxy network. Bypasses anti-bot systems and renders JavaScript content, outputting structured data in Markdown, HTML, or Links format.
Last updated 2025-09-23
Why this server?
This server directly provides 'web scraping and crawling capabilities' for LLM clients, supporting various scraping methods and output formats, making it highly relevant to fetching web content.
AnyCrawl MCP Server
Web Scraping Browser Automation
any4ai
A
license
-
quality
C
maintenance
Enables web scraping and crawling capabilities for LLM clients, supporting single-page scraping, multi-page website crawling, and web search with multiple engines (Playwright, Cheerio, Puppeteer) and flexible output formats including markdown, HTML, text, and screenshots.
Last updated 2026-03-19
5
6
MIT
Why this server?
This server is designed for 'advanced search and retrieval for web crawler data,' allowing AI clients to filter and analyze web content autonomously.
mcp-server-webcrawl
RAG Systems Search Web Scraping
pragmar
F
license
-
quality
C
maintenance
Bridge the gap between your web crawl and AI language models. With mcp-server-webcrawl, your AI client filters and analyzes web content under your direction or autonomously, extracting insights from your web content. Supports WARC, wget, InterroBot, Katana, and SiteOne crawlers.
Last updated 2026-05-31
44
Python
Why this server?
This server offers 'web content extraction' and 'browser automation' capabilities using Playwright, which is directly relevant to capturing information from web pages.
Low Cost Browsing MCP Server
Browser Automation Web Scraping RAG Systems
lcbro
F
license
-
quality
D
maintenance
Enables browser automation, web content extraction, and LLM-powered data transformation using Playwright. Supports session management, authentication flows, and works with local LLMs (Ollama, JAN AI) or external providers to clean and structure extracted web data.
Last updated 2025-09-14
36
6
Why this server?
This server facilitates 'comprehensive web research' by utilizing search and crawl APIs to 'gather and structure data' for document creation, indicating strong web content retrieval features.
Deep Research MCP
Search Web Scraping RAG Systems
ali-kh7
-
license
B
quality
-
maintenance
A Model Context Protocol compliant server that facilitates comprehensive web research by utilizing Tavily's Search and Crawl APIs to gather and structure data for high-quality markdown document creation.
Last updated 2025-12-16
1
37
12
Why this server?
This server enables LLMs to 'fetch content from URLs,' supporting various formats like HTML, JSON, and text, which directly addresses the user's need for retrieving web content.
URL Fetch MCP
Browser Automation Web Scraping
aelaguiz
A
license
A
quality
D
maintenance
A Model Context Protocol (MCP) server that enables Claude or other LLMs to fetch content from URLs, supporting HTML, JSON, text, and images with configurable request parameters.
Last updated 2025-03-19
3
3
MIT
Why this server?
This server explicitly supports 'web search and content scraping' through the Google Custom Search API, allowing for comprehensive information gathering from the internet.
Web Search MCP Server
Browser Automation Web Scraping Search
Mantraa-Zzz
F
license
B
quality
D
maintenance
Enables web searching and content scraping through Google Custom Search API. Provides tools to search the internet, extract webpage content, and automatically scrape search results for comprehensive information gathering.
Last updated 2025-09-02
3
Why this server?
This server offers 'web content extraction,' 'screenshot capture,' and 'web search' through Jina AI's APIs, including tools for reading URLs as markdown, making it highly suitable.
Jina AI Remote MCP Serverofficial
Web Scraping Search Image & Video Processing
jina-ai
A
license
A
quality
C
maintenance
Enables web content extraction, screenshot capture, web search, arXiv paper search, and image search through Jina AI's APIs. Provides tools for reading URLs as markdown, searching the web for current information, and finding academic papers or images.
Last updated 2026-06-02
19
753
Apache 2.0
Why this server?
This server allows users to 'search the web' using DuckDuckGo and 'fetch and summarize content from search results,' which directly matches the user's request for web content.
DuckDuckGo Web Search MCP
Search Text Summarization Browser Automation
kouui
F
license
B
quality
D
maintenance
Allows you to search the web using DuckDuckGo and optionally fetch and summarize content from search results.
Last updated 2025-03-17
2
4

Jina AI Remote MCP Serverofficial