Scraping Public Documents

Search for:

Scraping Public Documents

View all MCP Servers

Why this server?
This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption, which is useful for scraping public documents.
Fetch MCP Serverofficial
Browser Automation
modelcontextprotocol
A
license
A
quality
B
maintenance
This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.
Last updated 2026-07-10
8
1
88,579
MIT
Why this server?
A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites, helpful for accessing various public documents.
Oxylabs MCP Serverofficial
Web Scraping Browser Automation RAG Systems
oxylabs
A
license
A
quality
C
maintenance
A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites.
Last updated 2026-06-08
4
100
MIT
Why this server?
A Model Context Protocol server that enables LLMs to fetch and process web content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection, which can be used to retrieve and scrape data from different public sources.
MCP URL Fetcher
Browser Automation Web Scraping Search
nathanonn
F
license
B
quality
D
maintenance
A Model Context Protocol server that enables LLMs to fetch and process web content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection.
Last updated 2025-03-30
5
5
Why this server?
A Model Context Protocol server enabling LLMs to search, retrieve, and manage documents through Rememberizer's knowledge management API; good for managing documents extracted.
Rememberizer MCP Server
Knowledge & Memory RAG Systems Databases
skydeckai
A
license
-
quality
D
maintenance
A Model Context Protocol server enabling LLMs to search, retrieve, and manage documents through Rememberizer's knowledge management API.
Last updated 2026-04-17
35
Apache 2.0
Why this server?
Provides RAG capabilities for semantic document search using Qdrant vector database and Ollama/OpenAI embeddings, allowing users to add, search, list, and delete documentation with metadata support.
RagDocs MCP Server
RAG Systems Vector Databases Web Scraping
heltonteixeira
A
license
-
quality
-
maintenance
Provides RAG capabilities for semantic document search using Qdrant vector database and Ollama/OpenAI embeddings, allowing users to add, search, list, and delete documentation with metadata support.
Last updated 2025-01-05
35
16
Why this server?
A server that enables AI assistants like Claude to perform web searches using the Exa AI Search API, providing real-time web information in a safe and controlled way, useful for finding publicly available documents.
Exa MCP Server
Web Scraping Browser Automation Search
geezerrrr
A
license
A
quality
D
maintenance
A server that enables AI assistants like Claude to perform web searches using the Exa AI Search API, providing real-time web information in a safe and controlled way.
Last updated 2025-03-21
2
16,232
MIT
Why this server?
An official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.
ScreenshotOne MCP Serverofficial
Browser Automation Web Scraping Image & Video Processing
screenshotone
A
license
B
quality
C
maintenance
An official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.
Last updated 2026-07-10
1
36
35
MIT
Why this server?
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Playwright MCP Serverofficial
Browser Automation Web Scraping Agent Orchestration
microsoft
A
license
B
quality
A
maintenance
A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.
Last updated 2026-07-25
14
24
6,397,156
35,581
Apache 2.0
Why this server?
Go server implementing Model Context Protocol (MCP) for filesystem operations; may assist in accessing local documents.
Filesystem MCP Server
File Systems OS Automation
mark3labs
A
license
-
quality
F
maintenance
Go server implementing Model Context Protocol (MCP) for filesystem operations.
Last updated 2025-11-24
665
MIT

Scraping Public Documents

Fetch MCP Serverofficial

Oxylabs MCP Serverofficial

MCP URL Fetcher

Rememberizer MCP Server

RagDocs MCP Server

Exa MCP Server

ScreenshotOne MCP Serverofficial

Playwright MCP Serverofficial

Filesystem MCP Server