Methods to Parse a Web Page or Data

Search for:

Methods to Parse a Web Page or Data

View all MCP Servers

Why this server?
Enables fetching web content using the Node.js undici library, supporting various HTTP methods and content formats, which is essential for parsing pages.
MCP Node Fetch
Browser Automation Web Scraping
mcollina
A
license
B
quality
D
maintenance
An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.
Last updated 2025-03-10
3
51
11
MIT
Why this server?
Provides unified access to multiple search engines and content processing services, allowing parsing of the fetched content through its content processing capabilities.
mcp-omnisearch
Search Web Scraping RAG Systems
spences10
A
license
B
quality
B
maintenance
🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi), AI tools (Perplexity, FastGPT), and content processing services (Jina AI, Kagi). Combines search, AI responses, content processing, and enhancement features through a single interface.
Last updated 2026-07-27
3
218
336
MIT
Why this server?
Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, which aids in parsing and understanding web page structures.
Fetch MCP Server
Web Scraping Browser Automation RAG Systems
ExactDoug
A
license
B
quality
D
maintenance
Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.
Last updated 2025-02-13
1
3
MIT
Why this server?
Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown, making it suitable for parsing diverse page structures.
Fetch MCP Server
Web Scraping Browser Automation
zcaceres
A
license
A
quality
D
maintenance
Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
Last updated 2026-03-12
4
2,506
797
MIT
Why this server?
A Model Context Protocol server that provides web content fetching and conversion capabilities, useful for parsing and processing page contents.
MCP Server Fetch TypeScript
Web Scraping Browser Automation Search
tatn
A
license
A
quality
D
maintenance
A Model Context Protocol server that provides web content fetching and conversion capabilities.
Last updated 2025-03-13
4
478
4
MIT
Why this server?
A Python implementation that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown, facilitating easier parsing.
Mozilla Readability Parser MCP
Web Scraping
jmh108
A
license
-
quality
D
maintenance
A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
Last updated 2025-01-06
4
MIT
Why this server?
Provides tools for reading and extracting text from PDF files, supporting both local files and URLs, which might be needed when dealing with PDFs that are technically considered pages.
PDF Reader MCP Server
File Systems App Automation
trafflux
F
license
-
quality
D
maintenance
Provides tools for reading and extracting text from PDF files, supporting both local files and URLs.
Last updated 2025-02-20
46
Why this server?
A Model Context Protocol server that enables interaction with markdown documentation files, providing capabilities for document management, metadata handling, search, and documentation health analysis.
MCP Documentation Service
Documentation Access Knowledge & Memory Note Taking
alekspetrov
A
license
A
quality
D
maintenance
A Model Context Protocol implementation that enables AI assistants to interact with markdown documentation files, providing capabilities for document management, metadata handling, search, and documentation health analysis.
Last updated 2026-02-11
14
99
58
MIT
Why this server?
Uses PyPDF2 to extract pages and search PDFs, relevant when the pages are PDF files.
mcp-pdf-tools
File Systems Content Management Systems App Automation
hanweg
A
license
B
quality
D
maintenance
mcp using PyPDF2 to: • merge-pdfs • extract-pages • search-pdfs • merge-pdfs-ordered (merge in user spec. order) • find-related-pdfs (regex extracted text for related PDF files)
Last updated 2024-12-22
5
74
The Unlicense