The Mozilla Readability Parser MCP Server extracts and transforms webpage content into clean, LLM-optimized Markdown.
Extracts only relevant content using Mozilla's Readability algorithm
Removes ads, navigation, footers, and other non-essential elements
Converts clean HTML into well-formatted Markdown
Returns article metadata (title, excerpt, byline, site name)
Reduces token usage by removing unnecessary HTML/CSS
Handles errors gracefully
Provides consistent formatting for better LLM processing
Uses Mozilla's Readability algorithm to extract and transform webpage content into clean Markdown by removing ads, navigation, footers and non-essential elements while preserving core content structure.
Mozilla Readability Parser MCP Server
An model context protocol (MCP) server that extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure. More about MCP.
Features
Removes ads, navigation, footers and other non-essential content
Converts clean HTML into well-formatted Markdown (also uses Turndown)
Returns article metadata (title, excerpt, byline, site name)
Handles errors gracefully
Related MCP server: Skrape MCP Server
Why Not Just Fetch?
Unlike simple fetch requests, this server:
Extracts only relevant content using Mozilla's Readability algorithm
Eliminates noise like ads, popups, and navigation menus
Reduces token usage by removing unnecessary HTML/CSS
Provides consistent Markdown formatting for better LLM processing
Includes useful metadata about the content
Installation
Installing via Smithery
To install Mozilla Readability Parser for Claude Desktop automatically via Smithery:
Manual Installation
Tool Reference
parse
Fetches and transforms webpage content into clean Markdown.
Arguments:
Returns:
Usage with Claude Desktop
Add to your claude_desktop_config.json:
Dependencies
@mozilla/readability - Content extraction
turndown - HTML to Markdown conversion
jsdom - DOM parsing
axios - HTTP requests
License
MIT