Mozilla Readability Parser MCP Server

Mozilla Readability Parser MCP Server

An model context protocol (MCP) server that extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure. More about MCP.

<a href="https://glama.ai/mcp/servers/jdcx8fmajm"><img width="380" height="200" src="https://glama.ai/mcp/servers/jdcx8fmajm/badge" alt="Mozilla Readability Parser Server MCP server" /></a>

Features

  • Removes ads, navigation, footers and other non-essential content
  • Converts clean HTML into well-formatted Markdown (also uses Turndown)
  • Returns article metadata (title, excerpt, byline, site name)
  • Handles errors gracefully

Why Not Just Fetch?

Unlike simple fetch requests, this server:

  • Extracts only relevant content using Mozilla's Readability algorithm
  • Eliminates noise like ads, popups, and navigation menus
  • Reduces token usage by removing unnecessary HTML/CSS
  • Provides consistent Markdown formatting for better LLM processing
  • Includes useful metadata about the content

Installation

npm install server-moz-readability

Tool Reference

parse

Fetches and transforms webpage content into clean Markdown.

Arguments:

{ "url": { "type": "string", "description": "The website URL to parse", "required": true } }

Returns:

{ "title": "Article title", "content": "Markdown content...", "metadata": { "excerpt": "Brief summary", "byline": "Author information", "siteName": "Source website name" } }

Usage with Claude Desktop

Add to your claude_desktop_config.json:

{ "mcpServers": { "readability": { "command": "npx", "args": ["-y", "server-moz-readability"] } } }

Dependencies

  • @mozilla/readability - Content extraction
  • turndown - HTML to Markdown conversion
  • jsdom - DOM parsing
  • axios - HTTP requests

License

MIT

A
security – no known vulnerabilities (report Issue)
A
license - permissive license
A
quality - confirmed to work

Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.

  1. Features
    1. Why Not Just Fetch?
      1. Installation
        1. Tool Reference
          1. parse
          2. Usage with Claude Desktop
            1. Dependencies
              1. License