Mozilla Readability Parser MCP Server

by emzimmer
MIT License
4
11

Integrations

  • Uses Mozilla's Readability algorithm to extract and transform webpage content into clean Markdown by removing ads, navigation, footers and non-essential elements while preserving core content structure.

Mozilla Readability Parser MCP Server

An model context protocol (MCP) server that extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure. More about MCP.

Features

  • Removes ads, navigation, footers and other non-essential content
  • Converts clean HTML into well-formatted Markdown (also uses Turndown)
  • Returns article metadata (title, excerpt, byline, site name)
  • Handles errors gracefully

Why Not Just Fetch?

Unlike simple fetch requests, this server:

  • Extracts only relevant content using Mozilla's Readability algorithm
  • Eliminates noise like ads, popups, and navigation menus
  • Reduces token usage by removing unnecessary HTML/CSS
  • Provides consistent Markdown formatting for better LLM processing
  • Includes useful metadata about the content

Installation

Installing via Smithery

To install Mozilla Readability Parser for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install server-moz-readability --client claude

Manual Installation

npm install server-moz-readability

Tool Reference

parse

Fetches and transforms webpage content into clean Markdown.

Arguments:

{ "url": { "type": "string", "description": "The website URL to parse", "required": true } }

Returns:

{ "title": "Article title", "content": "Markdown content...", "metadata": { "excerpt": "Brief summary", "byline": "Author information", "siteName": "Source website name" } }

Usage with Claude Desktop

Add to your claude_desktop_config.json:

{ "mcpServers": { "readability": { "command": "npx", "args": ["-y", "server-moz-readability"] } } }

Dependencies

  • @mozilla/readability - Content extraction
  • turndown - HTML to Markdown conversion
  • jsdom - DOM parsing
  • axios - HTTP requests

License

MIT

You must be authenticated.

A
security – no known vulnerabilities
A
license - permissive license
A
quality - confirmed to work

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Tools

Extracts and transforms webpage content into clean, LLM-optimized Markdown. Returns article title, main content, excerpt, byline and site name. Uses Mozilla's Readability algorithm to remove ads, navigation, footers and non-essential elements while preserving the core content structure.

  1. Features
    1. Why Not Just Fetch?
      1. Installation
        1. Installing via Smithery
        2. Manual Installation
      2. Tool Reference
        1. parse
      3. Usage with Claude Desktop
        1. Dependencies
          1. License

            Related MCP Servers

            • A
              security
              A
              license
              A
              quality
              This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.
              Last updated -
              1
              43,638
              JavaScript
              MIT License
              • Linux
              • Apple
            • A
              security
              F
              license
              A
              quality
              Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
              Last updated -
              4
              137,083
              150
              TypeScript
            • A
              security
              A
              license
              A
              quality
              This server converts webpages into clean, structured Markdown optimized for language model consumption, removing unnecessary content and supporting JavaScript rendering.
              Last updated -
              1
              5
              JavaScript
              MIT License
              • Apple
            • -
              security
              A
              license
              -
              quality
              Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.
              Last updated -
              1
              1
              Python
              MIT License

            View all related MCP servers

            ID: jdcx8fmajm