Mozilla Readability Parser MCP Server

MIT License

MCP-server-readability-python

# MCP Server Readability Parser (Python / FastMCP)

## Credits/Reference
This project is based on the original [server-moz-readability](https://github.com/emzimmer/server-moz-readability) implementation by [emzimmer](https://github.com/emzimmer). For the original documentation, see the [original README.md](https://github.com/emzimmer/server-moz-readability/blob/main/readme.md).

This Python implementation adapts the original concept to run as a Python-based MCP server using [FastMCP](https://github.com/jlowin/fastmcp).



# Mozilla Readability Parser MCP Server

A Python implementation of a [Model Context Protocol (MCP)](https://github.com/modelcontextprotocol) server that extracts webpage content and transforms it into clean, LLM-optimized Markdown.

## Table of Contents
- [Features](#features)
- [Why Not Just Fetch?](#why-not-just-fetch)
- [Installation](#installation)
- [Quick Start](#quick-start)
- [Tool Reference](#tool-reference)
- [MCP Server Configuration](#mcp-server-configuration)
- [Dependencies](#dependencies)
- [License](#license)

## Features
- Removes ads, navigation, footers and other non-essential content
- Converts clean HTML into well-formatted Markdown
- Handles errors gracefully
- Optimized for LLM processing
- Lightweight and fast

## Why Not Just Fetch?
Unlike simple fetch requests, this server:
- Extracts only relevant content using the Readability algorithm
- Eliminates noise like ads, popups, and navigation menus
- Reduces token usage by removing unnecessary HTML/CSS
- Provides consistent Markdown formatting for better LLM processing
- Handles complex web pages with dynamic content
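
To make the token-reduction point concrete, here is a rough standard-library sketch that drops text nested inside a few noise tags and compares sizes before and after. It is a simplified stand-in, not the Readability algorithm the server relies on; the tag list and the `strip_noise` name are assumptions chosen for illustration.

```python
from html.parser import HTMLParser

# Hypothetical noise list; Readability scores candidate nodes instead of using a fixed list.
NOISE_TAGS = {"script", "style", "nav", "footer", "aside"}


class _TextExtractor(HTMLParser):
    """Collects text that is not nested inside any noise tag."""

    def __init__(self):
        super().__init__()
        self._noise_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self._noise_depth += 1

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS and self._noise_depth > 0:
            self._noise_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._noise_depth == 0:
            self.chunks.append(text)


def strip_noise(html: str) -> str:
    """Return the visible text of `html` with noise sections removed."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


page = (
    "<html><head><style>p{color:red}</style></head><body>"
    "<nav><a href='/'>Home</a></nav>"
    "<article><h1>Title</h1><p>Body text.</p></article>"
    "<footer>Copyright</footer></body></html>"
)
cleaned = strip_noise(page)
print(cleaned)
print(len(page), "->", len(cleaned), "characters")
```

Even on this tiny page most of the characters are markup or navigation, which is the overhead the server aims to remove before content reaches the model.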

## Installation

1. Clone the repository:
```bash
git clone https://github.com/jmh108/MCP-server-readability-python.git
cd MCP-server-readability-python
```

2. Create and activate a virtual environment:
```bash
python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate
```

3. Install dependencies:
```bash
pip install -r requirements.txt
```

## Quick Start

1. Start the server:
```bash
fastmcp run server.py
```

2. Example request:
```bash
curl -X POST http://localhost:8000/tools/extract_content \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/article"}'
```
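
The same request can be built from Python with the standard library. This sketch mirrors the curl example and assumes the server really does expose that HTTP endpoint on port 8000; the function names are hypothetical, and `call_extract_content` needs a running server.

```python
import json
import urllib.request

# Assumed endpoint, copied from the curl example; adjust it if your server differs.
ENDPOINT = "http://localhost:8000/tools/extract_content"


def build_request(page_url: str, endpoint: str = ENDPOINT) -> urllib.request.Request:
    """Build the POST request without sending it."""
    body = json.dumps({"url": page_url}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call_extract_content(page_url: str, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response."""
    with urllib.request.urlopen(build_request(page_url), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Building the request does not touch the network, so it is safe to inspect.
req = build_request("https://example.com/article")
print(req.get_method(), req.full_url)
```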

## Tool Reference

### `extract_content`
Fetches and transforms webpage content into clean Markdown.

**Arguments:**
```json
{
  "url": {
    "type": "string",
    "description": "The website URL to parse",
    "required": true
  }
}
```

**Returns:**
```json
{
  "content": "Markdown content..."
}
```
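
As an illustration of the contract above, the following sketch checks an arguments object and wraps a Markdown string in the documented return shape. The helper names are hypothetical, and restricting URLs to absolute `http`/`https` addresses is an assumption, not something the schema states.

```python
from urllib.parse import urlparse


def validate_arguments(arguments: dict) -> str:
    """Return the URL if `arguments` matches the schema, else raise ValueError."""
    url = arguments.get("url")
    if not isinstance(url, str) or not url:
        raise ValueError("'url' is required and must be a non-empty string")
    parsed = urlparse(url)
    # Assumption: only absolute http(s) URLs make sense for a web page fetch.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("'url' must be an absolute http(s) URL")
    return url


def make_result(markdown: str) -> dict:
    """Wrap extracted Markdown in the documented return shape."""
    return {"content": markdown}


url = validate_arguments({"url": "https://example.com/article"})
print(make_result(f"# Placeholder for {url}"))
```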

## MCP Server Configuration

To configure the MCP server, add the following to your MCP settings file:

```json
{
  "mcpServers": {
    "readability": {
      "command": "fastmcp",
      "args": ["run", "server.py"],
      "env": {}
    }
  }
}
```

Your MCP client can then start the server and call the `extract_content` tool.
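
If you prefer to script the change, a small helper can merge the entry into an existing settings file without disturbing other servers. This is a sketch only: the location and name of the settings file depend on your MCP client, so the path used here is a temporary placeholder and `add_readability_server` is a hypothetical name.

```python
import json
import tempfile
from pathlib import Path

# The same entry as in the JSON snippet above.
READABILITY_ENTRY = {"command": "fastmcp", "args": ["run", "server.py"], "env": {}}


def add_readability_server(settings_path: Path) -> dict:
    """Insert the 'readability' entry, keeping any servers already configured."""
    settings = {}
    if settings_path.exists():
        settings = json.loads(settings_path.read_text(encoding="utf-8"))
    settings.setdefault("mcpServers", {})["readability"] = dict(READABILITY_ENTRY)
    settings_path.write_text(json.dumps(settings, indent=2), encoding="utf-8")
    return settings


# Demonstration against a throwaway file instead of a real client config.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "mcp_settings.json"  # placeholder file name
    path.write_text(json.dumps({"mcpServers": {"other": {"command": "x"}}}), encoding="utf-8")
    result = add_readability_server(path)
    print(sorted(result["mcpServers"]))
```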

## Dependencies
- [readability-lxml](https://github.com/buriy/python-readability) - Content extraction
- [html2text](https://github.com/Alir3z4/html2text) - HTML to Markdown conversion
- [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/) - DOM parsing
- [requests](https://docs.python-requests.org/) - HTTP requests
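
One way these packages could fit together is sketched below. It is not the repository's `server.py`; the function names, the timeout, and the converter settings are assumptions, and `beautifulsoup4` is left out because its exact role in the server is not described here.

```python
def html_to_markdown(html: str) -> str:
    """Extract the main article from raw HTML and render it as Markdown."""
    # Imported lazily so this module still loads when the packages are absent.
    import html2text
    from readability import Document  # provided by readability-lxml

    # Readability keeps the main content and discards surrounding page chrome.
    article_html = Document(html).summary(html_partial=True)

    converter = html2text.HTML2Text()
    converter.body_width = 0  # do not hard-wrap lines
    converter.ignore_images = True  # assumption: images add little for an LLM
    return converter.handle(article_html).strip()


def fetch_markdown(url: str, timeout: float = 30.0) -> dict:
    """Download a page and return it in the tool's documented result shape."""
    import requests

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # surface HTTP errors instead of parsing error pages
    return {"content": html_to_markdown(response.text)}
```

Keeping the conversion step separate from the download makes it possible to test extraction on saved HTML without network access.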

## License
MIT License - See [LICENSE](LICENSE) for details.
