defuddle-mcp
Extracts main content from web pages and raw HTML, converting it into cleaned Markdown while stripping away ads, navigation, and sidebars to provide structured text and metadata.
MCP server that extracts the main content from web pages using defuddle. Returns cleaned Markdown or HTML with metadata (title, author, description, etc.), stripping away ads, navigation, sidebars, and other clutter.
No authentication required — defuddle runs locally as a CLI tool.
Tools
| Tool | Description |
| --- | --- |
| | Fetch a URL and extract its main content |
| | Extract main content from raw HTML |
Both tools return a structured result with: title, author, description, domain, content, word_count, and other metadata fields. Set markdown=false to get HTML instead of Markdown.
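As a rough illustration of that result shape, the sketch below models the listed fields in Python. The `ExtractionResult` class, the `parse_result` helper, the field types, the defaults, and the sample values are all assumptions made for this example; they are not taken from the server's code, and the real result may be typed or structured differently.

```python
from dataclasses import dataclass, field
from typing import Any

# Field names as listed in the README; types and defaults are assumptions.
_KNOWN = ("title", "author", "description", "domain", "content", "word_count")


@dataclass
class ExtractionResult:
    """Hypothetical container for a tool result (not part of defuddle-mcp)."""
    title: str = ""
    author: str = ""
    description: str = ""
    domain: str = ""
    content: str = ""      # Markdown by default, HTML when markdown=false
    word_count: int = 0
    # Any other metadata fields are kept here untouched.
    extra: dict[str, Any] = field(default_factory=dict)


def parse_result(raw: dict[str, Any]) -> ExtractionResult:
    """Split a raw result dict into the named fields and everything else."""
    known = {k: raw[k] for k in _KNOWN if raw.get(k) is not None}
    extra = {k: v for k, v in raw.items() if k not in _KNOWN}
    return ExtractionResult(**known, extra=extra)


# Made-up sample data, for illustration only.
sample = {
    "title": "Example post",
    "author": "Jane Doe",
    "domain": "example.com",
    "content": "# Example post\n\nHello.",
    "word_count": 3,
    "published": "2024-01-01",  # hypothetical extra metadata field
}
result = parse_result(sample)
print(result.title, result.word_count, result.extra)
```

Keeping unknown keys in `extra` means the sketch does not break if the server returns metadata fields beyond the ones named here.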
Setup
Requires Python >= 3.10 and Node.js >= 18.
    uv sync

The defuddle npm package is installed automatically on first server start.
Run
    uv run python -m src.main

The server starts on http://127.0.0.1:8080/mcp.
Test
    uv run python -m src.client