
Trafilatura MCP Server

by fvanevski

fetch_and_extract

Extract main content, metadata, and optional comments from web pages by providing a URL. Returns structured JSON data for web scraping and content analysis.

Instructions

Fetches a URL and extracts the main content, metadata, and comments. Returns a JSON object with the extracted data.

Input Schema

Name              Required  Description                                                     Default
url               Yes       The URL of the web page to process.                             (none)
include_comments  No        Whether to include comment sections at the bottom of articles.  false
include_tables    No        Extract text from HTML <table> elements.                        false
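For illustration, a tool call satisfying this schema might carry the following arguments (the URL and flag values here are made up, not taken from the server's documentation):

```python
import json

# Hypothetical arguments for a fetch_and_extract call.
arguments = {
    "url": "https://example.com/article",  # required
    "include_comments": True,              # optional, defaults to false
    "include_tables": False,               # optional, defaults to false
}

# Serialized roughly as it would appear in a tools/call request body.
payload = json.dumps({"name": "fetch_and_extract", "arguments": arguments})
print(payload)
```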

Implementation Reference

  • Core handler function for the fetch_and_extract tool: fetches URL content using trafilatura.fetch_url and extracts JSON-formatted main content and metadata with options for comments and tables.
    def perform_trafilatura(args: TrafilaturaInput) -> str:
        """
        Fetches and extracts content from a URL using Trafilatura.

        Args:
            args: A TrafilaturaInput object containing the URL and extraction options.

        Returns:
            A JSON string containing the extracted content and metadata.
        """
        logging.info(f"Executing fetch_and_extract for URL: '{args.url}'")
        try:
            # Fetch and extract the content from the given URL
            downloaded = trafilatura.fetch_url(args.url)
            if downloaded is None:
                raise McpError(
                    ErrorData(
                        code=INTERNAL_ERROR,
                        message=f"Failed to download content from URL: {args.url}",
                    )
                )
            # Extract the main content and metadata as a JSON string
            json_output = trafilatura.extract(
                downloaded,
                include_comments=args.include_comments,
                include_tables=args.include_tables,
                output_format="json",
                with_metadata=True,
                url=args.url,
            )
            if json_output is None:
                # If trafilatura returns nothing, build a minimal JSON response
                return json.dumps({"main_content": None, "metadata": {}}, indent=4)
            return json_output
        except Exception as e:
            logging.error(
                f"An unexpected error occurred during Trafilatura processing: {e}",
                exc_info=True,
            )
            raise McpError(
                ErrorData(code=INTERNAL_ERROR, message=f"Unexpected error: {e}")
            )
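When trafilatura.extract returns None, the handler falls back to a minimal JSON object rather than raising. A quick standard-library sketch of that fallback shape, so callers know what to check for:

```python
import json

# Mirror of the handler's fallback when extraction yields nothing.
fallback = json.dumps({"main_content": None, "metadata": {}}, indent=4)

# Clients can detect the "nothing extracted" case by testing for a null
# main_content rather than treating every response as a full article.
parsed = json.loads(fallback)
print(parsed["main_content"] is None)
```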
  • Pydantic input schema for the fetch_and_extract tool, defining required URL and optional flags for including comments and tables.
    class TrafilaturaInput(BaseModel):
        """Input model for the fetch_and_extract tool."""

        url: str = Field(..., description="The URL of the web page to process.")
        include_comments: bool = Field(
            default=False,
            description="Whether to include comment sections at the bottom of articles.",
        )
        include_tables: bool = Field(
            default=False,
            description="Extract text from HTML <table> elements.",
        )
  • Registration of the fetch_and_extract tool in the MCP server's list_tools method, providing name, description, and reference to the input schema.
    Tool(
        name="fetch_and_extract",
        description=(
            "Fetches a URL and extracts the main content, metadata, and comments. "
            "Returns a JSON object with the extracted data."
        ),
        inputSchema=TrafilaturaInput.model_json_schema(),
    )
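model_json_schema() serializes the Pydantic model into standard JSON Schema for the client. A hand-written approximation of what that schema contains (field names and defaults come from the model above; Pydantic's real output also includes keys such as "title"):

```python
# Approximation of TrafilaturaInput.model_json_schema() output, for
# illustration only — not the library's verbatim result.
input_schema = {
    "type": "object",
    "properties": {
        "url": {
            "type": "string",
            "description": "The URL of the web page to process.",
        },
        "include_comments": {"type": "boolean", "default": False},
        "include_tables": {"type": "boolean", "default": False},
    },
    # Only url lacks a default, so it is the sole required field.
    "required": ["url"],
}
```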
  • MCP call_tool handler that dispatches to fetch_and_extract: validates tool name, parses input arguments using the schema, invokes the core perform_trafilatura function, and returns the JSON result as TextContent.
    @server.call_tool()
    async def call_tool(name: str, arguments: dict) -> list[TextContent]:
        if name != "fetch_and_extract":
            raise McpError(
                ErrorData(code=INVALID_PARAMS, message=f"Unknown tool: {name}")
            )
        try:
            args = TrafilaturaInput(**arguments)
        except ValueError as e:
            raise McpError(ErrorData(code=INVALID_PARAMS, message=str(e)))
        # Perform the extraction
        result_json_string = perform_trafilatura(args)
        # Return the result as a JSON string
        return [TextContent(type="text", text=result_json_string)]
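The validate-dispatch-wrap flow above can be sketched without the MCP runtime. In this stand-in, ValueError replaces McpError, a plain dict replaces TextContent, and a stub replaces perform_trafilatura — all three are substitutions for illustration, not the server's real types:

```python
import json

def fake_extract(arguments: dict) -> str:
    # Stub for perform_trafilatura: returns a JSON string, as the real one does.
    return json.dumps({"main_content": "stub", "metadata": {"url": arguments["url"]}})

def call_tool(name: str, arguments: dict) -> list[dict]:
    # 1. Reject unknown tool names, as the real handler does.
    if name != "fetch_and_extract":
        raise ValueError(f"Unknown tool: {name}")
    # 2. Validate required arguments (Pydantic handles this in the real server).
    if "url" not in arguments:
        raise ValueError("Missing required field: url")
    # 3. Run the extraction and wrap the JSON string as text content.
    return [{"type": "text", "text": fake_extract(arguments)}]

result = call_tool("fetch_and_extract", {"url": "https://example.com"})
```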

MCP directory API

We provide all the information about MCP servers via our MCP directory API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/fvanevski/trafilatura_mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server.