Glama

Trafilatura MCP Server

by fvanevski

fetch_and_extract

Extract main content, metadata, and optional comments from web pages by providing a URL. Returns structured JSON data with text, title, author, and date information.

Instructions

Fetches a URL and extracts the main content, metadata, and comments. Returns a JSON object with the extracted data.
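The listing does not show the server's source, so the following is only a minimal sketch of how such a tool could be built on the Trafilatura library's `fetch_url` and `extract` functions. The names `build_extract_options` and `fetch_and_extract_sketch` are hypothetical, and the choice of `output_format="json"` with `with_metadata=True` is an assumption about how the server produces its JSON result.

```python
import json
from typing import Any, Optional


def build_extract_options(include_comments: bool = False,
                          include_tables: bool = False) -> dict[str, Any]:
    """Map the tool's two flags onto keyword arguments for trafilatura.extract.

    Hypothetical helper: the real server may choose different options.
    """
    return {
        "output_format": "json",   # assumption: JSON output is requested from the library
        "with_metadata": True,     # assumption: needed for title, author, and date
        "include_comments": include_comments,
        "include_tables": include_tables,
    }


def fetch_and_extract_sketch(url: str,
                             include_comments: bool = False,
                             include_tables: bool = False) -> Optional[dict[str, Any]]:
    """Download a page and return the extracted data as a dict, or None on failure."""
    # Imported lazily so the options helper works without the dependency installed.
    import trafilatura

    downloaded = trafilatura.fetch_url(url)  # None when the download fails
    if downloaded is None:
        return None
    options = build_extract_options(include_comments, include_tables)
    result = trafilatura.extract(downloaded, url=url, **options)
    # extract() returns a JSON string here, or None when no main content is found.
    return json.loads(result) if result else None
```

Calling `fetch_and_extract_sketch("https://example.com")` would need network access and the `trafilatura` package; the result depends on the page.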

Input Schema

Name | Required | Description | Default
include_comments | No | Whether to include comment sections at the bottom of articles. | false
include_tables | No | Extract text from HTML <table> elements. | false
url | Yes | The URL of the web page to process. | (none)

Input Schema (JSON Schema)

{
  "properties": {
    "include_comments": {
      "default": false,
      "description": "Whether to include comment sections at the bottom of articles.",
      "title": "Include Comments",
      "type": "boolean"
    },
    "include_tables": {
      "default": false,
      "description": "Extract text from HTML <table> elements.",
      "title": "Include Tables",
      "type": "boolean"
    },
    "url": {
      "description": "The URL of the web page to process.",
      "title": "Url",
      "type": "string"
    }
  },
  "required": [
    "url"
  ],
  "type": "object"
}
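To illustrate how a client might use this schema, here is a small sketch that checks arguments against it and wraps them in an MCP `tools/call` JSON-RPC request. The helpers `build_arguments` and `build_tool_call` are hypothetical names introduced for illustration. In practice an MCP client library would normally build and send this message for you.

```python
import json
from typing import Any

TOOL_NAME = "fetch_and_extract"


def build_arguments(url: str,
                    include_comments: bool = False,
                    include_tables: bool = False) -> dict[str, Any]:
    """Check the arguments against the schema: url is a required string, both flags are booleans."""
    if not isinstance(url, str):
        raise TypeError("url must be a string")
    for name, value in (("include_comments", include_comments),
                        ("include_tables", include_tables)):
        if not isinstance(value, bool):
            raise TypeError(f"{name} must be a boolean")
    return {
        "url": url,
        "include_comments": include_comments,
        "include_tables": include_tables,
    }


def build_tool_call(request_id: int, **kwargs: Any) -> str:
    """Serialize a JSON-RPC 2.0 request that invokes the tool via the MCP tools/call method."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": TOOL_NAME, "arguments": build_arguments(**kwargs)},
    })
```

For example, `build_tool_call(1, url="https://example.com", include_tables=True)` yields a request that asks for table text but leaves comments out, matching the schema defaults for the omitted flag.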

    MCP directory API

    We provide all the information about MCP servers via our MCP API.

    curl -X GET 'https://glama.ai/api/mcp/v1/servers/fvanevski/trafilatura_mcp'

    If you have feedback or need assistance with the MCP directory API, please join our Discord server.
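The curl request above can also be issued from Python with only the standard library. This is a sketch: `server_url` and `get_server_info` are hypothetical helper names, and the response is assumed to be JSON, which the listing does not state.

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://glama.ai/api/mcp/v1/servers"


def server_url(owner: str, name: str) -> str:
    """Build the directory API URL for one server, escaping each path segment."""
    return f"{API_BASE}/{quote(owner, safe='')}/{quote(name, safe='')}"


def get_server_info(owner: str, name: str, timeout: float = 10.0):
    """GET the server record and decode it (assumes the API responds with JSON)."""
    request = urllib.request.Request(server_url(owner, name), method="GET")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

`get_server_info("fvanevski", "trafilatura_mcp")` targets the same URL as the curl command and requires network access.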