webpage-to-markdown
Transform webpage content into markdown format using URL input. Ideal for simplifying web content into readable, structured Markdown with enhanced UTF-8 support.
Instructions
Convert a webpage to markdown
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | URL of the webpage to convert |
Input Schema (JSON Schema)
{
"properties": {
"url": {
"description": "URL of the webpage to convert",
"type": "string"
}
},
"required": [
"url"
],
"type": "object"
}
Implementation Reference
- src/server.ts:47-58 (handler)Handler logic in the CallToolRequest handler for URL-based tools (YouTube, Bing search, webpage-to-markdown): validates the URL argument and delegates to Markdownify.toMarkdown for conversion.case tools.YouTubeToMarkdownTool.name: case tools.BingSearchResultToMarkdownTool.name: case tools.WebpageToMarkdownTool.name: if (!validatedArgs.url) { throw new Error("URL is required for this tool"); } result = await Markdownify.toMarkdown({ url: validatedArgs.url, projectRoot: validatedArgs.projectRoot, uvPath: validatedArgs.uvPath || process.env.UV_PATH, }); break;
- src/tools.ts:49-62 (schema)Input schema definition for the webpage-to-markdown tool, specifying the required 'url' parameter.export const WebpageToMarkdownTool = ToolSchema.parse({ name: "webpage-to-markdown", description: "Convert a webpage to markdown", inputSchema: { type: "object", properties: { url: { type: "string", description: "URL of the webpage to convert", }, }, required: ["url"], }, });
- src/server.ts:31-35 (registration)Tool registration via ListToolsRequestSchema handler, exposing all tools from tools.ts including webpage-to-markdown.server.setRequestHandler(ListToolsRequestSchema, async () => { return { tools: Object.values(tools), }; });
- src/Markdownify.ts:51-92 (helper)Core helper function Markdownify.toMarkdown that handles webpage conversion: fetches HTML from URL, saves to temporary file, processes it using the _markitdown method (which executes markitdown tool), and returns the markdown file path and content.static async toMarkdown({ filePath, url, projectRoot = path.resolve(__dirname, ".."), uvPath = "~/.local/bin/uv", }: { filePath?: string; url?: string; projectRoot?: string; uvPath?: string; }): Promise<MarkdownResult> { try { let inputPath: string; let isTemporary = false; if (url) { const response = await fetch(url); const content = await response.text(); inputPath = await this.saveToTempFile(content); isTemporary = true; } else if (filePath) { inputPath = filePath; } else { throw new Error("Either filePath or url must be provided"); } const text = await this._markitdown(inputPath, projectRoot, uvPath); const outputPath = await this.saveToTempFile(text); if (isTemporary) { fs.unlinkSync(inputPath); } return { path: outputPath, text }; } catch (e: unknown) { if (e instanceof Error) { throw new Error(`Error processing to Markdown: ${e.message}`); } else { throw new Error("Error processing to Markdown: Unknown error occurred"); } } }