Skip to main content
Glama
JDJR2024

Markdownify MCP Server - UTF-8 Enhanced

by JDJR2024

webpage-to-markdown

Transform webpage content into markdown format using URL input. Ideal for simplifying web content into readable, structured Markdown with enhanced UTF-8 support.

Instructions

Convert a webpage to markdown

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
urlYesURL of the webpage to convert

Implementation Reference

  • Handler logic in the CallToolRequest handler for URL-based tools (YouTube, Bing search, webpage-to-markdown): validates the URL argument and delegates to Markdownify.toMarkdown for conversion.
    case tools.YouTubeToMarkdownTool.name:
    case tools.BingSearchResultToMarkdownTool.name:
    case tools.WebpageToMarkdownTool.name:
      if (!validatedArgs.url) {
        throw new Error("URL is required for this tool");
      }
      result = await Markdownify.toMarkdown({
        url: validatedArgs.url,
        projectRoot: validatedArgs.projectRoot,
        uvPath: validatedArgs.uvPath || process.env.UV_PATH,
      });
      break;
  • Input schema definition for the webpage-to-markdown tool, specifying the required 'url' parameter.
    export const WebpageToMarkdownTool = ToolSchema.parse({
      name: "webpage-to-markdown",
      description: "Convert a webpage to markdown",
      inputSchema: {
        type: "object",
        properties: {
          url: {
            type: "string",
            description: "URL of the webpage to convert",
          },
        },
        required: ["url"],
      },
    });
  • src/server.ts:31-35 (registration)
    Tool registration via ListToolsRequestSchema handler, exposing all tools from tools.ts including webpage-to-markdown.
    server.setRequestHandler(ListToolsRequestSchema, async () => {
      return {
        tools: Object.values(tools),
      };
    });
  • Core helper function Markdownify.toMarkdown that handles webpage conversion: fetches HTML from URL, saves to temporary file, processes it using the _markitdown method (which executes markitdown tool), and returns the markdown file path and content.
    static async toMarkdown({
      filePath,
      url,
      projectRoot = path.resolve(__dirname, ".."),
      uvPath = "~/.local/bin/uv",
    }: {
      filePath?: string;
      url?: string;
      projectRoot?: string;
      uvPath?: string;
    }): Promise<MarkdownResult> {
      try {
        let inputPath: string;
        let isTemporary = false;
    
        if (url) {
          const response = await fetch(url);
          const content = await response.text();
          inputPath = await this.saveToTempFile(content);
          isTemporary = true;
        } else if (filePath) {
          inputPath = filePath;
        } else {
          throw new Error("Either filePath or url must be provided");
        }
    
        const text = await this._markitdown(inputPath, projectRoot, uvPath);
        const outputPath = await this.saveToTempFile(text);
    
        if (isTemporary) {
          fs.unlinkSync(inputPath);
        }
    
        return { path: outputPath, text };
      } catch (e: unknown) {
        if (e instanceof Error) {
          throw new Error(`Error processing to Markdown: ${e.message}`);
        } else {
          throw new Error("Error processing to Markdown: Unknown error occurred");
        }
      }
    }
Install Server

Other Tools

Related Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/JDJR2024/markdownify-mcp-utf8'

If you have feedback or need assistance with the MCP directory API, please join our Discord server