docx-to-markdown
Convert DOCX files into Markdown format using the Markdownify MCP Server. Extract and transform content for easy readability and sharing in Markdown.
Instructions
Convert a DOCX file to markdown
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| filepath | Yes | Absolute path of the DOCX file to convert |
Implementation Reference
- src/server.ts:72-86 (handler)Dispatch handler for the docx-to-markdown tool (shared with other file converters). Validates filepath argument and invokes Markdownify.toMarkdown to perform the conversion.case tools.PDFToMarkdownTool.name: case tools.ImageToMarkdownTool.name: case tools.AudioToMarkdownTool.name: case tools.DocxToMarkdownTool.name: case tools.XlsxToMarkdownTool.name: case tools.PptxToMarkdownTool.name: if (!validatedArgs.filepath) { throw new Error("File path is required for this tool"); } result = await Markdownify.toMarkdown({ filePath: validatedArgs.filepath, projectRoot: validatedArgs.projectRoot, uvPath: validatedArgs.uvPath || process.env.UV_PATH, }); break;
- src/tools.ts:96-109 (schema)Input schema definition for the docx-to-markdown tool, requiring an absolute filepath.export const DocxToMarkdownTool = ToolSchema.parse({ name: "docx-to-markdown", description: "Convert a DOCX file to markdown", inputSchema: { type: "object", properties: { filepath: { type: "string", description: "Absolute path of the DOCX file to convert", }, }, required: ["filepath"], }, });
- src/server.ts:33-37 (registration)Registers the docx-to-markdown tool (along with others from tools.ts) for listing via MCP listTools request.server.setRequestHandler(ListToolsRequestSchema, async () => { return { tools: Object.values(tools), }; });
- src/Markdownify.ts:20-52 (helper)Helper function that executes the 'markitdown' CLI tool from the project's .venv using 'uv run' to convert the DOCX file to markdown output.private static async _markitdown( filePath: string, projectRoot: string, uvPath: string, ): Promise<string> { const venvPath = path.join(projectRoot, ".venv"); const markitdownPath = path.join( venvPath, process.platform === "win32" ? "Scripts" : "bin", `markitdown${process.platform === "win32" ? ".exe" : ""}`, ); if (!fs.existsSync(markitdownPath)) { throw new Error("markitdown executable not found"); } // Expand tilde in uvPath if present const expandedUvPath = expandHome(uvPath); // Use execFile to prevent command injection const { stdout, stderr } = await execFileAsync(expandedUvPath, [ "run", markitdownPath, filePath, ]); if (stderr) { throw new Error(`Error executing command: ${stderr}`); } return stdout; }
- src/Markdownify.ts:74-124 (helper)Main conversion helper invoked by the tool handler. For filePath (used by docx-to-markdown), sets inputPath to filepath, calls _markitdown to convert, saves output to temp MD file, and returns path and text.static async toMarkdown({ filePath, url, projectRoot = path.resolve(__dirname, ".."), uvPath = "~/.local/bin/uv", }: { filePath?: string; url?: string; projectRoot?: string; uvPath?: string; }): Promise<MarkdownResult> { try { let inputPath: string; let isTemporary = false; if (url) { const response = await fetch(url); let extension = null; if (url.endsWith(".pdf")) { extension = "pdf"; } const arrayBuffer = await response.arrayBuffer(); const content = Buffer.from(arrayBuffer); inputPath = await this.saveToTempFile(content, extension); isTemporary = true; } else if (filePath) { inputPath = filePath; } else { throw new Error("Either filePath or url must be provided"); } const text = await this._markitdown(inputPath, projectRoot, uvPath); const outputPath = await this.saveToTempFile(text); if (isTemporary) { fs.unlinkSync(inputPath); } return { path: outputPath, text }; } catch (e: unknown) { if (e instanceof Error) { throw new Error(`Error processing to Markdown: ${e.message}`); } else { throw new Error("Error processing to Markdown: Unknown error occurred"); } } }