WebforAI Text Extractor MCP Server

MIT License

Integrations

  • Deploys to Cloudflare Workers for serverless hosting with global distribution of the MCP server.

  • Offers one-click deployment of the MCP server from the GitHub repository.

A Cloudflare Workers-based Model Context Protocol (MCP) server that extracts plain text from web pages using WebforAI.

🌟 What is WebforAI?

WebforAI is a powerful library designed to make web content accessible to AI models. It provides tools to:

  • Convert HTML to clean, structured Markdown
  • Extract meaningful content from web pages
  • Process tables, links, and images intelligently
  • Prepare web content for AI consumption

This MCP server leverages WebforAI's capabilities to extract plain text from any web page URL, making it easy to feed web content into AI models through the Model Context Protocol.

📋 Features

  • Simple API: Extract text from any web page with a single API call
  • Clean Output: Receive well-formatted Markdown text without HTML noise
  • Error Handling: Robust error handling for failed requests
  • Cloudflare Workers: Serverless deployment with global distribution
  • MCP Compatible: Works with any MCP client like Claude Desktop or Cloudflare AI Playground

🚀 Getting Started

Deploy to Cloudflare Workers

Deploying to Cloudflare Workers publishes your MCP server at a URL like: webforai-mcp-server.<your-account>.workers.dev/sse

Local Development

  1. Clone this repository:
    git clone https://github.com/yutakobayashidev/webforai-mcp-server.git
    cd webforai-mcp-server
  2. Install dependencies:
    pnpm install
  3. Start the development server:
    pnpm dev
  4. Your server will be available at http://localhost:8787

🔧 Using the Text Extraction Tool

The extractWebPageText tool accepts the URL of a web page and returns the extracted text content in Markdown format:

{ "url": "https://example.com/page" }
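For readers wiring this up by hand rather than through a client UI, an MCP client generally invokes a tool by sending a JSON-RPC 2.0 `tools/call` request. The sketch below only builds that request body. The helper name `buildExtractRequest` and the request id are illustrative assumptions, not part of this repository, and the SSE session handling is left to whatever MCP client library you use.

```typescript
// Shape of a JSON-RPC 2.0 "tools/call" request, following the MCP specification.
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: { url: string } };
}

// Hypothetical helper (not part of this repository): checks that the input
// looks like an http(s) URL and wraps it in a tools/call request.
function buildExtractRequest(pageUrl: string, id = 1): ToolCallRequest {
  if (!/^https?:\/\/\S+$/i.test(pageUrl)) {
    throw new Error(`Expected an http(s) URL, got: ${pageUrl}`);
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "extractWebPageText", arguments: { url: pageUrl } },
  };
}

const request = buildExtractRequest("https://example.com/page");
console.log(JSON.stringify(request, null, 2));
```

The URL check here is deliberately loose. It only rejects obviously wrong input before the request is sent and does not replace any validation the server itself performs.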

The response will contain the extracted text in Markdown format, with:

  • Links converted to plain text
  • Tables converted to plain text
  • Images hidden
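On the client side, the MCP specification describes a tool result as a list of content parts plus an optional `isError` flag. The following is a minimal sketch of reading such a result, assuming the server returns text parts only. Whether this particular server sets `isError` on failed requests is an assumption, and `readExtractedText` and the sample payload are made up for illustration.

```typescript
// Simplified MCP tool result: text parts only, with an optional error flag.
interface TextContent {
  type: "text";
  text: string;
}
interface ToolResult {
  content: TextContent[];
  isError?: boolean;
}

// Hypothetical helper: joins all text parts with newlines, and throws
// if the tool reported an error instead of returning page text.
function readExtractedText(result: ToolResult): string {
  const text = result.content.map((part) => part.text).join("\n");
  if (result.isError) {
    throw new Error(`Extraction failed: ${text}`);
  }
  return text;
}

// Illustrative payload, not real server output.
const sample: ToolResult = {
  content: [{ type: "text", text: "# Example Domain\n\nMore information..." }],
};
console.log(readExtractedText(sample));
```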

🔌 Connecting to MCP Clients

Cloudflare AI Playground

  1. Go to Cloudflare AI Playground
  2. Enter your deployed MCP server URL (webforai-mcp-server.<your-account>.workers.dev/sse)
  3. You can now use your text extraction tool directly from the playground!

Claude Desktop

To connect to your MCP server from Claude Desktop:

  1. Follow Anthropic's Quickstart
  2. In Claude Desktop go to Settings > Developer > Edit Config
  3. Update with this configuration:
    {
      "mcpServers": {
        "webforaiExtractor": {
          "command": "npx",
          "args": [
            "mcp-remote",
            "http://localhost:8787/sse"
          ]
        }
      }
    }
     For a deployed server, replace the URL with https://webforai-mcp-server.<your-account>.workers.dev/sse. JSON does not allow comments, so keep the config file free of them.
  4. Restart Claude and you should see the text extraction tool become available
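If you switch between the local and deployed endpoints often, generating the configuration avoids hand-editing mistakes. This is a small sketch under that assumption. `buildClaudeConfig` is a hypothetical helper, and the output is meant to be pasted into Claude Desktop's `claude_desktop_config.json`.

```typescript
// Structure of the "mcpServers" section in claude_desktop_config.json.
interface McpServerEntry {
  command: string;
  args: string[];
}
interface ClaudeDesktopConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// Hypothetical helper: builds the config for a given SSE endpoint,
// launching the mcp-remote bridge through npx as in the steps above.
function buildClaudeConfig(sseUrl: string): ClaudeDesktopConfig {
  return {
    mcpServers: {
      webforaiExtractor: { command: "npx", args: ["mcp-remote", sseUrl] },
    },
  };
}

// Local development endpoint; swap in your workers.dev URL once deployed.
console.log(JSON.stringify(buildClaudeConfig("http://localhost:8787/sse"), null, 2));
```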

📚 Learn More

📄 License

MIT
