Skip to main content
Glama
213,524 tools. Last updated 2026-06-19 18:25

"Tools and methods for extracting HTML content from websites" matching MCP tools:

  • Retrieve raw HTML from anti-bot or JavaScript-heavy websites. Fully renders the page using Web Unblocker or Chromium to bypass blocks and execute JavaScript.
    MIT
  • Convert webpages into clean, formatted markdown for extracting content from documentation, articles, and web pages. Fetches any webpage and transforms HTML content into readable markdown format.
    MIT
  • Fetch raw HTML content from any URL with optional JavaScript rendering for dynamic websites and Single Page Applications.
    MIT
  • Search the web for current information, news, articles, and websites to find up-to-date content, research topics, or answer questions about recent events.
    Apache 2.0
  • Extract web page content and convert it to clean, readable markdown format for analysis, bypassing paywalls and obtaining structured text data from websites.
    Apache 2.0
  • Extract plain text content from websites by fetching URLs and converting HTML to readable text with configurable length and starting point.
    MIT

Matching MCP Servers

  • A
    license
    -
    quality
    C
    maintenance
    Provides MCP tool adapters for Bioconductor methods like limma, DESeq2, and fgsea, enabling statistical analysis of omics data through containerized R execution. It serves as a bridge between MCP clients and bioinformatics tools for reproducible research workflows.
    Last updated
    Apache 2.0
  • A
    license
    B
    quality
    B
    maintenance
    Extract content from URLs, documents, videos, and audio files using intelligent auto-engine selection. Supports web pages, PDFs, Word docs, YouTube transcripts, and more with structured JSON responses.
    Last updated
    1
    160
    MIT

Matching MCP Connectors

  • GOV.UK Content + Search APIs (every gov.uk page + full search)

  • Transform any blog post or article URL into ready-to-post social media content for Twitter/X threads, LinkedIn posts, Instagram captions, Facebook posts, and email newsletters. Pay-per-event: $0.07 for all 5 platforms, $0.03 for single platform.

  • Extract structured financial data from investor relations websites and online sources for investment research when APIs are unavailable.
    MIT
  • Render websites to images, PDFs, HTML, or markdown with full control over viewport, content blocking, and metadata extraction.
    MIT
  • Generate a PDF from HTML content. Accepts HTML input and returns base64-encoded PDF bytes, enabling automated document creation from HTML templates.
    MIT
  • Extract raw HTML from any URL for data extraction, content analysis, or price monitoring. Automatically handles anti-bot protection.
    MIT
  • Fetch web content from any URL and save it directly to a file in your workspace, with options for raw HTML, cleaned HTML, or Markdown formats.
    MIT
  • Retrieve plain text or HTML content from DEVONthink records using UUID. Returns text for text-based records, HTML for web content, or null for binary files like PDFs and images.
    MIT
  • Retrieve complete HTML component details including HTML, CSS, and JavaScript content from Circuitry's visual workflow platform for integration and analysis.
  • Extract web content and convert it to clean Markdown for reading documentation, analyzing content, and gathering information from websites while preserving links and structure.
    MIT
  • Retrieve raw XML or HTML content from XBRL filings using XPath queries for advanced extraction when standard tools are insufficient.
    Apache 2.0
  • Retrieve web page content from any URL and process it into raw HTML, cleaned HTML, or readable Markdown format for analysis and integration.
    MIT
  • Remove content items from LightCMS websites while preserving restoration options. This tool soft-deletes content and removes associated static HTML pages, allowing for content recovery when needed.
    MIT
  • Convert HTML content to clean Markdown optimized for LLM processing, extracting main content and resolving links for better AI analysis.
    MIT