Glama
261,376 tools. Last updated 2026-07-05 12:09

MCP tools matching "A tool for extracting text from a webpage after crawling it":

  • Convert webpages into clean, formatted markdown for extracting content from documentation, articles, and web pages. Fetches any webpage and transforms HTML content into readable markdown format.
    MIT
  • Retrieve the original tool output from a flattened session by providing the session ID and tool use ID found in the [FLATTENED] marker. Returns the full text or image.
    MIT
  • Initiates an asynchronous crawl of a website, extracting content from multiple pages. Use for comprehensive site coverage; monitor progress with returned operation ID.
    MIT
  • Ingest a document file locally by extracting its text and storing it as a recallable entity in memory. Supports plaintext, markdown, and structured text; DOCX and PDF require a specific build.
    MIT

Matching MCP Servers

Matching MCP Connectors

  • Manage your Canvas coursework with quick access to courses, assignments, and grades. Track upcomin…

  • Search the AI Tool Directory catalog: tool details, status checks (alive/acquired/deceased + cause and date), alternatives, and side-by-side comparisons. Read-only.

  • Extract webpage content into Markdown format, bypassing bot detection and CAPTCHA protections for reliable data collection.
    MIT
  • Extract clean, readable content from any webpage as Markdown, JSON, or HTML. Captures content after JavaScript runs, stripping ads and boilerplate for LLM input or summarization.
    MIT
  • Retrieve webpage metadata like title, description, and status code to assess accessibility and basic information without extracting full content. Ideal for quick page checks in web scraping workflows.
    MIT
  • Capture a screenshot of any webpage and return it as a PNG image. Specify the URL and optional geolocation to control request location.
    ISC
  • Analyze a website's robots.txt file to determine crawl permissions and ensure compliance with ethical web scraping practices. Provides insights into allowed and disallowed paths for crawling.
    MIT
  • Import a software tool by providing its URL. The name and description are auto-detected from the webpage, with optional overrides.
    Apache 2.0
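
The robots.txt analysis entry above describes checking which paths a crawler may fetch. A minimal sketch of that idea, using Python's standard `urllib.robotparser`, might look like the following. The function name `crawl_permissions` and the sample robots.txt content are hypothetical, chosen for illustration only; this is not the listed tool's implementation.

```python
from urllib.robotparser import RobotFileParser


def crawl_permissions(robots_txt: str, paths: list[str], user_agent: str = "*") -> dict[str, bool]:
    """Map each path to True if the given user agent may fetch it.

    Hypothetical helper: parses robots.txt text that has already been
    downloaded, so no network access happens here.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {path: parser.can_fetch(user_agent, path) for path in paths}


# Made-up robots.txt content for illustration.
SAMPLE = """\
User-agent: *
Disallow: /private/
Allow: /
"""

result = crawl_permissions(SAMPLE, ["/docs/index.html", "/private/data.html"])
print(result)
```

In a real crawl the robots.txt text would be fetched from the target site first, and the check would run before each page request.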