Skip to main content
Glama
206,060 tools. Last updated 2026-06-17 10:17

"Tools for extracting structured data from web pages for LLM use" matching MCP tools:

  • Extract structured data from web pages using LLM capabilities. Define specific information to retrieve with custom prompts and JSON schemas for organized output.
  • Perform AI-powered web searches to extract structured data from search results for research, competitive analysis, and multi-source information gathering.
    MIT
  • Extract structured metadata from web pages: JSON-LD, OpenGraph, microdata. Retrieve fields like price, rating, author, and date for analysis.
    MIT
  • Returns a structured research prompt for pharmacogenomic data when no local study exists. Use web search then save results via save_drug_research.
    MIT

Matching MCP Servers

Matching MCP Connectors

  • A fully autonomous, Agent-to-Agent (A2A) patent data marketplace powered by the Model Context Protocol (MCP) and A2A standards. This server provides highly structured, AI-optimized JSON patent datasets curated for autonomous R&D agents, LLMs, and Quants. Currently exclusively hosting AI-ready patents from IPC/CPC Sections G (Physics & Computing) and H (Electricity).

  • Autonomous A2A marketplace providing AI-ready, structured USPTO patent JSON datasets. Features IPC/CPC Sections G (Physics/Computing, e.g., G01 Sensors, G06 AI/ML) and H (Electricity, e.g., H01 Semiconductors, H04 5G). Enables instant M2M data delivery via automated on-chain payment verification. Networks: Base (USDC), Polygon (USDC), Oasis (ROSE).

  • Convert webpages into clean, formatted markdown for extracting content from documentation, articles, and web pages. Fetches any webpage and transforms HTML content into readable markdown format.
    MIT
  • Retrieve a structured summary of a browser session, including pages visited, interaction counts, and error counts. Use this for an overview before inspecting specific console or network errors.
    Apache 2.0
  • Add LPM packages to your project by extracting source files for customization. Use for UI components, blocks, templates, and MCP servers.
    ISC
  • Extract web page content and convert it to clean, readable markdown format for analysis, bypassing paywalls and obtaining structured text data from websites.
    Apache 2.0
  • Extract structured data from web pages using JSON schemas or natural language prompts, automatically bypassing bot protection when detected.
    AGPL 3.0
  • Execute web searches to retrieve structured information for AI agents, delivering LLM-optimized results with reduced token usage.
    AGPL 3.0
  • Extract structured data from web pages by supplying URLs and a prompt, with options for web search and retries.
    MIT
  • Extract content from multiple website pages by starting an asynchronous crawl job to comprehensively gather data across related webpages.
    MIT
  • Initiates an asynchronous crawl of a website, extracting content from multiple pages. Use for comprehensive site coverage; monitor progress with returned operation ID.
    MIT
  • Extract content from multiple pages on a website by starting a crawl job. Use to comprehensively gather data from related pages with configurable depth and limits.
    MIT
  • Autonomously browses the web to find and extract structured data from multiple sources based on your natural language query, handling complex research tasks across the internet.