206,060 tools. Last updated 2026-06-17 10:17

"Tools for extracting structured data from web pages for LLM use" matching MCP tools:

firecrawl_extract (grade A)
Firecrawl MCP Server
Extract structured data from web pages using LLM capabilities. Define specific information to retrieve with custom prompts and JSON schemas for organized output.
brave_llm_context (grade A)
Brave Search MCP Server
Retrieve relevance-ranked web content with actual page text, tables, and code for AI grounding and RAG pipelines.
searchscraper (grade A)
ScrapeGraph MCP Server
Perform AI-powered web searches to extract structured data from search results for research, competitive analysis, and multi-source information gathering.
MIT
smartcrawler_initiate (grade A)
ScrapeGraph MCP Server
Start multi-page web crawling to extract structured data with AI or convert content to markdown from a starting URL.
MIT
extract_structured (grade A)
free-search-mcp
Extract structured metadata from web pages: JSON-LD, OpenGraph, microdata. Retrieve fields like price, rating, author, and date for analysis.
MIT
search_drug_pgx (grade A)
openpgx
Returns a structured research prompt for pharmacogenomic data when no local study exists. Use web search then save results via save_drug_research.
MIT

Matching MCP Servers

MCP Web Tools Server
Browser Automation, Search, Web Scraping
JoaoPedroLanca
license: F
quality: D
maintenance: D
Provides web connectivity tools for searching the web via DuckDuckGo or SerpAPI, fetching URL content, and extracting readable text from web pages.
Last updated 2025-11-02
3
1
Structured-sh (official)
Knowledge & Memory, Databases
structured-sh
license: A
quality: -
maintenance: B
MCP server providing managed persistent memory for AI agents. Read and write structured state across sessions, tools, and restarts at 1000+ requests per second, with no infrastructure to self-host or operate.
Last updated 2026-04-09
2
Apache 2.0

Matching MCP Connectors

Mirelia-Structured-Data-Marketplace
A fully autonomous, Agent-to-Agent (A2A) patent data marketplace powered by the Model Context Protocol (MCP) and A2A standards. This server provides highly structured, AI-optimized JSON patent datasets curated for autonomous R&D agents, LLMs, and Quants. Currently exclusively hosting AI-ready patents from IPC/CPC Sections G (Physics & Computing) and H (Electricity).
Mirelia-Structured-Data-Marketplace
Autonomous A2A marketplace providing AI-ready, structured USPTO patent JSON datasets. Features IPC/CPC Sections G (Physics/Computing, e.g., G01 Sensors, G06 AI/ML) and H (Electricity, e.g., H01 Semiconductors, H04 5G). Enables instant M2M data delivery via automated on-chain payment verification. Networks: Base (USDC), Polygon (USDC), Oasis (ROSE).

markdownify (grade A)
ScrapeGraph MCP Server
Convert webpages into clean, formatted markdown for extracting content from documentation, articles, and web pages. Fetches any webpage and transforms HTML content into readable markdown format.
MIT
get_session_summary (grade A)
peek
Retrieve a structured summary of a browser session, including pages visited, interaction counts, and error counts. Use this for an overview before inspecting specific console or network errors.
Apache 2.0
parallel_read_url (grade A)
Jina AI Remote MCP Server
Extract clean content from multiple web pages simultaneously to compare information across sources or gather data from several pages at once.
Apache 2.0
lpm_add (grade A)
@lpm-registry/mcp-server
Add LPM packages to your project by extracting source files for customization. Use for UI components, blocks, templates, and MCP servers.
ISC
read_url (grade A)
Jina AI Remote MCP Server
Extract web page content and convert it to clean, readable markdown format for analysis, bypassing paywalls and obtaining structured text data from websites.
Apache 2.0
extract (grade A)
webclaw
Extract structured data from web pages using JSON schemas or natural language prompts, automatically bypassing bot protection when detected.
AGPL 3.0
search (grade B)
webclaw
Execute web searches to retrieve structured information for AI agents, delivering LLM-optimized results with reduced token usage.
AGPL 3.0
firecrawlExtract (grade C)
Emblem AI
Extract structured data from web pages by supplying URLs and a prompt, with options for web search and retries.
MIT
firecrawl_crawl (grade A)
Firecrawl MCP Server
Extract content from multiple website pages by starting an asynchronous crawl job for comprehensive coverage of related content.
MIT
firecrawl_crawl (grade A)
Firecrawl MCP Server
Extract content from multiple website pages by starting an asynchronous crawl job to comprehensively gather data across related webpages.
MIT
firecrawl_crawl (grade A)
Firecrawl MCP Server
Initiates an asynchronous crawl of a website, extracting content from multiple pages. Use for comprehensive site coverage; monitor progress with returned operation ID.
MIT
firecrawl_crawl (grade A)
Firecrawl MCP Server
Extract content from multiple pages on a website by starting a crawl job. Use to comprehensively gather data from related pages with configurable depth and limits.
MIT
firecrawl_agent (grade A)
Firecrawl MCP Server
Autonomously browses the web to find and extract structured data from multiple sources based on your natural language query, handling complex research tasks across the internet.