Glama
126,802 tools. Last updated 2026-05-05 05:52

"How to fetch or scrape data from a website for use in training an LLM" matching MCP tools:

  • Initiate a partnerships handoff for design partner, ecosystem, training, or advisory conversations requiring human review. Provide the reason, organization, role, and website to trigger operator review.
    MIT
  • Use an LLM to extract structured fields from crawled pages. Define fields or let the LLM auto-discover by sampling. Results saved to extracted.jsonl. Ideal for competitive research, API analysis, and dataset creation.
  • Scrape B2B leads from Apollo.io by submitting a search URL. Returns a runId to check status later. Use webhooks for async delivery instead of polling.
    MIT
  • Retrieve website HTML content directly from URLs for web scraping, data extraction, or content analysis purposes.
    MIT
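Several of the tools above share the same underlying pattern: fetch a page's raw HTML from a URL, then extract the content you need. A minimal sketch of that pattern using only the Python standard library (the helper names and the parsing rules here are illustrative, not part of any listed tool's API):

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> blocks."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_text(html: str) -> str:
    """Strip markup from an HTML string, returning visible text only."""
    parser = TextExtractor()
    parser.feed(html)
    return " ".join(parser.chunks)


def fetch_text(url: str) -> str:
    """Fetch a page and reduce it to plain text (network call)."""
    with urlopen(url) as resp:
        return extract_text(resp.read().decode("utf-8", errors="replace"))
```

In practice an MCP scraping tool would handle the fetch step server-side; the extraction step is where most cleanup for LLM training data happens (dropping scripts, styles, and navigation chrome).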

Matching MCP Connectors

  • Fetch web pages and extract exactly the content you need. Select elements with CSS and retrieve co…

  • Improve security writing, score it against rubrics, and plan incident response (IR) and product strategy.

  • Convert website content to Markdown format by fetching URLs, enabling structured extraction of web data for documentation or analysis.
    MIT
  • Extract content from multiple pages on a website by starting a crawl job. Use to comprehensively gather data from related pages with configurable depth and limits.
    MIT
  • Crawl a website to gather content from multiple pages. Returns a job ID for async polling. Best for whole-site extraction; for single pages use scrape, for URL discovery use map.
  • Extract content from multiple website pages by starting an asynchronous crawl job to comprehensively gather data across related webpages.
    MIT
  • Track how LLM brand perception changes over time. Analyze up to 200 data points per query, with optional provider filter for OpenAI, Claude, or Gemini. Ideal for time-series analysis of AI visibility.
    MIT
  • Extract content from multiple website pages by starting a crawl job. Use for comprehensive coverage of related pages, with options to control depth and scope.
  • Discover all indexed URLs on a website to identify pages for scraping or locate specific content when scrape results are incomplete.
  • Discover all indexed URLs on a website to identify pages for scraping or locate specific sections. Returns an array of found URLs.
    MIT
  • Fine-tune an LLM on a GitHub repository to learn code patterns and conventions. Choose a training agent: Cody for code autocomplete or SIERA for bug-fix specialization.
    MIT
  • Discover all indexed URLs on a website to identify pages for scraping or locate specific site sections. Returns a list of found URLs.
    MIT
  • Monitor situations by scraping web content, analyzing with an LLM, and cross-referencing prediction markets for insights. Use for scheduled or one-shot URL ingestion with optional analysis and market enrichment.
    MIT