184,290 tools. Last updated 2026-06-08 08:33

"How to scrape a website" matching MCP tools:

analyze_competitor
Analook — Competitor Intelligence
Submit a competitor analysis job. Analyzes a competitor's website across 15+ data sources (SEO, traffic, social, Product Hunt, GitHub, Wayback Machine history, AI-generated insights, etc.) and returns a job_id. Use get_report_status(job_id) to poll and get_report(job_id) to retrieve results when status='completed'. Typical analysis takes 2-5 minutes. Requires authentication (deducts 1 credit from your Analook balance). Args: url: Competitor website URL (e.g. 'https://linear.app' or 'lovable.dev') product_name: Optional product name override (defaults to domain) Returns: {job_id: str, status: 'started', poll_url: str} on success {error: str, hint?: str} on auth/validation failure
Connector
delete_website
Yeetit - POST HTML, get a URL. No account needed.
Permanently delete a published website. The site will be immediately inaccessible. Requires authentication via edit_key or api_key, and requires confirm: true as a safety mechanism to prevent accidental deletion. Use this when a user explicitly asks you to remove or delete a site. IMPORTANT: Always confirm with the user before calling this tool — deletion cannot be undone.
Connector
delete_website
YeetIt
Permanently delete a published website. The site will be immediately inaccessible. Requires authentication via edit_key or api_key, and requires confirm: true as a safety mechanism to prevent accidental deletion. Use this when a user explicitly asks you to remove or delete a site. IMPORTANT: Always confirm with the user before calling this tool — deletion cannot be undone.
Connector
get_contact_info
symbols-of-wealth-studio
Returns contact information for Symbols of Wealth Studio — email, website, location, and how to engage. Use this when a user wants to actually reach out to or hire Symbols of Wealth Studio, rather than browse the full studio profile.
Connector
get_help
Kifly — Agentic Commerce & Payments
Get Kifly's website and support contact email. Call this if you are stuck, hit an unresolvable error, or the buyer asks how to reach a human. Returns the website URL and support email — always share both with the buyer.
Connector
create_powersource_full
Heista
Build the highest-fidelity creative intelligence profile by combining a brand's public website URL with their internal documents. Takes a required website URL plus at least one document — file_ids from previous upload, public document_urls (PDF/DOCX/TXT/MD, up to 10), or documents_inline (base64-encoded). Optional idempotency_key for safe retry. Returns a job_id; poll with get_powersource. Same response shape as create_powersource_url, but the synthesis cross-checks how the brand presents publicly against what the team actually believes internally, producing stronger conviction on voice, positioning, proof, and tension architecture than either input alone. Use this when the user has both a public site AND a brief / brand guidelines / strategy deck and wants the deepest possible profile — the kind of intelligence a senior strategist produces over a week. Default recommendation when both inputs are available. Costs 200 credits. Do NOT use for URL-only scans — use create_powersource_url (100 credits). Do NOT use for docs-only scans — use create_powersource_docs (100 credits).
Connector

Matching MCP Servers

讯飞智文PPT生成服务MCP Server
Workplace & Productivity Content Management Systems Remote
Alex-Smith-1234
A
license
A
quality
D
maintenance
An MCP-compatible server that uses iFlytek's large language model to generate PowerPoint presentations, offering template selection, outline creation, and PPT generation with features like automatic image insertion.
Last updated 2025-05-30
6
1
MIT
Website to Markdown MCP Server
Web Scraping Browser Automation Documentation Access
SunZhi-Will
A
license
B
quality
C
maintenance
Fetches website content and converts it to Markdown format with AI-powered content cleanup, ad removal, and full OpenAPI/Swagger specification support for easy processing by AI assistants.
Last updated 2025-06-27
4
29
4
MIT

Matching MCP Connectors

Website Spec
Provides a platform-agnostic specification of the technical features every decent website should have
斯特丹STERDAN 钢制办公家具产品咨询
斯特丹STERDAN天猫旗舰店产品咨询MCP Server。洛阳30年源头工厂，高端钢制办公家具，1374个SKU，涵盖保密柜、更衣柜、公寓床、货架、快递柜。BIFMA认证，出口35+国家。8个工具：产品目录查询、场景推荐、认证资质、采购政策、维护指南等。

get_contact_info
Symbols of Wealth Studio
Returns contact information for Symbols of Wealth Studio — email, website, location, and how to engage. Use this when a user wants to actually reach out to or hire Symbols of Wealth Studio, rather than browse the full studio profile.
Connector
get_help
Kifly — Agentic Commerce & Payments
Get Kifly's website and support contact email. Call this if you are stuck, hit an unresolvable error, or the buyer asks how to reach a human. Returns the website URL and support email — always share both with the buyer.
Connector
create_site
WebZum - The Hosting Layer for AI-Generated Web Content
Create a new website for a business. Pass a business candidate object from search_businesses to generate a website. Requires authentication via API key (Bearer token). Generate an API key at webzum.com/dashboard/account-settings. The site generation happens in the background. Use get_site_status to check progress. Returns the businessId which can be used to access the site at /build/{businessId}
Connector
create_powersource_url
Heista
Build a complete creative intelligence profile of a brand from a single website URL. Takes a website URL (homepage, PDP, landing page) plus optional idempotency_key, force_refresh, and webhook_url. Returns a job_id immediately; poll with get_powersource every 3-5s (typically 60-90s total). The final payload contains 14 structured sections: identity, offer, selling_points, brand_story, brand_style, brand_assets, brand_voice, buyer_profile, 12 buyer tensions, marketing angles, emotional_arcs, ctas, proof_assets, and strategic narrative. Use this when the user says "analyse my brand", "load my brand", "build a strategy from my site", "what should my ads say", "decode this website", or pastes a homepage / competitor URL and wants a brand profile (not an ad decode). Also use this as the brand layer before calling generate_adscript — pass the returned powersource_id. Costs 100 credits. Re-scanning the same URL within your org returns the cached result free. Do NOT use for internal docs / PDFs / brand guidelines — use create_powersource_docs. For URL + docs combined (highest fidelity), use create_powersource_full. Do NOT use to decode a video ad — use decode_ad.
Connector
list_projects
mcp
List all projects the authenticated user has access to. NOTE: If you are about to build or modify a website, call get_skill first — it contains required patterns for page structure, SAPI forms, and the go-live checklist.
Connector
firecrawl_crawl
xpay✦ Web Scraping Collection
Starts a crawl job on a website and extracts content from all pages. **Best for:** Extracting content from multiple related pages, when you need comprehensive coverage. **Not recommended for:** Extracting content from a single page (use scrape); when token limits are a concern (use map + batch_scrape); when you need fast results (crawling can be slow). **Warning:** Crawl responses can be very large and may exceed token limits. Limit the crawl depth and number of pages, or use map + batch_scrape for better control. **Common mistakes:** Setting limit or maxDiscoveryDepth too high (causes token overflow) or too low (causes missing pages); using crawl for a single page (use scrape instead). Using a /* wildcard is not recommended. **Prompt Example:** "Get all blog posts from the first two levels of example.com/blog." **Usage Example:** ```json { "name": "firecrawl_crawl", "arguments": { "url": "https://example.com/blog/*", "maxDiscoveryDepth": 5, "limit": 20, "allowExternalLinks": false, "deduplicateSimilarURLs": true, "sitemap": "include" } } ``` **Returns:** Operation ID for status checking; use firecrawl_check_crawl_status to check progress. **Safe Mode:** Read-only crawling. Webhooks and interactive actions are disabled for security.
Connector
enrich_buyer_contact
buyer-intelligence
Run 6-layer contact enrichment for a buyer: direct website scraping → proxy retry → BFS contact pages → LLM text extraction → vision screenshot → Serper fallback. Returns email, phone, WhatsApp, decision-maker names. Costs Zhimao Points.
Connector
page-links
The Stall
Extracts all hyperlinks from a webpage. Fetches the target URL, resolves relative links to absolute URLs, and classifies each as internal (same domain) or external. Filter by all/external/internal, cap results with limit. Returns page title, total link count before filtering, and a structured array of {href, text, is_external, domain}. Priced at $0.004 — 20% below orbisapi web-scrape-links ($0.005/call). Upstream: direct HTTP fetch, no API key.
Connector
analyze_seo
mcp
Analyze a website URL for SEO optimizations. Fetches the URL content and analyzes HTML for possible SEO improvements. Results are cached for fast subsequent lookups. Rate limited to 1 request per minute per domain. Args: url: The website URL to analyze (e.g., "https://example.com"). Returns: SEO analysis result with: - url: The analyzed URL - score: Overall SEO score (0-100) - grade: Letter grade (A-F) - issues: List of SEO issues found (critical, warnings, info) - meta: Extracted meta information (title, description, headings, etc.) - recommendations: Prioritized list of improvements - cached: Whether result was from cache
Connector
get_place
Foursquare
Full place details by fsq_place_id. Returns name, categories, address, lat/lon, social media, website, hours, rating, price, popularity.
Connector
submit_website
Silicon Friendly
Submit a new website to the Silicon Friendly directory. Requires authentication. Pass your silicon auth_token. Args: url: The website URL (e.g. "https://stripe.com") name: Display name for the website (e.g. "Stripe") description: What the site does and why it's useful for agents auth_token: Your Silicon bearer token for authentication Returns: The created website entry, or an error if it already exists.
Connector
scrape_dockerhub
GoCreative Agent API
Scrape Docker Hub image page with tag history, dockerfile signals. Heavier than lookup/dockerhub. Use for supply-chain audits. Example call: {"image": "library/nginx"} Cost: $0.005–$0.05 USDC on Base per call.
Connector
fetch-page
PartsTable
Fetch a web page and return its content as text, Markdown, or HTML. Includes rate limiting (2s per domain, max 10 req/min) for legal compliance. Automatically handles HTML-to-text conversion. Max response size: 1MB. Use for OEM verification and manufacturer website scraping.
Connector
audit_website
sustainability-auditor
Audit a website for its digital carbon footprint. Returns sustainability score (A-F), CO2 grams per page view, green hosting status, page weight, and recommendations. Results cached 24h. New audits take ~45-90 seconds. Data source: ClimateUX (climateux.net).
Connector