Skip to main content
Glama
21,811 servers. Last updated

Matching MCP tools:

Matching MCP Connectors:

"Web scraping and crawling tools" matching MCP servers:

  • A
    security
    F
    license
    A
    quality
    Provides local web search and content fetching capabilities for AI assistants, enabling them to search DuckDuckGo and extract clean text from web pages. All requests originate from the user's machine to ensure direct network control and bypass external proxies.
    Last updated
    2
  • A
    security
    A
    license
    A
    quality
    Comprehensive web research toolkit with 13 tools for searching (via SearXNG), crawling, package discovery, GitHub metrics, error translation, API documentation lookup, data extraction, technology comparison, and service status checking.
    Last updated
    13
    9
    MIT
  • A
    security
    A
    license
    B
    quality
    A comprehensive web scraping server that transforms web content into clean, agent-ready Markdown with automatic citations and efficient caching. It features a robust suite of tools for metadata extraction, sentiment analysis, SEO auditing, and security scanning while strictly adhering to robots.txt policies.
    Last updated
    48
    2
    10
    MIT
  • -
    security
    F
    license
    -
    quality
    Enables web scraping and crawling capabilities for LLM clients, supporting single-page scraping, multi-page website crawling, and web search with multiple engines (Playwright, Cheerio, Puppeteer) and flexible output formats including markdown, HTML, text, and screenshots.
    Last updated
    3
    6
  • -
    security
    A
    license
    -
    quality
    A headless web scraping server that extracts main content from web pages into Markdown, text, or HTML for AI and automation integration. It features per-domain rate limiting and robust error handling using Playwright and BeautifulSoup.
    Last updated
    MIT
  • A
    security
    F
    license
    A
    quality
    Enables retrieval and cleaning of official documentation content for popular AI/Python libraries (uv, langchain, openai, llama-index) through web scraping and LLM-powered content extraction. Uses Serper API for search and Groq API to clean HTML into readable text with source attribution.
    Last updated
    1
    1
  • -
    security
    F
    license
    -
    quality
    An advanced web search and scraping server that enables AI models to perform targeted DuckDuckGo searches and extract clean content, tables, and metadata from webpages. It provides specialized tools for news discovery, link extraction, and comprehensive search-and-scrape workflows.
    Last updated
    • Apple
  • A
    security
    F
    license
    B
    quality
    Enables web scraping, React app testing, and React Native web app inspection using Playwright with multi-browser support. Provides backward compatibility with regular websites while offering enhanced features for React applications including mobile viewport emulation and component analysis.
    Last updated
    10
  • A
    security
    F
    license
    B
    quality
    Enables intelligent web searching using SearXNG with content crawling via Creeper, then summarizes webpage content using LLM to avoid token limit issues. Supports smart filtering with domain blacklist/whitelist and optional LLM-based relevance filtering.
    Last updated
    1
  • -
    security
    A
    license
    -
    quality
    Web scraping MCP server for Al agents. 6 tools: extract clean text/markdown from any URL, structured scraping with CSS selectors, full-page screenshots via Playwright, link extraction with regex filtering, metadata extraction (OG tags, Twitter cards), and Google search. Free tier: 50 requests/IP/day.
    Last updated
    3
    MIT
  • A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that enables web scraping, crawling, and content extraction capabilities through integration with Firecrawl.
    Last updated
    8
    30,761
    MIT
    • Apple
  • -
    security
    F
    license
    -
    quality
    An intelligent web crawling server that uses Cloudflare's headless browser to render dynamic pages and Workers AI to extract relevant links based on natural language queries. It enables AI assistants to search and filter website content while providing secure access through GitHub OAuth authentication.
    Last updated
    3
  • A
    security
    A
    license
    A
    quality
    Integrates Firecrawl web scraping capabilities including scraping, crawling, searching, extracting structured data, deep research, and batch processing with support for both cloud and self-hosted instances.
    Last updated
    10
    30,761
    2
    MIT
  • -
    security
    A
    license
    -
    quality
    Integrates Firecrawl for web scraping, crawling, search, and content extraction capabilities. Supports single/batch scraping, URL discovery, structured data extraction, deep research, and AI-powered web analysis with automatic retries and rate limiting.
    Last updated
    30,761
    MIT
    • Apple
  • -
    security
    A
    license
    -
    quality
    An MCP server that provides web scraping and Bing search capabilities, supporting both static and dynamic content through Puppeteer. It allows AI assistants to extract page summaries, SEO metadata, and full text with automatic detection for headless browser rendering.
    Last updated
  • A
    security
    A
    license
    A
    quality
    Description: An MCP server with 15 tools covering web search, scraping, extraction, crawling, and autonomous data gathering via the SearchClaw API. Tagline: "The complete web data pipeline for AI agents โ€” Search, Extract, Crawl in One API."
    Last updated
    15
    5
    1
    MIT
  • A
    security
    A
    license
    A
    quality
    Enables AI assistants to access real-time web data through search, markdown scraping, and browser automation while bypassing anti-bot protections. It provides tools for web research, e-commerce monitoring, and data extraction from across the globe.
    Last updated
    4
    3,498
    5
    MIT
  • -
    security
    F
    license
    -
    quality
    Provides web scraping, crawling, and site mapping capabilities by connecting directly to a self-hosted Firecrawl instance. It enables users to extract website content in markdown format and manage crawling jobs without an external API key.
    Last updated
  • A
    security
    A
    license
    B
    quality
    Enables LLMs and AI agents to access real-time web data, search websites, and navigate the web without getting blocked. Includes 5,000 free monthly requests and supports web scraping, browser automation, and bypassing geo-restrictions.
    Last updated
    60
    3,498
    1
    MIT
    • Apple
    • Linux
  • -
    security
    A
    license
    -
    quality
    Enables comprehensive web searching and content extraction using multiple search engines (Bing, Brave, DuckDuckGo) without API keys. Provides tools for full web searches with content extraction, quick search summaries, and single webpage content retrieval.
    Last updated
    774
    MIT
  • -
    security
    A
    license
    -
    quality
    A high-performance search service that converts results from Google, Bing, and DuckDuckGo into structured JSON or Markdown. It features multi-layer depth crawling and uses the Camoufox anti-detection browser for reliable content extraction and fallback search logic.
    Last updated
    39
    MIT
  • A
    security
    F
    license
    B
    quality
    Enables intelligent web document scraping and conversation memory management for Cursor IDE. Supports multi-level link crawling, automatic content extraction, and organized storage of conversations and technical documentation.
    Last updated
    3
    1
  • A
    security
    A
    license
    A
    quality
    Web scraping, crawling, and structured data extraction for AI agents. 5 tools: scrape (clean markdown from any URL), crawl (entire sites), map (discover URLs), extract (structured JSON), and search. 833ms avg latency, single binary, self-hostable.
    Last updated
    4
    48
    AGPL 3.0
  • -
    security
    F
    license
    -
    quality
    Provides AI agents with real-time web capabilities including live search results, markdown web scraping, business lead generation, and detailed company information. It enables agents to bypass knowledge cutoffs by accessing current web data through a monetized Apify Actor.
    Last updated
  • -
    security
    F
    license
    -
    quality
    ๐Ÿ“‡ โ˜๏ธ - Capture web pages as cryptographically signed, tamper-evident evidence. Ed25519 signatures, RFC 3161 timestamps, and WACZ archives. Four tools: capture_url, get_capture, list_captures, verify_capture.
    Last updated
    1
    1