Search for:

Tools for Web Crawling

  • Why this server?

    This server provides tools to scrape websites and extract structured data from them using Firecrawl's APIs, supporting both basic website scraping in multiple formats and custom schema-based data extraction, making it suitable as a '爬虫工具'.

    A
    security
    F
    license
    A
    quality
    A server that provides tools to scrape websites and extract structured data from them using Firecrawl's APIs, supporting both basic website scraping in multiple formats and custom schema-based data extraction.
    2
    JavaScript
  • Why this server?

    A Model Context Protocol (MCP) server implementation that integrates with FireCrawl for advanced web scraping capabilities. As such, it directly addresses the need for a '爬虫工具'.

    A
    security
    A
    license
    A
    quality
    A Model Context Protocol (MCP) server implementation that integrates with FireCrawl for advanced web scraping capabilities.
    9
    7,623
    2,584
    JavaScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    This server enables AI systems to integrate with Tavily's search and data extraction tools, providing real-time web information access, acting as a '爬虫工具'.

    A
    security
    A
    license
    A
    quality
    This server enables AI systems to integrate with Tavily's search and data extraction tools, providing real-time web information access and domain-specific searches.
    2
    2,232
    164
    JavaScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    An MCP (Model Context Protocol) server that provides Google search capabilities and webpage content analysis tools. This server enables AI models to perform Google searches and analyze webpage content programmatically, which makes it a suitable '爬虫工具'.

    A
    security
    F
    license
    A
    quality
    An MCP (Model Context Protocol) server that provides Google search capabilities and webpage content analysis tools. This server enables AI models to perform Google searches and analyze webpage content programmatically.
    3
    15
    28
    TypeScript
  • Why this server?

    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown, effectively serving as a '爬虫工具' for extracting relevant information.

    -
    security
    A
    license
    -
    quality
    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
    1
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    Provides stealth browser capabilities using Playwright with anti-detection techniques, allowing MCP clients to navigate websites and take screenshots while evading common bot detection systems, thus functioning as a sophisticated '爬虫工具'.

    A
    security
    A
    license
    A
    quality
    Provides stealth browser capabilities using Playwright with anti-detection techniques, allowing MCP clients to navigate websites and take screenshots while evading common bot detection systems.
    1
    4
    TypeScript
    MIT License
  • Why this server?

    Enables AI agents to control web browsers via a standardized interface for operations like launching, interacting with, and closing browsers, acting as a '爬虫工具' for automation.

  • Why this server?

    An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods and content formats, making it suitable as a '爬虫工具'.

    -
    security
    A
    license
    -
    quality
    An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.
    66
    8
    TypeScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    A Model Context Protocol server that allows users to check if a website is experiencing downtime by querying isitdownrightnow.com, providing status information and details about recent downtime events, acting as a specific type of '爬虫工具' for website status.

    A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that allows AI agents to perform WHOIS lookups, enabling users to directly ask the AI about domain availability, ownership, registration details, and other domain information.
    4
    10
    1
    JavaScript
    MIT License
    • Linux
    • Apple
  • Why this server?

    The Search MCP Server enables seamless integration of network and local search capabilities in tools like Claude Desktop and Cursor, utilizing the Brave Search API for high-concurrency and asynchronous requests, functioning as a '爬虫工具' for broad search.

    -
    security
    A
    license
    -
    quality
    The Search MCP Server enables seamless integration of network and local search capabilities in tools like Claude Desktop and Cursor, utilizing the Brave Search API for high-concurrency and asynchronous requests.
    1
    52
    Python
    MIT License
    • Linux