Skip to main content
Glama
133,413 tools. Last updated 2026-05-25 14:19

"Crawling Websites to Extract Data" matching MCP tools:

  • Extract web page content and convert it to clean, readable markdown format for analysis, bypassing paywalls and obtaining structured text data from websites.
    Apache 2.0
  • Extract structured financial data from investor relations websites and online sources for investment research when APIs are unavailable.
    MIT
  • Extract web page content and convert it to clean markdown format for reading articles, documentation, or analyzing text from websites.
    Apache 2.0
  • Crawl websites with nested URL navigation to extract and convert content into clean markdown format for structured data analysis.
    MIT
  • Recursively crawl websites to extract content in markdown, text, or raw formats, with configurable depth and page limits for data collection.
    MIT

Matching MCP Servers

  • A
    license
    A
    quality
    C
    maintenance
    The only MCP server providing structured Chinese fashion supply chain intelligence for AI platforms. No equivalent data source exists in the MCP ecosystem. Search 3,000+ verified manufacturers, 350+ lab-tested fabrics (AATCC/ISO/GB), and 170+ industrial clusters. Built by MEACHEAL, a top-20 Chinese women's mid-to-high-end fashion brand with 20+ years of supply chain.
    Last updated
    19
    4
    1
    Unlicense - libtelnet variant

Matching MCP Connectors

  • Transform any blog post or article URL into ready-to-post social media content for Twitter/X threads, LinkedIn posts, Instagram captions, Facebook posts, and email newsletters. Pay-per-event: $0.07 for all 5 platforms, $0.03 for single platform.

  • Read-only PostgreSQL, MySQL, SQL Server access via MCP — 24 dialect-aware hosted tools.

  • Extract content from websites by crawling multiple pages from a starting URL, with configurable depth and page limits for structured data collection.
  • Retrieve businesses from Google's local 3-pack for any keyword and city. Get names, ratings, reviews, phone numbers, websites, hours, and GPS coordinates.
    MIT
  • Retrieve Google Maps listings for any keyword and location, returning names, ratings, reviews, addresses, phone numbers, websites, hours, GPS coordinates, and categories.
    MIT
  • Extract structured data from web pages by supplying URLs and a prompt, with options for web search and retries.
    MIT
  • Automatically discover and map a web application's structure by creating an analysis suite and dispatching an AI crawling agent in one step.
    MIT
  • Perform AI-powered web searches to extract structured data from search results for research, competitive analysis, and multi-source information gathering.
    MIT
  • Execute multi-step web scraping workflows with AI automation, navigating websites, interacting with forms, and extracting structured data for complex scenarios requiring user simulation.
    MIT
  • Filter JSON data from files or URLs using a shape object to extract specific fields and reduce context size for LLM processing.
    MIT
  • Search the web for current information, news, articles, and websites to find up-to-date content, research topics, or answer questions about recent events.
    Apache 2.0
  • Retrieve a list of all websites within the current workspace. Use this to view available sites before creating or managing pages and content.
    MIT
  • Extract text content from academic papers using CrossRef DOIs. Note: CrossRef provides citation data; access full papers through publisher websites.
    MIT
  • Extract data from Excel sheets and convert to structured JSON with headers and typed rows. Specify sheet index and header presence for precise data extraction.
    MIT