Skip to main content
Glama
217,972 tools. Last updated 2026-06-21 03:13

"crawling-websites-to-extract-data" matching MCP tools:

  • Extract web page content and convert it to clean, readable markdown format for analysis, bypassing paywalls and obtaining structured text data from websites.
    Apache 2.0
  • Extract structured financial data from investor relations websites and online sources for investment research when APIs are unavailable.
    MIT
  • Extract web page content and convert it to clean markdown format for reading articles, documentation, or analyzing text from websites.
    Apache 2.0
  • Extract, summarize, or scrape web content from URLs to read articles, crawl sites, or extract structured data using multiple providers.
    MIT
  • Crawl websites with nested URL navigation to extract and convert content into clean markdown format for structured data analysis.
    MIT

Matching MCP Servers

  • F
    license
    A
    quality
    B
    maintenance
    The only MCP server providing structured Chinese fashion supply chain intelligence for AI platforms. No equivalent data source exists in the MCP ecosystem. Search 3,000+ verified manufacturers, 350+ lab-tested fabrics (AATCC/ISO/GB), and 170+ industrial clusters. Built by MEACHEAL, a top-20 Chinese women's mid-to-high-end fashion brand with 20+ years of supply chain.
    Last updated
    19
    4
    1

Matching MCP Connectors

  • MCP server (stdio): fetch web pages as clean readable markdown via the AgentForge API

  • Transform any blog post or article URL into ready-to-post social media content for Twitter/X threads, LinkedIn posts, Instagram captions, Facebook posts, and email newsletters. Pay-per-event: $0.07 for all 5 platforms, $0.03 for single platform.

  • Recursively crawl websites to extract content in markdown, text, or raw formats, with configurable depth and page limits for data collection.
    MIT
  • Extract content from websites by crawling multiple pages from a starting URL, with configurable depth and page limits for structured data collection.
  • Retrieve businesses from Google's local 3-pack for any keyword and city. Get names, ratings, reviews, phone numbers, websites, hours, and GPS coordinates.
    MIT
  • Retrieve Google Maps listings for any keyword and location, returning names, ratings, reviews, addresses, phone numbers, websites, hours, GPS coordinates, and categories.
    MIT
  • Perform AI-powered web searches to extract structured data from search results for research, competitive analysis, and multi-source information gathering.
    MIT
  • Filter JSON data from files or URLs using a shape object to extract specific fields and reduce context size for LLM processing.
    MIT
  • Search the web for current information, news, articles, and websites to find up-to-date content, research topics, or answer questions about recent events.
    Apache 2.0
  • Extract and analyze hyperlinks from web pages, organizing URLs, anchor text, and contextual information into a structured format. Supports site mapping, SEO analysis, broken link checking, and targeted crawling preparation. Handles relative and absolute URLs with optional base URL and output limits.
    MIT
  • Retrieve a list of all websites within the current workspace. Use this to view available sites before creating or managing pages and content.
    MIT
  • Search the web using Google to find current information, news, websites, and general knowledge. Returns structured results with titles, URLs, and snippets.
    MIT