Scraping Public Documents

  • Why this server?

    This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption, which is useful for scraping public documents.

    Security: A, License: A, Quality: A
    1
    37,968
    JavaScript
    MIT License
  • Why this server?

    A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites, helpful for accessing various public documents.

    Security: A, License: A, Quality: A
    2
    14
    Python
    MIT License
    • Apple
    • Linux
  • Why this server?

    A Model Context Protocol server that enables LLMs to fetch and process web content in multiple formats (HTML, JSON, Markdown, text) with automatic format detection, which can be used to retrieve and scrape data from different public sources.

    Security: -, License: F, Quality: -
    TypeScript
    • Apple
  • Why this server?

    A Model Context Protocol server enabling LLMs to search, retrieve, and manage documents through Rememberizer's knowledge management API, which is useful for managing extracted documents.

    Security: -, License: A, Quality: -
    19
    Python
    Apache 2.0
  • Why this server?

    Provides RAG capabilities for semantic document search using Qdrant vector database and Ollama/OpenAI embeddings, allowing users to add, search, list, and delete documentation with metadata support.

    Security: -, License: A, Quality: -
    5
    4
    TypeScript
    Apache 2.0
  • Why this server?

    A server that enables AI assistants like Claude to perform web searches using the Exa AI Search API, providing real-time web information in a safe and controlled way, useful for finding publicly available documents.

    Security: -, License: A, Quality: -
    1,858
    MIT License
    • Apple
  • Why this server?

    An official MCP server implementation that allows AI assistants to capture website screenshots through the ScreenshotOne API, enabling visual context from web pages during conversations.

    Security: A, License: A, Quality: A
    1
    6
    TypeScript
    MIT License
    • Apple
  • Why this server?

    A Model Context Protocol server that enables LLMs to interact with web pages through structured accessibility snapshots without requiring vision models or screenshots.

    Security: A, License: A, Quality: A
    21
    31,050
    8,259
    TypeScript
    Apache 2.0
    • Linux
    • Apple
  • Why this server?

    A Go server implementing the Model Context Protocol (MCP) for filesystem operations, which may assist in accessing local documents.

  • Why this server?

    A server that provides AgentQL's data extraction capabilities, enabling AI agents to get structured data from unstructured websites.

    Security: A, License: A, Quality: A
    1
    183
    28
    JavaScript
    MIT License
    • Apple
    • Linux