Search for:

Methods to Parse a Web Page or Data

  • Why this server?

    Enables fetching web content using the Node.js undici library, supporting various HTTP methods and content formats, which is essential for parsing pages.

    -
    security
    A
    license
    -
    quality
    An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.
    66
    8
    TypeScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    Provides unified access to multiple search engines and content processing services, allowing parsing of the fetched content through its content processing capabilities.

    A
    security
    A
    license
    A
    quality
    🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi), AI tools (Perplexity, FastGPT), and content processing services (Jina AI, Kagi). Combines search, AI responses, content processing, and enhancement features through a single interface.
    15
    62
    24
    TypeScript
    MIT License
    • Linux
  • Why this server?

    Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, which aids in parsing and understanding web page structures.

    -
    security
    A
    license
    -
    quality
    Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.
    1
    1
    Python
    MIT License
  • Why this server?

    Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown, making it suitable for parsing diverse page structures.

    A
    security
    F
    license
    A
    quality
    Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
    4
    137,083
    150
    TypeScript
  • Why this server?

    A Model Context Protocol server that provides web content fetching and conversion capabilities, useful for parsing and processing page contents.

    A
    security
    A
    license
    A
    quality
    A Model Context Protocol server that provides web content fetching and conversion capabilities.
    4
    89
    2
    JavaScript
    MIT License
    • Apple
  • Why this server?

    A Python implementation that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown, facilitating easier parsing.

    -
    security
    A
    license
    -
    quality
    A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
    1
    Python
    MIT License
    • Linux
    • Apple
  • Why this server?

    Provides tools for reading and extracting text from PDF files, supporting both local files and URLs, which might be needed when dealing with PDFs that are technically considered pages.

    -
    security
    F
    license
    -
    quality
    Provides tools for reading and extracting text from PDF files, supporting both local files and URLs.
    3
    Python
  • Why this server?

    A Model Context Protocol server that enables interaction with markdown documentation files, providing capabilities for document management, metadata handling, search, and documentation health analysis.

    A
    security
    A
    license
    A
    quality
    A Model Context Protocol implementation that enables AI assistants to interact with markdown documentation files, providing capabilities for document management, metadata handling, search, and documentation health analysis.
    14
    346
    11
    TypeScript
    MIT License
    • Apple
    • Linux
  • Why this server?

    Uses PyPDF2 to extract pages and search PDFs, relevant when the pages are PDF files.

    A
    security
    A
    license
    A
    quality
    mcp using PyPDF2 to: • merge-pdfs • extract-pages • search-pdfs • merge-pdfs-ordered (merge in user spec. order) • find-related-pdfs (regex extracted text for related PDF files)
    5
    19
    Python
    The Unlicense
  • Why this server?

    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities, which helps in parsing various documents.

    A
    security
    A
    license
    A
    quality
    Provides comprehensive document processing, including reading, converting, and manipulating various document formats with advanced text and HTML processing capabilities.
    16
    21
    5
    TypeScript
    MIT License