Why this server?
This server is an excellent fit as it explicitly mentions 'web searching and webpage scraping using pure crawler technology', 'batch webpage scraping', and 'content extraction', which aligns directly with the information gathering and harvesting capabilities of 'theHarvester'.
A security · A license · - quality
Enables web searching and webpage scraping using pure crawler technology without requiring official APIs. Supports Bing web and news search, batch webpage scraping, and content extraction through Puppeteer automation.
Last updated 8 months ago · 41 · MIT

Why this server?
This server provides 'web scraping and crawling capabilities', supporting 'single-page scraping', 'multi-page website crawling', 'web search', and 'content extraction', making it highly relevant to the data collection function of 'theHarvester'.
- security · F license · - quality
Enables web scraping and crawling capabilities for LLM clients, supporting single-page scraping, multi-page website crawling, and web search with multiple engines (Playwright, Cheerio, Puppeteer) and flexible output formats including markdown, HTML, text, and screenshots.
Last updated 20 days ago · 95

Why this server?
As a 'web scraping server that allows... to extract various types of data from websites', this server directly matches the core functionality of 'theHarvester' for data acquisition.
- security · F license · - quality
A lightweight web scraping server that allows Claude Desktop users to extract various types of data from websites, including text, links, images, tables, headlines, and metadata using CSS selectors.
Last updated 10 months ago · 4

Why this server?
This server offers 'advanced web scraping' and 'smart content extraction', both of which are key aspects of the data harvesting processes of 'theHarvester'.
- security · A license · - quality
Provides advanced web scraping with HTTP client, smart content extraction to Markdown, browser automation via Playwright, screenshot/PDF generation, and Docker sandbox execution environments.
Last updated 5 months ago · 1 · MIT

Why this server?
Explicitly named 'Scraper MCP' and described as a 'context-optimized web scraping server', it directly correlates with the scraping and data gathering activities performed by 'theHarvester'.
- security · A license · - quality
A context-optimized web scraping server that converts HTML to markdown/text and applies CSS selectors server-side, reducing token usage by 70-90% while providing AI tools with clean, filtered web content.
Last updated 3 days ago · 6 · MIT

Why this server?
This server is a 'web scraping server' offering 'content extraction rules' for both static and dynamic websites, closely matching the functionality of 'theHarvester'.
A security · A license · - quality
A TypeScript-based web scraping server built on the Model Context Protocol that offers multiple export formats, content extraction rules, and support for both static and dynamic (SPA) websites.
Last updated 7 months ago · 721 · MIT

Why this server?
Described as a 'scraper tool' that fetches and processes web content with 'efficient content extraction', this server's capabilities are very similar to the data harvesting done by 'theHarvester'.

Oxylabs MCP Server (official)
A security · A license · - quality
A scraper tool that leverages the Oxylabs Web Scraper API to fetch and process web content with flexible options for parsing and rendering pages, enabling efficient content extraction from complex websites.
Last updated 4 months ago · 291 · MIT

Why this server?
Although this server is specific to QQ channels, its description highlights 'automated collection and downloading of multimedia content' and 'comprehensive media harvesting', so it shares the harvesting focus of 'theHarvester'.
- security · F license · - quality
Enables automated collection and downloading of multimedia content (images, GIFs, videos) from QQ channels. Features efficient video scraping, incremental updates, and intelligent fallback mechanisms for comprehensive media harvesting.
Last updated 5 months ago

Why this server?
This server is a 'powerful tool for fetching and extracting text content from web pages and APIs', supporting 'web scraping', which is a primary method used by 'theHarvester'.
A security · A license · - quality
A powerful tool for fetching and extracting text content from web pages and APIs, supporting web scraping, REST API requests, and Google Custom Search integration.
Last updated 2 months ago · 510 · MIT