Developing a web scraper

Glama

Search for:

Developing a web scraper

View all MCP Servers

Why this server?
Enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations, which is essential for a web scraper.
MCP Node Fetch
mcollina
A
security
A
license
A
quality
An MCP server that enables fetching web content using the Node.js undici library, supporting various HTTP methods, content formats, and request configurations.
Last updated 5 months ago
3
419
11
TypeScript
MIT License
Why this server?
Provides unified access to multiple search engines and content processing services, useful for a comprehensive web scraping project.
mcp-omnisearch
spences10
A
security
A
license
A
quality
🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi), AI tools (Perplexity, FastGPT), and content processing services (Jina AI, Kagi). Combines search, AI responses, content processing, and enhancement features through a single interface.
Last updated 5 days ago
15
576
145
TypeScript
MIT License
Why this server?
A server that provides AgentQL's data extraction capabilities enabling AI agents to get structured data from unstructured web, which helps in extracting data for a web scraper.
AgentQL MCP Server
tinyfish-io
A
security
A
license
A
quality
A server that provides AgentQL's data extraction capabilities enabling AI agents to get structured data from unstructured web
Last updated 7 days ago
1
568
89
JavaScript
MIT License
Why this server?
Enables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript, which can be part of a scraper.
Browser Use Server
ztobs
A
security
F
license
A
quality
Enables browser automation using Python scripts, offering operations like taking webpage screenshots, retrieving HTML content, and executing JavaScript.
Last updated 3 months ago
4
17
Python
Why this server?
Implementation of an MCP server for the RAG Web Browser Actor, which serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT. Useful for understanding how a browser is used in context.
mcp-server-rag-web-browserofficial
apify
A
security
A
license
A
quality
Implementation of an MCP server for the RAG Web Browser Actor. This Actor serves as a web browser for large language models (LLMs) and RAG pipelines, similar to a web search in ChatGPT.
Last updated 4 months ago
1
61
175
JavaScript
Apache 2.0
Why this server?
Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination, important for preparing web content.
Fetch MCP Server
ExactDoug
-
security
A
license
-
quality
Enables retrieval and processing of web page content for LLMs by converting HTML to markdown, with support for content truncation and pagination.
Last updated 5 months ago
1
1
Python
MIT License
Why this server?
A server that allows fetching Instagram posts using Chrome's existing login session, which may be helpful for specific web scraping tasks.
Instagram MCP Server
duhlink
A
security
F
license
A
quality
A server that allows fetching Instagram posts using Chrome's existing login session via Model Context Protocol (MCP).
Last updated 7 months ago
1
30
TypeScript
Why this server?
Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown, which are all needed for scraping data.
Fetch MCP Server
zcaceres
A
security
A
license
A
quality
Provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
Last updated 3 months ago
4
120,112
502
TypeScript
MIT License
Why this server?
It crawls website
mcp-crawler
orange-fruit01
-
security
F
license
-
quality
It crawls website
Last updated 4 months ago
Python
Why this server?
A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown - essential for focused data extraction from scraped pages.
Mozilla Readability Parser MCP Server
jmh108
-
security
A
license
-
quality
A Python implementation of an MCP server that extracts webpage content, removes ads and non-essential elements, and transforms it into clean, LLM-optimized Markdown.
Last updated 7 months ago
2
Python
MIT License

Developing a web scraper

mcp-server-rag-web-browserofficial