Wafle-Scraper
Fetches public posts from a subreddit via the official Reddit API.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Wafle-Scraperscrape the front page of Hacker News"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Wafle-Scraper — Universal MCP Server for Safe Web Scraping
Wafle-Scraper is an MCP server that lets AI agents (OpenCode, Claude, Cursor) extract data from the web safely and responsibly.
Safety First
Rule | Enforcement |
No private data | Every browser session is incognito — no cookies, no localStorage, no saved passwords |
Only what you ask | The scraper never mines extra data beyond your explicit request |
No localhost | Internal/private IPs are blocked by default |
Rate limited | Minimum 2 seconds between requests — never floods servers |
CAPTCHA = human only | No automated CAPTCHA solving. If one appears, you solve it interactively |
User-Agent rotation | Each request looks like a real browser |
Related MCP server: Stealth Browser MCP
Quick Start
pip install wafle-scraper
playwright install chromium
wafle-scraperConfiguration
OpenCode / Claude Desktop / Cursor
{
"mcpServers": {
"wafle-scraper": {
"command": "wafle-scraper",
"description": "Web scraping & browser automation — incognito, audited, safe"
}
}
}CLI Options
wafle-scraper # MCP stdio mode (default for agents)
wafle-scraper --http --port 8000 # HTTP SSE mode
wafle-scraper --version # Show versionMCP Tools
Tool | Description | Safety |
| Extract text from a static URL (requests + BeautifulSoup) | ✅ Read-only, no JS |
| Navigate a page in isolated incognito browser and extract text | ✅ Incognito, no cookies |
| Fetch public posts from a subreddit via official API | ✅ API, no scraping |
| Find email addresses on a public page | ✅ Only what you ask |
| Click, type, scroll, extract, screenshot in live browser | ✅ You control the actions |
Browser Backend (Playwright)
Incognito always:
storage_state=None, fresh context per sessionNo permissions: No camera, mic, location access
Anti-detection: Rotating UA, viewport, locale, timezone
Natural delays: Human-like timing between actions
Gradual scroll: Loads lazy content naturally
CAPTCHA Handling
Wafle-Scraper does NOT solve CAPTCHAs automatically. When a CAPTCHA is detected:
The scraper pauses
Prompts you to open the URL in your browser
You solve the CAPTCHA manually
Type
doneand the scraper continues
This is the only ethical and reliable approach without paid services.
Security
Blocked:
localhost,127.0.0.1, private IPs,file://,chrome://Rate limiting (configurable, default 2s min interval)
Scope enforcement — only processes what you explicitly request
User-Agent rotation
Browser isolation — Playwright contexts are fully sandboxed
Requirements
Python 3.10+
Playwright with Chromium installed (
playwright install chromium)Windows, macOS, Linux
Installation from Source
git clone https://github.com/creandoaldia/wafle-scraper.git
cd wafle-scraper
pip install -e .
playwright install chromiumLicense
MIT
Why "Wafle-Scraper"?
Part of the WAFLE ecosystem (Web AI Framework for Language Ecosystems). Wafle-Scraper gives WAFLE agents the ability to read the live web — safely, transparently, and under your control.
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/creandoaldia/wafle-scraper'
If you have feedback or need assistance with the MCP directory API, please join our Discord server