novada-mcp
OfficialExtract structured product data (title, price, rating, reviews, BSR, availability) from Amazon product pages via novada_scrape.
Web search engine used by novada_search tool for performing searches.
Extract repository data (stars, forks, issues, description, languages) from GitHub via novada_scrape.
Extract reviews, ratings, and salary data from Glassdoor company pages via novada_scrape.
Web search engine used by novada_search tool for performing searches.
Extract profile data (posts, followers, engagement) from Instagram via novada_scrape.
AI brand monitoring tool to check how Perplexity mentions your brand via novada_ai_monitor.
Extract posts, scores, comments, and timestamps from Reddit subreddits via novada_scrape.
Extract video stats, engagement, and profile data from TikTok via novada_scrape.
Supported as a hard target for scraping with automatic anti-bot bypass via novada_scrape.
Extract video titles, views, duration, and channel data from YouTube via novada_scrape.
Extract property data (price, beds, baths, sqft, Zestimate) from Zillow via novada_scrape.
novada
One MCP server. All web data. Search, scrape, crawl, proxy, and AI research — in a single
npxcommand.
The Problem
AI agents need web data but the tools are fragmented:
Tavily does search but can't scrape or proxy
Firecrawl does scrape but can't search or proxy
BrightData does everything but ships 69 tools that bloat your context window
Building it yourself means maintaining proxies, anti-bot bypass, retry logic, and 10 different APIs
Related MCP server: mcp-server-scraper
The Fix
npx novada-mcpOne server. One API key. Tools that cover every web data need an AI agent has:
Need | Tool | What it does |
Find information |
| Web search across Google, Bing, DuckDuckGo, Yandex, Yahoo |
Read a page |
| Any URL → clean markdown, batch up to 10 in parallel |
Deep research |
| One call → parallel searches → dedup → cited multi-source report |
Crawl a site |
| BFS/DFS up to 20 pages with regex path filtering |
Discover URLs |
| Sitemap + BFS discovery without reading content |
Platform data |
| Amazon, LinkedIn, TikTok, GitHub, Zillow — 129 platforms |
Monitor changes |
| Track price/content/availability changes between checks |
Verify claims |
| Parallel fact-checking against live web sources |
Raw HTML |
| JS render or full browser CDP for bot-protected pages |
Browser automation |
| Navigate, click, type, fill forms, screenshot in cloud browser |
Browser flows |
| Multi-step browser automation sequences |
Proxy credentials |
| Residential, mobile, ISP, datacenter, static, dedicated — 195 countries |
AI brand monitoring |
| Check how ChatGPT, Perplexity, Grok, Claude, Gemini mention your brand |
Health check |
| Check which API products are active on your key |
Async scraping |
| Submit async scraping task → poll → retrieve results |
What Makes This Different
novada_research is unique. No other MCP server turns one question into a cited multi-source report. It searches across Google, Bing, and DuckDuckGo in parallel, deduplicates, extracts full content from the top 5 sources, and synthesizes with citations. One tool call replaces an entire research workflow. Depth options: quick (3 queries), deep (5-6), comprehensive (8-10).
Auto-escalation handles anti-bot automatically. Static fetch → JS render → Browser CDP. Known hard targets (Amazon, LinkedIn, G2, Zillow, Glassdoor, Walmart, Instagram, TikTok, Shein) skip straight to the right method based on a 30+ domain registry. You never think about Cloudflare, DataDome, Kasada, or PerimeterX — the tool handles it.
Agent-first design (8.5/10 benchmark score). Every response includes agent_instruction with structured next-step guidance, source field (live/cache/wayback), structured errors with failure_class, cross-tool hints suggesting better alternatives, and a ## Agent Action block with machine-parseable status codes.
Quick Start
Get a key at novada.com
Add to your MCP client:
Claude Code:
claude mcp add novada -e NOVADA_API_KEY=your_key -- npx -y novada-mcpClaude Desktop / Cursor / VS Code / Windsurf:
{
"mcpServers": {
"novada": {
"command": "npx",
"args": ["-y", "novada-mcp"],
"env": { "NOVADA_API_KEY": "your_key" }
}
}
}Try it:
novada_search({query: "Claude MCP tutorials", num: 5})
novada_research({question: "How do MCP servers work?", depth: "deep"})
novada_extract({url: "https://news.ycombinator.com", format: "markdown"})
novada_monitor({url: "https://amazon.com/dp/B09...", fields: ["price", "availability"]})Tool Reference
Search & Research
Tool | Purpose | Key Params | Example |
| Web search via 5 engines |
|
|
| Multi-source parallel research |
|
|
| Fact-check claims against web |
|
|
Extract & Crawl
Tool | Purpose | Key Params | Example |
| Extract content from URL(s) |
|
|
| Crawl multiple pages from a domain |
|
|
| Discover URLs on a site |
|
|
| Detect page changes over time |
|
|
Structured Platform Data
novada_scrape supports 129 platforms with structured data extraction. Returns clean tabular records, not raw HTML.
Platform | Operation Examples | Data Returned |
Amazon |
| Title, price, rating, reviews, BSR, availability |
| Company info, employee count, profile data | |
TikTok |
| Video stats, engagement, profile data |
GitHub |
| Stars, forks, issues, description, languages |
| Posts, scores, comments, timestamps | |
Zillow |
| Price, beds, baths, sqft, Zestimate |
Glassdoor |
| Reviews, ratings, salary data |
YouTube |
| Video titles, views, duration, channel |
| Posts, followers, engagement | |
Google Shopping |
| Products, prices, merchants |
Full platform list: call novada_discover or read the novada://scraper-platforms MCP resource.
Proxy Network
Route your own HTTP requests through Novada's proxy infrastructure. 100M+ IPs across 195 countries.
Tool | Proxy Type | Best For |
| Real home ISP IPs | Anti-bot bypass, geo-restricted content |
| ISP-assigned IPs | Social media, ecommerce platforms |
| Datacenter IPs | High-volume, non-protected targets |
| 4G/5G mobile IPs | Mobile-targeted content, app APIs |
| Dedicated static ISP IP | Account management, login flows |
| Exclusive datacenter IP | High-trust platforms, clean reputation |
Each proxy tool returns connection credentials in url, env, or curl format. Params: country (ISO 2-letter), city (optional), session_id (for sticky sessions).
Browser Automation
Tool | Purpose | Example |
| Full browser interaction via CDP |
|
| Multi-step automation sequences | Click, scroll, wait, type, screenshot — up to 20 actions per call |
| Raw rendered HTML from protected pages |
|
Sessions persist across calls via session_id. Cookies, login state, and page context are maintained.
Use Cases
AI Agent Research & RAG Pipelines
novada_research({question: "What are the latest developments in quantum computing?", depth: "comprehensive"})Returns a cited multi-source report. Feed directly into RAG vector stores or use as context for agent reasoning.
E-Commerce Price Monitoring
novada_monitor({url: "https://amazon.com/dp/B0XXXXXX", fields: ["price", "availability"]})First call records baseline. Call again later — returns field-level diffs with percentage change (e.g., price: $999 → $899, ↓10%).
Competitive Intelligence
novada_scrape({platform: "amazon.com", operation: "amazon_product_keywords", params: {keyword: "wireless earbuds"}, limit: 20})Get structured product data (price, rating, reviews, BSR) for competitive analysis across 129 platforms.
Lead Generation
novada_scrape({platform: "linkedin.com", operation: "linkedin_company_information_url", params: {url: "https://linkedin.com/company/..."}, limit: 1})Extract company info, employee count, and industry data from LinkedIn company pages.
Content Extraction for LLM Training
novada_crawl({url: "https://docs.example.com", max_pages: 20, select_paths: "/docs/.*"})Crawl documentation sites and extract clean markdown for fine-tuning datasets or knowledge bases.
AI Brand Monitoring
novada_ai_monitor({brand: "YourProduct", models: ["chatgpt", "perplexity", "claude"]})Check how AI models reference your brand: sentiment, claims, competitor mentions, source URLs.
Geo-Targeted Data Collection
novada_proxy_residential({country: "DE", city: "berlin", format: "curl"})Get proxy credentials for any of 195 countries. Use with your own HTTP client for geo-specific content access.
Honest Comparison
Novada | Firecrawl | Tavily | BrightData | |
Tools | 25 | 14 | 2 | 69 |
Search engines | 5 | 0 | 1 | 3 |
Multi-source research | Yes | No | No | No |
Proxy as MCP tool | Yes | No | No | No |
Auto anti-bot escalation | Yes | No | N/A | No |
Change monitoring | Yes | No | No | No |
Platform scraping | 129 platforms | No | No | 437 platforms |
Browser automation | Yes (CDP) | No | No | Yes |
MCP Prompts & Resources | Yes (5+4) | No | No | No |
Hosted MCP (no install) | No | No | No | Yes |
Agent-first score | 8.5/10 | 6.0 | 6.0 | N/A |
What we don't have yet: hosted HTTP endpoint (requires terminal install for now), and some Scraper API platforms need separate activation. BrightData has more structured scrapers (437 vs 129).
Anti-Bot Support
Novada automatically handles these anti-bot systems via its escalation chain:
Anti-Bot System | Detection | Escalation Method |
Cloudflare |
| Auto-render via Web Unblocker |
DataDome |
| Auto-render |
Kasada | Script path detection | Browser CDP |
PerimeterX |
| Auto-render |
Akamai |
| Auto-render |
Imperva/Incapsula |
| Auto-render |
30+ domains are pre-tagged in the hard target registry — these skip static fetch entirely and go straight to the right method.
Configuration
Variable | Required | Purpose |
| Yes | API key — covers search, extract, crawl, scrape, research, verify, monitor |
| No | Browser API WebSocket URL for |
| No | Proxy username for |
| No | Proxy password |
| No | Proxy host:port endpoint |
| No | Separate key for Web Unblocker (if different from main API key) |
| No | Load specific tools only: |
| No | Load tool groups: |
Links
Docs + API key: novada.com
GitHub: github.com/NovadaLabs/novada-mcp
Tool details: call
novada_discoverornovada_healthfrom any MCP client
License
MIT
Maintenance
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/NovadaLabs/novada-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server