
scrape

Read-only

Extract webpage content for analysis by fetching clean markdown, HTML, or structured JSON. Handles JavaScript-rendered pages and bypasses anti-bot protection.

Instructions

Scrape any webpage and return its content using ZenRows.

Use this tool to fetch webpage content for analysis. By default it returns clean markdown, which is ideal for LLM processing.

When to enable options:

  • js_render: page uses React/Vue/Angular, loads content dynamically, or content appears missing on the first attempt

  • premium_proxy: site returns 403/blocked errors even with js_render enabled

  • wait_for: specific content loads after initial render (requires js_render)

  • css_extractor: you only need specific elements, not the whole page

  • autoparse: structured data pages like products or articles

Examples:
  Basic:     { url: "https://example.com" }
  Dynamic:   { url: "https://spa.com", js_render: true }
  Protected: { url: "https://protected.com", js_render: true, premium_proxy: true }
  Extract:   { url: "https://shop.com", css_extractor: '{"title":"h1","price":".price"}' }

Input Schema

  • url (required): The webpage URL to scrape
  • js_render: Enable JavaScript rendering via headless browser. Required for SPAs (React, Vue, Angular) and pages that load content dynamically.
  • premium_proxy: Use premium residential proxies to bypass anti-bot protection. Required for heavily protected sites. Implies higher credit cost.
  • proxy_country: Country for geo-targeted scraping. ISO 3166-1 alpha-2 code (e.g. 'US', 'GB', 'DE'). Requires premium_proxy=true.
  • response_type (default: markdown): Output format. 'markdown' preserves structure and is ideal for LLMs. 'plaintext' strips all formatting for pure text extraction. 'pdf' returns a PDF of the page. 'html' returns the raw HTML source (omits the response_type param; ZenRows default). Ignored when autoparse, css_extractor, outputs, or screenshot params are set.
  • autoparse: Automatically extract structured data from the page into JSON. Best for product pages, articles, and listings.
  • css_extractor: Extract specific elements using CSS selectors. JSON object mapping names to selectors, e.g. '{"title":"h1","price":".price-tag"}'. Returns JSON instead of full page content.
  • wait_for: CSS selector to wait for before capturing. Use when key content loads after the initial page render. Requires js_render=true.
  • wait: Milliseconds to wait after page load before capturing content. Max 30000 (30s). Requires js_render=true.
  • js_instructions: JSON array of browser interactions to run before scraping. Requires js_render=true. Example: [{"click":"#load-more"},{"wait":1000},{"wait_for":".results"}]
  • outputs: Comma-separated list of data types to extract as structured JSON. Available: emails, headings, links, menus, images, videos, audios. Use '*' for all types. Returns JSON instead of full page content.
  • screenshot: Capture an above-the-fold screenshot of the page. Returns an image instead of text content. Useful for visual verification or debugging.
  • screenshot_fullpage: Capture a full-page screenshot including content below the fold. Returns an image instead of text content.
  • screenshot_selector: Capture a screenshot of a specific element using a CSS selector. Example: ".product-card". Returns an image instead of text content.
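
As an illustration of how these parameters interact (the URL and selector values below are hypothetical, not from the source), a geo-targeted scrape of a JavaScript-heavy page would combine js_render, premium_proxy, proxy_country, and wait_for. A minimal sketch:

```typescript
// Sketch: assembling scrape parameters into a query string, mirroring the
// constraints listed above. All concrete values are hypothetical examples.
const params = new URLSearchParams({
  url: "https://spa.example.com", // hypothetical target
  js_render: "true",              // SPA: content is rendered client-side
  premium_proxy: "true",          // prerequisite for proxy_country
  proxy_country: "US",            // ISO 3166-1 alpha-2 code
  wait_for: ".results",           // capture only after this selector appears
});

console.log(params.toString());
```

Note that wait_for and proxy_country are only honored alongside their prerequisites (js_render=true and premium_proxy=true, respectively), per the schema descriptions.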

Implementation Reference

  • The implementation of the 'scrape' tool handler, which constructs API parameters, performs a fetch request to the ZenRows API, and processes the response (text or image).
    async (params) => {
      const searchParams = new URLSearchParams({
        apikey: apiKey,
        url: params.url,
      });
    
      if (
        params.js_render ||
        params.screenshot ||
        params.screenshot_fullpage ||
        params.screenshot_selector
      )
        searchParams.set("js_render", "true");
      if (params.premium_proxy) searchParams.set("premium_proxy", "true");
      if (params.proxy_country)
        searchParams.set("proxy_country", params.proxy_country.toUpperCase());
      if (params.autoparse) searchParams.set("autoparse", "true");
      if (params.css_extractor) searchParams.set("css_extractor", params.css_extractor);
      if (params.wait_for) searchParams.set("wait_for", params.wait_for);
      if (params.wait != null) searchParams.set("wait", String(params.wait));
      if (params.js_instructions) searchParams.set("js_instructions", params.js_instructions);
      if (params.outputs) searchParams.set("outputs", params.outputs);
      if (params.screenshot || params.screenshot_fullpage || params.screenshot_selector)
        searchParams.set("screenshot", "true");
      if (params.screenshot_fullpage) searchParams.set("screenshot_fullpage", "true");
      if (params.screenshot_selector)
        searchParams.set("screenshot_selector", params.screenshot_selector);
    
      // response_type is mutually exclusive with autoparse, css_extractor, outputs, and screenshot params.
      // 'html' is the ZenRows default (no param); all other values are passed through.
      const isScreenshot =
        params.screenshot || params.screenshot_fullpage || params.screenshot_selector;
      const effectiveType = params.response_type ?? "markdown";
      if (
        !params.autoparse &&
        !params.css_extractor &&
        !params.outputs &&
        !isScreenshot &&
        effectiveType !== "html"
      ) {
        searchParams.set("response_type", effectiveType);
      }
    
      let response: Response;
      try {
        response = await fetch(`${ZENROWS_API_URL}?${searchParams}`, {
          headers: { "User-Agent": `zenrows/mcp ${pkg.version}` },
        });
      } catch (err) {
        return {
          content: [
            {
              type: "text" as const,
              text: `Network error contacting ZenRows: ${err instanceof Error ? err.message : String(err)}`,
            },
          ],
          isError: true,
        };
      }
    
      if (!response.ok) {
        const body = await response.text();
        return {
          content: [{ type: "text" as const, text: `ZenRows error ${response.status}: ${body}` }],
          isError: true,
        };
      }
    
      const contentType = response.headers.get("content-type") ?? "";
      const buffer = await response.arrayBuffer();
      const bytes = new Uint8Array(buffer);
      const isPng =
        bytes[0] === 0x89 && bytes[1] === 0x50 && bytes[2] === 0x4e && bytes[3] === 0x47;
      const isJpeg = bytes[0] === 0xff && bytes[1] === 0xd8;
      if (contentType.startsWith("image/") || isPng || isJpeg) {
        const mimeType = isPng
          ? "image/png"
          : isJpeg
            ? "image/jpeg"
            : (contentType.split(";")[0].trim() as "image/png" | "image/jpeg");
        const base64 = Buffer.from(buffer).toString("base64");
        return {
          content: [{ type: "image" as const, data: base64, mimeType }],
        };
      }
    
      return {
        content: [{ type: "text" as const, text: new TextDecoder().decode(buffer) }],
      };
    }
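
The handler's magic-byte sniffing can be exercised in isolation. A standalone sketch of the same check (the `sniffImage` helper name is hypothetical; the byte signatures match the handler above):

```typescript
// Sketch: detect PNG vs JPEG from leading magic bytes, as the handler does
// before deciding whether to return an image content block.
function sniffImage(bytes: Uint8Array): "image/png" | "image/jpeg" | null {
  const isPng =
    bytes[0] === 0x89 && bytes[1] === 0x50 && bytes[2] === 0x4e && bytes[3] === 0x47;
  const isJpeg = bytes[0] === 0xff && bytes[1] === 0xd8;
  return isPng ? "image/png" : isJpeg ? "image/jpeg" : null;
}

console.log(sniffImage(new Uint8Array([0x89, 0x50, 0x4e, 0x47]))); // "image/png"
```

This byte-level check is what lets the handler return a correct image block even when the Content-Type header is missing or generic.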
  • src/server.ts:18-162 (registration)
    Registration of the 'scrape' tool with its input schema and configuration.
      server.registerTool(
        "scrape",
        {
          annotations: {
            title: "Scrape Webpage",
            readOnlyHint: true,
            destructiveHint: false,
          },
          description: `Scrape any webpage and return its content using ZenRows.
    
    Use this tool to fetch webpage content for analysis. By default it returns clean
    markdown, which is ideal for LLM processing.
    
    When to enable options:
    - js_render: page uses React/Vue/Angular, loads content dynamically, or content
      appears missing on the first attempt
    - premium_proxy: site returns 403/blocked errors even with js_render enabled
    - wait_for: specific content loads after initial render (requires js_render)
    - css_extractor: you only need specific elements, not the whole page
    - autoparse: structured data pages like products or articles
    
    Examples:
      Basic:    { url: "https://example.com" }
      Dynamic:  { url: "https://spa.com", js_render: true }
      Protected:{ url: "https://protected.com", js_render: true, premium_proxy: true }
      Extract:  { url: "https://shop.com", css_extractor: '{"title":"h1","price":".price"}' }`,
          inputSchema: {
            url: z.string().url().describe("The webpage URL to scrape"),
    
            js_render: z
              .boolean()
              .optional()
              .default(false)
              .describe(
                "Enable JavaScript rendering via headless browser. Required for SPAs " +
                  "(React, Vue, Angular) and pages that load content dynamically."
              ),
    
            premium_proxy: z
              .boolean()
              .optional()
              .default(false)
              .describe(
                "Use premium residential proxies to bypass anti-bot protection. " +
                  "Required for heavily protected sites. Implies higher credit cost."
              ),
    
            proxy_country: z
              .string()
              .optional()
              .describe(
                "Country for geo-targeted scraping. ISO 3166-1 alpha-2 code (e.g. 'US', 'GB', 'DE'). " +
                  "Requires premium_proxy=true."
              ),
    
            response_type: z
              .enum(["markdown", "plaintext", "pdf", "html"])
              .optional()
              .default("markdown")
              .describe(
                "Output format. 'markdown' (default) preserves structure and is ideal for LLMs. " +
                  "'plaintext' strips all formatting for pure text extraction. " +
                  "'pdf' returns a PDF of the page. " +
                  "'html' returns the raw HTML source (omits the response_type param; ZenRows default). " +
                  "Ignored when autoparse, css_extractor, outputs, or screenshot params are set."
              ),
    
            autoparse: z
              .boolean()
              .optional()
              .describe(
                "Automatically extract structured data from the page into JSON. " +
                  "Best for product pages, articles, and listings."
              ),
    
            css_extractor: z
              .string()
              .optional()
              .describe(
                "Extract specific elements using CSS selectors. " +
                  'JSON object mapping names to selectors, e.g. \'{"title":"h1","price":".price-tag"}\'. ' +
                  "Returns JSON instead of full page content."
              ),
    
            wait_for: z
              .string()
              .optional()
              .describe(
                "CSS selector to wait for before capturing. Use when key content loads " +
                  "after the initial page render. Requires js_render=true."
              ),
    
            wait: z
              .number()
              .int()
              .min(0)
              .max(30000)
              .optional()
              .describe(
                "Milliseconds to wait after page load before capturing content. " +
                  "Max 30000 (30s). Requires js_render=true."
              ),
    
            js_instructions: z
              .string()
              .optional()
              .describe(
                "JSON array of browser interactions to run before scraping. Requires js_render=true. " +
                  'Example: [{"click":"#load-more"},{"wait":1000},{"wait_for":".results"}]'
              ),
    
            outputs: z
              .string()
              .optional()
              .describe(
                "Comma-separated list of data types to extract as structured JSON. " +
                  "Available: emails, headings, links, menus, images, videos, audios. " +
                  "Use '*' for all types. Returns JSON instead of full page content."
              ),
    
            screenshot: z
              .boolean()
              .optional()
              .describe(
                "Capture an above-the-fold screenshot of the page. " +
                  "Returns an image instead of text content. Useful for visual verification or debugging."
              ),
    
            screenshot_fullpage: z
              .boolean()
              .optional()
              .describe(
                "Capture a full-page screenshot including content below the fold. " +
                  "Returns an image instead of text content."
              ),
    
            screenshot_selector: z
              .string()
              .optional()
              .describe(
                "Capture a screenshot of a specific element using a CSS selector. " +
                  'Example: ".product-card". Returns an image instead of text content.'
              ),
          },
        },
Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and destructiveHint=false. Description adds valuable behavioral context: default markdown output ideal for LLMs, and crucially explains that certain parameters (css_extractor, autoparse, outputs, screenshot) change the return type from text to JSON or images. This output-switching behavior is not captured in annotations.
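
To make this output-switching concrete, a caller can branch on the type of each content block. A minimal sketch (the `Block` type and `summarize` helper are hypothetical, written to match the handler's return shapes above):

```typescript
// Sketch: handling the tool's variable return type. Content blocks are
// either text or base64-encoded images, per the handler implementation.
type Block =
  | { type: "text"; text: string }
  | { type: "image"; data: string; mimeType: string };

function summarize(content: Block[]): string {
  return content
    .map((b) =>
      b.type === "text" ? `text(${b.text.length} chars)` : `image(${b.mimeType})`
    )
    .join(", ");
}

console.log(summarize([{ type: "text", text: "# Title" }])); // "text(7 chars)"
```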

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured with clear information hierarchy: purpose statement, default behavior, conditional options guide, and examples. Every section earns its place. Examples section is slightly verbose but appropriate for a 14-parameter tool where syntax matters. Good use of formatting (bullet points, code blocks).

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a complex tool with 14 parameters and no output schema, description adequately explains return value variations (markdown default vs JSON vs images depending on params). Covers the ZenRows-specific options (premium_proxy credit cost mentioned in schema, wait_for interactions explained). Could mention error handling or rate limits, but sufficient for invocation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, establishing baseline 3. Description adds significant value via the 'When to enable options' section which provides contextual semantics for when to use parameters (e.g., 'page uses React/Vue/Angular' triggers js_render). The concrete examples demonstrate parameter interactions and valid value formats (e.g., CSS selector JSON syntax).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Opens with specific verb+resource ('Scrape any webpage') and identifies the underlying service ('using ZenRows'). Clearly states default output format ('clean markdown') and primary use case ('fetch webpage content for analysis'). No siblings to differentiate from, but scope is precisely defined.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Contains explicit 'When to enable options' section that maps specific technical conditions (React/Vue/Angular, 403 errors, delayed content loading) to parameter usage. Provides concrete decision trees for selecting js_render, premium_proxy, and other options. Includes practical JSON examples showing parameter combinations.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
