AgentScrape
Server Details
Pay-per-call web scraping for AI agents via x402 on Base USDC. Six tools, no signup.
- Status
- Healthy
- Last Tested
- Transport
- Streamable HTTP
- URL
- Repository
- hshintelligence/agent-scrape
- GitHub Stars
- 1
- Server Listing
- AgentScrape
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 3.4/5 across 6 of 6 tools scored.
Each tool has a clearly distinct purpose: session creation, metadata extraction, AI-driven data extraction, multi-step workflows, content scraping, and screenshots. No overlapping functionality.
All tool names follow a consistent verb_noun pattern in snake_case (e.g., create_browser_session, extract_structured_data), making them predictable and easy to understand.
With 6 tools, the server is well-scoped for web scraping and browser automation tasks. Each tool serves a distinct need without being overwhelming or insufficient.
The tool set covers major web scraping operations: session management, content extraction, metadata, screenshots, and workflows. A minor gap is the lack of an explicit tool to close a browser session, but sessions are likely managed internally.
Available Tools
6 toolscreate_browser_sessionAInspect
Create a stateful browser session that persists cookies and localStorage across multiple scrape/workflow calls.
| Name | Required | Description | Default |
|---|---|---|---|
| ttl_seconds | No | Session TTL (default 1800, max 7200) |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Describes statefulness and persistence, but lacks details on lifecycle, expiration, or side effects. No annotations beyond title; bar is moderate.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Single sentence, no fluff, front-loaded with action and purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Sufficient for a simple tool with one optional parameter and no output schema. Could mention return value (session ID) but not required.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema covers 100% of parameters with adequate descriptions. The description does not add meaningful meaning beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Clearly states the tool creates a stateful browser session for persisting cookies and localStorage. Distinguishes from sibling tools which are for extraction, scraping, screenshots, and workflows.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implies usage across multiple scrape/workflow calls, but does not provide explicit when-to-use or when-not-to-use guidance, nor alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
extract_metadataBInspect
Extract page metadata: title, description, Open Graph, Twitter cards, JSON-LD, canonical URL, and all meta tags.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description adds value by enumerating what metadata is extracted (Open Graph, Twitter cards, JSON-LD, etc.). However, it does not disclose behavioral traits such as authentication needs, rate limits, error handling for invalid URLs, or any side effects (which are minimal). Annotations provide no readOnlyHint or destructiveHint, so the description carries the full burden and only partially fulfills it.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single sentence, front-loaded with the action, and lists the extracted items efficiently. No redundancy or unnecessary words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description lists what metadata is extracted but omits output format (e.g., JSON object structure) and any return value details. Given no output schema, the description should partially cover this. It is adequate but not complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The single parameter 'url' has no description in the schema (0% coverage). The tool description does not add any semantics about the URL format, restrictions, or required prefixes. It fails to compensate for the low schema coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the verb 'extract' and the resource 'page metadata', listing specific metadata types (title, description, Open Graph, etc.). This distinguishes it from sibling tools like `extract_structured_data` and `scrape_webpage` which have different scopes.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives. No context about prerequisites, limitations, or when to avoid using it. For example, it does not mention that this tool only extracts metadata and not full page content, which could be inferred but not explicitly stated.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
extract_structured_dataAInspect
AI-powered structured data extraction from any webpage using natural language. Returns JSON matching your prompt or schema.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to extract from | |
| prompt | Yes | Natural language description of what to extract | |
| schema | No | Optional JSON schema for the response | |
| wait_ms | No | ||
| wait_for | No |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations provide no safety hints. The description lacks disclosure of potential issues like rate limits, costs, latency, or accuracy of AI extraction. It does not state whether the tool modifies the page or is read-only.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Two sentences, no wasted words. Every sentence adds value: one explains what it does, the other the output format.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Covers core functionality but lacks details on wait parameters, error handling, limitations (e.g., dynamic content), and output guarantees. Adequate but not comprehensive given 5 parameters and no output schema.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 60%; url, prompt, and schema have descriptions. The description adds context that prompt and schema are complementary but does not explain wait_ms or wait_for, leaving gaps.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool extracts structured data from webpages using AI and natural language, returning JSON. It distinguishes from siblings like scrape_webpage (unstructured) and extract_metadata (metadata-focused).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies use for structured extraction via natural language or schema, but does not explicitly state when to avoid it or compare to alternative tools like scrape_webpage or screenshot_webpage.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
run_workflowBInspect
Execute a multi-step browser workflow atomically: navigate, click, type, wait, scroll, screenshot, extract, evaluate. Up to 20 steps.
| Name | Required | Description | Default |
|---|---|---|---|
| steps | Yes | Ordered list of workflow steps to execute | |
| viewport | No | ||
| session_id | No | ||
| persist_session | No |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations (no readOnly/destructive hints), description carries full burden. It mentions 'atomically' and 'Up to 20 steps' but fails to disclose failure behavior, session dependencies, or side effects like page state changes. Significant gaps in behavioral disclosure.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Single sentence, front-loaded with verb and resource. Concise but not overly brief; every part adds information. No wasted words.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given moderate complexity (4 params, no output schema, nested objects in steps), the description is inadequate. It does not explain return behavior, session requirements, or viewport effects. The user is left without enough context to use the tool correctly.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is only 25% (only 'steps' has a description). The tool description adds value by listing possible actions for steps, but does not explain other parameters like 'viewport', 'session_id', or 'persist_session'. A baseline 3 is appropriate as the description partially compensates for low coverage.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description clearly states 'Execute a multi-step browser workflow atomically' and lists specific actions (navigate, click, etc.), distinguishing from siblings that are single-action tools like screenshot_webpage or extract_structured_data.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Description lacks explicit guidance on when to use this tool vs single-action alternatives. 'Atomically' implies one-shot execution but does not mention that for individual actions, other tools should be used. No 'when not to use' or alternative references.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
scrape_webpageBInspect
Scrape any webpage and return content as markdown, html, text, or json. Pay-per-call web scraping for AI agents.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | The URL to scrape (http or https) | |
| format | No | Output format (default: markdown) | |
| wait_ms | No | Milliseconds to wait after page load (max 10000) | |
| viewport | No | Viewport size (default: desktop) | |
| wait_for | No | CSS selector to wait for before extracting |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations beyond title, the description carries full burden. It discloses 'pay-per-call' but omits important behavioral traits such as rate limits, authentication requirements, handling of JavaScript-heavy pages, or any destructive potential. Minimal disclosure.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is two concise sentences with no wasted words. Front-loads the main action and output, and adds context (pay-per-call) efficiently.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
For a tool with 5 parameters and no output schema, the description is incomplete. It does not explain return structure for JSON, error handling, or how parameters like wait_for or viewport affect scraping. More context is needed for an agent to use it effectively.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is 100%, so the schema already documents all parameters. The description adds little beyond restating output formats. It does not provide deeper meaning or usage tips for parameters like wait_ms or viewport.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('scrape') and resource ('any webpage'), and specifies output formats (markdown, html, text, json). It distinguishes itself from sibling tools like screenshot_webpage or extract_metadata, as it focuses on returning content.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description mentions 'pay-per-call' but does not provide explicit when-to-use or when-not-to-use guidance relative to alternatives like extract_structured_data or extract_metadata. The usage context is implied but not elaborated.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
screenshot_webpageBInspect
Capture a PNG screenshot of any webpage. Supports desktop, mobile, and tablet viewports, plus full-page mode.
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | ||
| wait_ms | No | ||
| viewport | No | ||
| wait_for | No | ||
| full_page | No | Capture full scrollable page (default: false) |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations provide no behavioral hints (readOnlyHint, destructiveHint). The description does not disclose whether the tool requires an existing browser session, or any side effects like network requests. For a read-like operation, stating it is non-destructive would help.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
A single, front-loaded sentence that efficiently conveys the core functionality and supported features. No superfluous text.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
With 5 parameters, no output schema, and low schema coverage, the description is too brief. It lacks details on return format (e.g., base64 PNG), dependency on create_browser_session, and behavior of timeouts or errors.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema description coverage is low (20%, only full_page documented). The description adds value by explaining viewport options and full-page mode, but does not clarify wait_ms or wait_for parameters. It only partially compensates for the schema gaps.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool captures a PNG screenshot of a webpage, with specific support for viewports and full-page mode. It distinguishes well from sibling tools like scrape_webpage (text extraction) and create_browser_session (session management).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage for visual screenshots but lacks explicit guidance on when to use versus alternatives (e.g., scrape_webpage for text, run_workflow for automation). No exclusions or prerequisites mentioned.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
{
"$schema": "https://glama.ai/mcp/schemas/connector.json",
"maintainers": [{ "email": "your-email@example.com" }]
}The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Control your server's listing on Glama, including description and metadata
Access analytics and receive server usage reports
Get monitoring and health status updates for your server
Feature your server to boost visibility and reach more users
For users:
Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
Centralized credential management – store and rotate API keys and OAuth tokens in one place
Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently
For server owners:
Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to successfully connect to the server. This can happen for several reasons:
The server is experiencing an outage
The URL of the server is wrong
Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.
Discussions
No comments yet. Be the first to start the discussion!
Your Connectors
Sign in to create a connector for this server.