Skip to main content
Glama

dat_browse

Run a browser automation task from a natural language instruction, returning the result and optionally screenshots. Supports sync and async modes.

Instructions

Run a browser automation task on dat.ai. Give a natural language instruction and dat.ai will drive a real browser to complete it. Returns the result and any screenshot URLs. Uses sync mode by default (waits for completion, up to 10 min). Set async=true to get a task_id immediately and poll with dat_browse_status.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
taskYesThe instruction to execute in the browser, e.g. 'Open https://example.com and summarize the page'
asyncNoIf true, return a task_id immediately instead of waiting. Poll with dat_browse_status. Default: false
fanoutNoNumber of nodes to race the task on (1-10, default 1). Optional.
timeoutNoTask timeout in milliseconds (max 10800000 = 3 hours). Optional.
full_pageNoUsed by final_only mode. Capture full scrollable page. Default: true.
country_isoNoRoute to browsing nodes in this country (ISO code, e.g. 'US', 'DE'). Optional.
session_keyNoGroup browsing tasks into a shared session. Optional.
screenshots_modeNoScreenshot capture mode. final_only: one screenshot after completion. every_step: screenshot each step. on_navigation: screenshot on URL change. Default: none (no screenshots).
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses sync/async behavior, default mode, return format (result and screenshot URLs), and mentions optional parameters that affect behavior (fanout, country routing, session grouping, screenshot modes). It does not mention destructive potential or authentication needs, but the nature of the tool implies read/write possibilities in browsing. The description adds value beyond the schema by explaining the default wait time and polling pattern.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three sentences, each serving a distinct purpose: stating the action, describing output, and explaining modes. It is front-loaded with the core functionality and highly efficient with zero wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

While the description covers the main task and sync/async modes, it lacks details on error handling, return value structure (since no output schema is provided), rate limits, and authentication. For a tool with 8 parameters and no output schema, the description should provide more context on expected results and edge cases to fully inform an AI agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description only briefly mentions async mode and timeouts (up to 10 min in sync), adding marginal semantic value beyond parameter descriptions. It does not elaborate on other parameters like fanout, country_iso, or screenshots_mode in the prose itself.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the tool runs browser automation tasks from natural language instructions on dat.ai, and specifies that it returns results and screenshot URLs. It distinguishes sync and async modes but doesn't explicitly differentiate from sibling tools, though the role is evident.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides clear guidance on when to use sync vs async mode, including details that sync waits up to 10 minutes and async returns an ID immediately for polling with dat_browse_status. However, it does not discuss when to choose this tool over siblings like dat_browse_screenshot, though the sibling relationship suggests they are complementary rather than alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/willtholke/dat.ai-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server