Click
browser_clickClicks a web element in a browser session using its reference from the latest page snapshot.
Instructions
Click an element by ref from the latest snapshot.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| ref | Yes | ||
| session | Yes |
browser_clickClicks a web element in a browser session using its reference from the latest page snapshot.
Click an element by ref from the latest snapshot.
| Name | Required | Description | Default |
|---|---|---|---|
| ref | Yes | ||
| session | Yes |
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
No annotations are provided, so the description carries the full transparency burden. It only states 'click an element' without disclosing potential side effects (e.g., navigation, page changes), error conditions (e.g., invalid ref), or whether the session must be active.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single concise sentence with no unnecessary words. It could be slightly expanded for clarity, but it remains efficient.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given two required parameters, no output schema, and no annotations, the description is minimal. It lacks details on return values, waiting behavior, frame handling, and error scenarios, making it incomplete for a click action tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 0%, and the description adds meaning for the 'ref' parameter (by stating it refers to an element from the latest snapshot) but does not explain the 'session' parameter beyond its existence. This partially compensates for the lack of schema descriptions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Click an element by ref from the latest snapshot,' which specifies the action (click) and the resource/context (element identified by ref from a snapshot). It distinguishes from siblings like browser_type (typing) and browser_navigate (navigation).
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus alternatives, nor when not to use it. The description implies a prerequisite (latest snapshot) but does not explicitly state that a snapshot must be taken first via browser_snapshot.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/apexradius/browser-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server