Skip to main content
Glama

Smart Tap (AI) [Pro]

smart_tap

Finds a UI element by natural language description and taps its center. Combines locating and tapping into one action, returning success status and the tapped element.

Instructions

[Pro] Finds a UI element by natural language description and taps it. Combines element finding and tapping into a single action. Example: smart_tap('the Sign In button') will locate the button and tap its center coordinates. Returns whether the tap succeeded and which element was tapped.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
device_idYesDevice serial ID
element_descriptionYesDescription of the element to tap, e.g. 'the Submit button' or 'the settings icon'
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Correctly describes the tool's behavior (finds element, taps its center) and return value (success flag and element tapped). With no annotations, it carries the full burden and does so well.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences covering purpose, combined nature, example, and return value. Concise and well-structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 2 simple parameters, no output schema, and no annotations, the description adequately explains the tool's behavior: finds and taps via natural language, returns success info. Minor gap: doesn't explain failure behavior or limitations (e.g., if element not found).

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema already covers both parameters with descriptions, so baseline is 3. The description provides a useful example mapping parameter to use but doesn't add additional semantics beyond what the schema and example provide.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it uses natural language to find and tap a UI element, combining two actions into one. The example makes the purpose concrete and distinct from siblings like 'find_element' or 'tap'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides an example with typical usage ('the Sign In button'), implying when to use it (when element description is known). However, it doesn't explicitly differentiate from alternatives like 'tap' or 'find_element' combined with 'tap', nor suggest when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/saranshbamania/mobile-device-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server