Glama

tap

Taps a specified UI element on iOS or Android using index, coordinates, or label text. Auto-dismisses keyboard when target is hidden behind it.

Instructions

Taps an element on the screen. Works on iOS and Android. Targets the element by index (from get_ui_tree), coordinates (x, y), or text. Auto-dismisses the keyboard if the target is physically hidden behind it (a safety measure; no retap is needed).
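As a rough illustration of the three targeting modes, the sketch below wraps `tap` arguments in an MCP `tools/call` JSON-RPC request. The `tap_request` helper is hypothetical, and the index, coordinates, and label values are made up; the server's response format is not documented on this page, so none is shown.

```python
import json
from itertools import count

_ids = count(1)  # monotonically increasing JSON-RPC request ids


def tap_request(**arguments):
    """Wrap tap arguments in an MCP tools/call JSON-RPC request (hypothetical helper)."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": "tap", "arguments": arguments},
    }


# Hypothetical values, one request per targeting mode.
by_index = tap_request(index=4)        # index taken from a prior get_ui_tree call
by_coords = tap_request(x=120, y=640)  # direct tap at (x, y)
by_text = tap_request(text="Sign in")  # match on the element's text/label

print(json.dumps(by_text, indent=2))
```

How the request is transported (stdio, HTTP, or a client SDK) depends on the MCP client in use and is left out here.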

Input Schema

| Name | Required | Description | Default |
|---|---|---|---|
| index | No | Index of the element from get_ui_tree | |
| x | No | X coordinate for a direct tap | |
| y | No | Y coordinate for a direct tap | |
| text | No | Text/label of the element to tap | |
| auto_dismiss_keyboard | No | If the target is hidden by the keyboard, dismiss the keyboard before tapping (default: true). Set to false to allow tapping directly on the keyboard. | |
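The schema marks every parameter as optional and does not say what happens when several targeting methods are combined. One cautious caller-side approach, sketched below, is to send exactly one method and to require `x` and `y` together. The `build_tap_arguments` helper, its rules, and the numeric types chosen for `index`, `x`, and `y` are assumptions for illustration, not documented server behavior.

```python
from typing import Any, Dict, Optional


def build_tap_arguments(
    index: Optional[int] = None,
    x: Optional[float] = None,
    y: Optional[float] = None,
    text: Optional[str] = None,
    auto_dismiss_keyboard: bool = True,
) -> Dict[str, Any]:
    """Build a tap arguments dict, enforcing one targeting method (caller-side assumption)."""
    # A coordinate tap only makes sense with both axes present.
    if (x is None) != (y is None):
        raise ValueError("x and y must be given together")

    methods_used = [index is not None, x is not None, text is not None]
    if sum(methods_used) != 1:
        raise ValueError("give exactly one of: index, (x, y), text")

    args: Dict[str, Any] = {}
    if index is not None:
        args["index"] = index
    if x is not None:
        args["x"], args["y"] = x, y
    if text is not None:
        args["text"] = text
    # Only send the flag when overriding the documented default (true).
    if not auto_dismiss_keyboard:
        args["auto_dismiss_keyboard"] = False
    return args


# Tapping a key on the keyboard itself: turn auto-dismiss off.
print(build_tap_arguments(text="q", auto_dismiss_keyboard=False))
```

Rejecting ambiguous combinations on the caller side avoids depending on whatever precedence the server applies, which this page does not specify.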

Behavior: 3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description discloses an important behavioral trait: the keyboard is auto-dismissed when the target is hidden behind it, which the parameter default alone does not fully convey. However, it does not describe error handling, what happens when multiple identification methods are provided, or the return value (there is no output schema). With no annotations, the description carries the full burden of disclosure but covers behavior only partially.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise with three sentences, front-loading the purpose. Each sentence adds a distinct piece of information (action, platform, methods, keyboard behavior). There is no redundancy or unnecessary detail.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple tap tool with 5 parameters and no output schema, the description provides adequate core context but omits details such as the coordinate system (absolute vs. relative), the recommended priority among identification methods, and how conflicts between them are resolved. The sibling tools list includes related gestures, but the description does not clarify when to use tap rather than long_press or swipe.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, so the baseline is 3. The description adds that 'index' comes from get_ui_tree, which is helpful, but this is implicit from the schema description. The auto-dismiss keyboard default is also described in the schema. The description does not add significant new meaning beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (tap), the target (an on-screen element), and three identification methods (index, coordinates, text). It also specifies platform support (iOS and Android). However, it does not explicitly distinguish this tool from siblings like long_press or swipe, though the verb 'tap' implies the basic action.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives such as long_press, swipe, or type_text. It does not mention prerequisites, limitations, or scenarios where this tool is inappropriate. The user must infer usage from the tool's name and basic action.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/nthImpulse/phantom-mcp'
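For callers who prefer a script to curl, a minimal Python sketch of the same GET request follows, using only the standard library. The `build_request` and `fetch_server` helpers are hypothetical names, and the sketch assumes the endpoint returns a JSON body, which this page does not state explicitly.

```python
import json
import urllib.request

SERVER_URL = "https://glama.ai/api/mcp/v1/servers/nthImpulse/phantom-mcp"


def build_request(url: str = SERVER_URL) -> urllib.request.Request:
    """Build the same GET request that the curl command sends."""
    return urllib.request.Request(url, method="GET")


def fetch_server(url: str = SERVER_URL, timeout: float = 10.0):
    """Fetch the server record and decode it, assuming a JSON response body."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_server()` performs the network request; any field names in the returned record would need to be checked against the actual response.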

If you have feedback or need assistance with the MCP directory API, please join our Discord server.