comet_screenshot
Capture screenshots of web pages during autonomous browsing sessions to document research findings and visual information.
Instructions
Capture a screenshot of current page
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||
Capture screenshots of web pages during autonomous browsing sessions to document research findings and visual information.
Capture a screenshot of current page
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
With no annotations provided, the description carries full burden but only states the basic action. It doesn't disclose behavioral traits such as what format the screenshot is captured in (PNG, JPEG), whether it requires specific permissions, if it captures the entire page or viewport, or how the result is returned. This leaves significant gaps for a tool that presumably produces output.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is a single, efficient sentence that front-loads the core action. There is zero waste, and every word ('capture', 'screenshot', 'current page') contributes directly to understanding the tool's function.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given no annotations, no output schema, and a simple but potentially output-producing tool, the description is incomplete. It doesn't explain what the screenshot output looks like (e.g., file, base64, path) or any behavioral nuances, leaving the agent with insufficient context to use it effectively beyond the basic action.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 0 parameters with 100% coverage, so no parameter documentation is needed. The description doesn't add parameter details, which is appropriate here, but it could mention implicit context (e.g., 'current page' refers to an active browser tab). Baseline is 4 for zero parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the action ('capture') and target ('screenshot of current page'), making the purpose immediately understandable. It doesn't differentiate from siblings like comet_tabs or comet_upload, but the verb+resource combination is specific enough for basic understanding.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus alternatives like comet_tabs (which might list tabs) or comet_upload (which might upload files). The description implies usage when a screenshot is needed, but offers no context about prerequisites, timing, or exclusions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/Thomasneatbiggers/Perplexity-Comet-MCP'
If you have feedback or need assistance with the MCP directory API, please join our Discord server