VibeKit

by io.github.609NFT

Server Details

Build, deploy & manage hosted apps and chat with their AI coding agents from any MCP client.

Status: Healthy
Last Tested: 2026-07-01 03:34
Transport: Streamable HTTP
URL

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client

Glama

MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A3.8/5.0

Tool DescriptionsA

Average 4.2/5 across 36 of 36 tools scored. Lowest: 3.5/5.

Server CoherenceA

Disambiguation4/5

Most tools have clearly distinct purposes with descriptive names. The main potential confusion is between vibekit_chat (interactive agent) and vibekit_submit_task (autonomous task), but descriptions differentiate. Otherwise, lifecycle tools are well-distinguished.

Naming Consistency4/5

The vast majority follow a vibekit_verb_noun pattern (e.g., vibekit_create_app, vibekit_list_apps). A few exceptions like vibekit_account, vibekit_chat, and vibekit_deploy are simple nouns or verbs, breaking the pattern slightly but still readable.

Tool Count2/5

With 36 tools, the set is quite large. While the platform covers many features, this count exceeds typical MCP server scopes and may overwhelm agents. It could be trimmed by merging related actions (e.g., unified task operations).

Completeness4/5

The tool surface is highly comprehensive, covering account management, app lifecycle, database operations, deployments, tasks, schedules, skills, QA, and agent interaction. Minor gaps exist, such as lack of an update_app tool or domain management, but overall coverage is strong.

Available Tools

36 tools

vibekit_accountVibeKit Account InfoA

Read-onlyIdempotent

Inspect

Get VibeKit account info: plan, credit balance, and usage.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true and idempotentHint=true, indicating safe, idempotent behavior. The description adds that the tool returns plan, credit balance, and usage, which goes beyond annotations to specify the output. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, clear sentence of 12 words. It is front-loaded with the action and resource, and each word adds value. No unnecessary content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (no parameters, no output schema), the description adequately conveys the purpose and return value. It lists the key pieces of information returned (plan, credit balance, usage), making it complete for an AI agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The tool has zero parameters, so the baseline score is 4. The description does not need to add parameter meaning since none exist. It correctly focuses on the output.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Get VibeKit account info: plan, credit balance, and usage.' It uses a specific verb ('Get') and resource ('account info'), and lists the exact data returned. This distinguishes it from sibling tools, none of which target account info.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use the tool: to get account info. It does not provide explicit when-not-to-use or alternatives, but for a simple read-only tool with no parameters, the context is sufficient. The sibling list shows no overlapping functionality, so no confusion.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_agent_historyVibeKit Agent HistoryA

Read-only

Inspect

Get the recent chat history between the user and an app's AI agent.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)
`limit`	No	Maximum number of messages to return (default 20)

Tool Definition Quality

A3.5/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations declare readOnlyHint=true, indicating a safe read operation. The description ('get') aligns. Since annotations already cover the safety profile, the description adds no extra behavioral context (e.g., rate limits, auth).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

A single, well-structured sentence that is front-loaded with the core action. No extraneous words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With no output schema, the description should elaborate on the return format (e.g., structure of messages, sender info). It does not specify what 'recent' means or whether the response is a list; limit parameter implies pagination but is not explained. Adequate but incomplete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, with appId explained as 'from vibekit_list_apps' and limit with a default. The description does not add any parameter-level detail beyond what the schema provides, meeting the baseline for high schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (get), resource (chat history), and context (between user and app's AI agent). It distinguishes from sibling tools like vibekit_chat (sending messages) and other retrieval tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance on when to use this tool versus alternatives or exclusions. The description is purely functional without usage context.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_agent_statusVibeKit Agent StatusA

Read-only

Inspect

Check whether an app's AI agent is idle or working, and get its LIVE progress. Returns activity.status = the current human-readable build step (e.g. 'editing index.html', 'deploying'), activity.steps = the steps so far, and activity.done = whether the turn finished. Poll this every few seconds while a build is running and relay activity.status to the user so they see progress. Once activity.done is true the response also includes liveCheck = a server-side fetch of the deployed page ({ url, ok, httpStatus, title }); use it to confirm the app is live and renders — do NOT curl/fetch the URL yourself, your sandbox may block it.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.4/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true, and the description adds detailed behavioral context: returns live progress, steps, done flag, and liveCheck. It also warns about sandbox blocking, enhancing transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the main purpose, then explains fields and usage. It is concise but could be slightly tighter; still well-structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema exists, but the description fully explains return fields and their usage. It provides complete guidance for polling and acting on results.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% for the single parameter 'appId' with a clear description. The description adds no extra meaning beyond the schema, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly specifies the verb 'check' and the resource 'app's AI agent status', and the title 'VibeKit Agent Status' aligns. It distinguishes from siblings by focusing on live progress polling.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit guidance: 'Poll this every few seconds while a build is running' and warns against fetching the URL yourself. It does not explicitly contrast with alternative tools but context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_app_envVibeKit Get Env VarsA

Read-onlyIdempotent

Inspect

Get the environment variables configured for an app (names and values). Read-only — use vibekit_set_env to change them.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.7/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and idempotentHint=true. The description reinforces this with 'Read-only' and adds that it returns names and values. No contradiction; it adds useful context beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise with two sentences: the first clearly states the core functionality, and the second provides essential usage guidance. Every word adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple read-only getter with one required parameter and annotations already present, the description provides all necessary context: what it does, that it's read-only, and the sibling for mutations. No output schema needed due to simplicity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with the description of 'appId' as 'App ID (from vibekit_list_apps).' The tool description does not add additional meaning beyond what the schema already provides, so baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly specifies the verb 'Get' and the resource 'environment variables configured for an app (names and values).' It also distinguishes the tool from its sibling 'vibekit_set_env' by noting the read-only nature and directing to the sibling for changes.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states 'Read-only — use vibekit_set_env to change them,' providing clear guidance on when to use this tool vs. the sibling for mutation.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_app_logsVibeKit App LogsA

Read-only

Inspect

Fetch recent container/application logs for an app. Use this to debug crashes, errors, or a deploy that isn't serving.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)
`lines`	No	Number of log lines to retrieve (default 100)

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true, and the description confirms the read-only nature ('fetch...logs'). The description adds context about debugging but no additional behavioral traits beyond what annotations provide, so score is adequate.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence that efficiently conveys purpose and usage with no wasted words. It is front-loaded and easy to parse.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple log retrieval tool with two well-documented parameters and no output schema, the description covers the essential context: what logs are for and when to use. It is complete enough given the tool's simplicity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, with both parameters documented in the schema. The description does not add significant meaning beyond the schema, so baseline 3 applies.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Fetch' and the resource 'logs', and specifies the context (debugging crashes, errors, deploy issues). It distinguishes from siblings by focusing on log retrieval, which no other sibling tool does.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use the tool: to debug crashes, errors, or a deploy that isn't serving. It does not mention when not to use or alternatives, but given the sibling set, there is no alternative log tool, making the guidance sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_cancel_taskVibeKit Cancel TaskA

Idempotent

Inspect

Cancel a running task by taskId. No-op if it has already finished.

ParametersJSON Schema

Name	Required	Description	Default
`taskId`	Yes	Task ID to cancel (from vibekit_submit_task)

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=false, idempotentHint=true, destructiveHint=false. The description adds the specific no-op condition for finished tasks, which is useful context beyond annotations. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two short sentences that fully convey the purpose and behavior. No unnecessary words, front-loaded with the action.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple cancellation tool with one parameter and good annotations, the description is complete. It covers the action, idempotence, and parameter source. No output schema needed.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema has 100% coverage for the single parameter taskId. The description adds context: 'from vibekit_submit_task', which helps the agent understand the source of the ID. This goes beyond the schema's description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (cancel a running task) and resource (taskId), and distinguishes from sibling tools like vibekit_submit_task, vibekit_get_task, etc. The idempotent 'No-op if finished' adds precision.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says when to use (cancel a running task) and when not needed (if finished). Does not name alternatives, but the sibling list implicitly provides them. The idempotent hint is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_chatVibeKit Chat with AgentAInspect

Send a message to an app's built-in AI coding agent, which reads, writes, and modifies the app's code and redeploys it. Use for 'build/make a change to my app' requests. If the agent finishes quickly you get its reply directly. Otherwise you get status:'working' — do NOT resend; instead give the user LIVE progress: poll vibekit_agent_status every few seconds and relay the current step from activity.status ('editing the homepage…', 'deploying…') until activity.done, then read vibekit_agent_history for the final reply.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)
`message`	Yes	What you want the agent to do or change — be specific about features, design, and behavior.

Tool Definition Quality

A4.7/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Beyond annotations (readOnlyHint=false etc.), the description reveals asynchronous behavior with status:'working', instructs the calling agent on proper polling, and explains that the agent reads/writes/modifies code and redeploys it. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized, front-loads the purpose, and every sentence provides essential information. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's async nature and lack of output schema, the description fully explains the expected flow: quick reply, or polling for progress and final answer. It also references sibling tools (vibekit_agent_status, vibekit_agent_history) for complete guidance.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description adds context that appId comes from vibekit_list_apps and that message should be specific, but does not significantly add meaning beyond what the schema already provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb (send a message) and resource (app's AI coding agent), and distinguishes from siblings by specifying it's for 'build/make a change' requests. It also mentions the agent modifies code and redeploys, making the purpose unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says use for 'build/make a change to my app' requests. Provides clear instructions on handling async responses: if status is 'working', poll vibekit_agent_status instead of resending, then read vibekit_agent_history for final reply. This gives complete when-to-use and when-not-to-use guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_create_appVibeKit Create AppAInspect

Create a new hosted app from a starter template and deploy it live at .vibekit.bot. Call vibekit_list_templates first to pick a valid template. To deploy existing code from a GitHub repo instead, use vibekit_deploy.

ParametersJSON Schema

Name	Required	Description	Default
`template`	Yes	Template id from vibekit_list_templates (e.g. 'landing', 'dashboard', 'blog', 'saas', 'crud-api')
`subdomain`	Yes	Desired subdomain — the app goes live at <subdomain>.vibekit.bot. Lowercase letters, numbers, and hyphens.

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations indicate write operation (readOnlyHint=false). Description adds context about live deployment and subdomain usage. No contradictions, but could mention if app creation is free or requires authentication.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with purpose, then prerequisite and alternative. Zero redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Covers purpose, prerequisite, and alternative. No output schema, so return value (e.g., app ID) is missing, but otherwise complete for a creation tool with 2 params.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers both parameters fully. Description enhances by linking template to vibekit_list_templates and noting subdomain format constraints beyond schema description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Specifies verb 'create', resource 'hosted app from starter template', and outcome 'deploy live at subdomain.vibekit.bot'. Distinguishes from sibling vibekit_deploy.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly instructs to call vibekit_list_templates first and suggests vibekit_deploy as alternative for deploying existing GitHub code.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_create_scheduleVibeKit Create ScheduleAInspect

Create a recurring coding task that the AI runs automatically on a cron schedule (e.g. a weekly SEO pass). Use for 'every Monday, do X to my repo' requests.

ParametersJSON Schema

Name	Required	Description
`cron`	Yes	Cron expression, e.g. '0 9 * * 1' for every Monday at 9am UTC
`name`	No	Friendly name for the schedule (optional)
`repo`	Yes	GitHub repo as 'owner/repo'
`task`	Yes	What to do on each run, e.g. 'Improve SEO and page speed'

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations declare readOnlyHint=false, which aligns with creating a schedule. The description adds that it's a 'recurring coding task' run 'automatically on a cron schedule', implying persistence and automation. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences. The first sentence states purpose and resource with an example; the second gives usage guidance. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Lacks output schema but that's not required. The description covers what the tool does and when to use it. Could mention error handling or idempotency, but given sibling tools and annotations, it is sufficiently complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema has 100% description coverage, so baseline is 3. The description reuses the schema's parameter descriptions ('e.g. '0 9 * * 1' for cron) but adds no new meaning beyond the example.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('Create a recurring coding task'), the resource ('cron schedule'), and provides a specific example ('weekly SEO pass'). It distinguishes from sibling tools like delete_schedule and list_schedules.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'Use for 'every Monday, do X to my repo' requests', providing clear context. It does not explicitly exclude other uses or contrast with alternatives like submit_task for one-off tasks, but the example is sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_database_statusVibeKit Database StatusA

Read-onlyIdempotent

Inspect

Get an app's database status and connection info (whether it's provisioned, frozen, etc.).

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and idempotentHint=true. The description adds the information about what statuses are returned (provisioned, frozen), but does not disclose additional behavioral traits like response format or error conditions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence with 18 words, conveying the essential purpose without any fluff. It is appropriately front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple status-checking tool with one parameter, the description adequately explains what it does and what it returns. While it doesn't enumerate all possible statuses, it gives enough context for an agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with the appId parameter having a description. The description does not add new meaning beyond the schema, so baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb (Get), resource (database status/connection info), and provides examples (provisioned, frozen). It distinguishes this from sibling tools like vibekit_db_query or vibekit_db_schema.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for checking database status, but does not explicitly state when to use vs. alternatives like vibekit_enable_database. However, the context is clear enough for an agent to decide.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_db_queryVibeKit Run SQL QueryA

Destructive

Inspect

Run a SQL query against an app's Postgres database and return the result rows. Supports reads AND writes (INSERT/UPDATE/DELETE/DDL), so treat write or DDL statements as destructive and confirm them with the user first. Call vibekit_db_schema first to learn the tables. SQL string, max 5000 chars.

ParametersJSON Schema

Name	Required	Description	Default
`sql`	Yes	SQL to execute (max 5000 chars). Reads and writes both allowed.
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description aligns with annotations (readOnlyHint=false, destructiveHint=true) and adds specific behavioral details: supports INSERT/UPDATE/DELETE/DDL, max 5000 chars, and requires user confirmation for writes. This provides context beyond the annotations, such as the exact types of write operations allowed.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three sentences, each earning its place: purpose, write safety, and prerequisite. It is front-loaded with the core action and uses no filler. Very concise and well-structured.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers the essential aspects: operation type, destructive nature, character limit, and a prerequisite step. Despite no output schema, it mentions returning result rows. It could benefit from mentioning result format limits, but the current content is sufficient for an agent to use the tool safely.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100% for both parameters; the schema already documents the SQL max length and appId source. The overall description adds the prerequisite to call vibekit_db_schema but does not provide additional parameter-level meaning. Baseline of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb+resource: 'Run a SQL query against an app's Postgres database and return the result rows.' It explicitly mentions support for both reads and writes, distinguishing it from a read-only query tool. The advice to call vibekit_db_schema first further clarifies the workflow.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit when-to-use guidance: treats write/DDL as destructive and advises user confirmation. It also recommends calling vibekit_db_schema first, which is an alternative for learning schema. It does not explicitly exclude other siblings like vibekit_db_table, but the guidance is clear enough for safe use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_db_schemaVibeKit Database SchemaA

Read-onlyIdempotent

Inspect

Get an app's database schema — every table and its columns. Always call this before writing a query with vibekit_db_query.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and idempotentHint=true, so the description's job is to add context. It adds the prerequisite usage pattern, but doesn't mention other behavioral traits like authentication or rate limits. Still, it adds value beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, with no wasted words. The key action and usage guidance are front-loaded.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The tool has one parameter, clear schema, and annotations. The description mentions output ('every table and its columns'), compensating for no output schema. It is complete for a simple read tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with a well-described appId parameter. The description doesn't add extra meaning beyond what the schema already provides, meeting the baseline for high coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Get an app's database schema — every table and its columns,' which is a specific verb+resource. It distinguishes itself from sibling tools like vibekit_db_query and vibekit_db_table.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Always call this before writing a query with vibekit_db_query,' providing clear when-to-use guidance and naming an alternative.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_db_tableVibeKit Browse TableA

Read-onlyIdempotent

Inspect

Browse rows of one table with pagination and optional sorting — a quick read-only alternative to hand-writing a SELECT.

ParametersJSON Schema

Name	Required	Description
`appId`	Yes	App ID (from vibekit_list_apps)
`limit`	No	Max rows to return (default 50, max 200)
`table`	Yes	Table name (see vibekit_db_schema)
`offset`	No	Row offset for pagination (default 0)

Tool Definition Quality

A3.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint and idempotentHint. Description adds that the tool supports pagination and optional sorting, which are behavioral details beyond annotations. No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Very concise single sentence, front-loaded. However, mentions 'optional sorting' which is not reflected in the input schema, creating a potential inconsistency.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Adequate for a simple read tool with 4 parameters and no output schema. Covers purpose, pagination, and read-only nature. Lacks details on response format and the sorting claim is unsupported by schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions. Description mentions pagination (limit/offset) and sorting, but schema lacks a sort parameter, potentially misleading. Description adds little meaning beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clear verb 'browse' and resource 'rows of one table'. Mentions pagination and sorting. Positioned as 'quick read-only alternative to hand-writing a SELECT', which distinguishes it from writing raw SQL but lacks explicit differentiation from sibling tools like vibekit_db_query.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Implies usage for simple read-only browsing ('quick read-only alternative'), but no explicit when-to-use or when-not-to-use guidance, nor mentions alternatives like vibekit_db_query for complex queries.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_delete_appVibeKit Delete AppA

DestructiveIdempotent

Inspect

Permanently delete a hosted app — stops the container and removes its workspace, database, env vars, and subdomain. Irreversible (the GitHub repo is NOT touched). Confirm with the user before calling.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.4/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already set destructiveHint=true; description adds concrete details on what is destroyed (container, workspace, DB, etc.) and what is not (GitHub repo), plus confirmation note. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences front-load the action and scope, then list exclusions and guidance. Every sentence adds value with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given destructive nature, description covers deleted items and user confirmation. No output schema, but return format not critical for invocation. Could briefly note success/failure indication.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Single parameter appId has 100% schema coverage with description 'App ID (from vibekit_list_apps)'. The tool description adds no further meaning beyond the schema, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (permanently delete) and resource (hosted app), listing specific components removed (workspace, database, env vars, subdomain). It distinguishes from sibling tools like vibekit_stop_app by emphasizing irreversibility.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly advises user confirmation and notes irreversibility, setting clear context for invocation. Does not explicitly name alternatives but implies destructive nature relative to less drastic siblings.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_delete_scheduleVibeKit Delete ScheduleA

DestructiveIdempotent

Inspect

Delete a scheduled recurring task by scheduleId. Confirm with the user first.

ParametersJSON Schema

Name	Required	Description	Default
`scheduleId`	Yes	Schedule ID to delete (from vibekit_list_schedules)

Tool Definition Quality

A4.9/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate destructiveHint=true. The description adds valuable context by stating 'Confirm with the user first,' reinforcing the destructive nature and providing an important behavioral constraint. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise with two sentences: first stating the action, then giving the prerequisite. Every word is essential, and it is front-loaded with the main purpose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the single parameter, clear annotations, and no output schema, the description is complete. It covers the action, parameter source, and usage prerequisite, leaving no ambiguity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but the description adds meaning by specifying that scheduleId comes from vibekit_list_schedules, providing useful source information beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Delete a scheduled recurring task by scheduleId.' This provides a specific verb and resource, and it distinguishes from sibling tools like vibekit_create_schedule and vibekit_list_schedules.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly instructs 'Confirm with the user first.' This gives clear guidance on when and how to use the tool, emphasizing the need for user confirmation before destructive action.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_deployVibeKit DeployAInspect

Deploy a GitHub repo to VibeKit hosting, live at .vibekit.bot (Fargate container behind nginx, SSL automatic). Use when the code already lives in a GitHub repo; for a starter template instead use vibekit_create_app.

ParametersJSON Schema

Name	Required	Description	Default
`repo`	Yes	GitHub repo as 'owner/repo' (e.g. '609NFT/vibekit')
`subdomain`	Yes	Desired subdomain — the app goes live at <subdomain>.vibekit.bot. Lowercase letters, numbers, and hyphens.

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations (readOnlyHint=false, destructiveHint=false, idempotentHint=false) indicate a typical mutation without side effects. The description adds deployment infrastructure details (Fargate, nginx, automatic SSL) and the resulting URL, going beyond the annotations. However, it does not disclose whether the action overwrites previous deployments or triggers any background tasks.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences: the first describes the core action and result, the second provides usage context and alternative. Front-loaded and every sentence earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema is provided, and the description does not explain what the tool returns (e.g., deployment ID, status). Given the complexity of deploying a repository, more information about the outcome or follow-up steps would be helpful for the agent. Sibling tools like vibekit_list_deploys and vibekit_get_app partially compensate, but the description itself is incomplete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with clear descriptions for both parameters. The description adds minimal value beyond restating the subdomain pattern. Since the schema already does the heavy lifting, a baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action 'Deploy a GitHub repo to VibeKit hosting' and the outcome 'live at <subdomain>.vibekit.bot'. It specifies the technology stack (Fargate, nginx, SSL) and distinguishes itself from 'vibekit_create_app' by directing users to that sibling for starter templates.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicit instructions: 'Use when the code already lives in a GitHub repo; for a starter template instead use vibekit_create_app.' This provides clear context for when to use this tool and directs to an alternative, leaving no ambiguity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_enable_databaseVibeKit Enable DatabaseA

Idempotent

Inspect

Provision a Postgres database for an app (idempotent — safe if already enabled). Required before the vibekit_db_* tools work. Note: free-tier apps need the $3/mo Database add-on.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide idempotentHint=true and destructiveHint=false. The description adds that the operation is 'safe if already enabled', reinforcing idempotency. It also discloses the free-tier add-on requirement, which is a behavioral constraint. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences with no filler. The first sentence states the core action and idempotency. The second provides prerequisite context. Information is front-loaded and every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple tool with one parameter and no output schema, the description covers all necessary context: what it does, idempotency, prerequisite for free-tier, and its role in enabling other database tools. Annotations complement the description. There are no gaps that would confuse an agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The single parameter appId is already fully described in the schema ('App ID (from vibekit_list_apps)'). The description does not add additional meaning beyond the schema. Since schema coverage is 100%, the baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Provision' and the resource 'a Postgres database for an app'. It also includes the idempotency note, which distinguishes it from a non-idempotent create. Among sibling tools like vibekit_database_status, vibekit_db_query, etc., the purpose is unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states that this tool is 'Required before the vibekit_db_* tools work', providing clear context for when to use it. It also notes a prerequisite for free-tier apps ($3/mo add-on). However, it does not explicitly exclude usage scenarios or mention direct alternatives, but the sibling set does not contain a direct substitute.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_get_appVibeKit Get AppA

Read-onlyIdempotent

Inspect

Get full details for one hosted app — status, subdomain/URL, repo, runtime, and database/domain config. Use the appId from vibekit_list_apps.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint and idempotentHint. Description adds value by listing specific details returned (status, subdomain/URL, repo, runtime, database/domain config), which is transparent about what the tool fetches.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, each essential. First states purpose with key details, second gives usage hint. No filler.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a one-parameter get tool with no output schema, description explicitly lists what details are returned, making it complete enough for selection and invocation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema has 100% coverage with description for appId parameter. Description reinforces that appId comes from vibekit_list_apps, adding helpful context beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states 'Get full details for one hosted app' with specific verb and resource, and lists exact details returned. It distinguishes from siblings like vibekit_list_apps by specifying it's for a single app.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly instructs to use appId from vibekit_list_apps, providing clear prerequisite. No explicit when-not-to-use, but context implies it's for single app details, not listing.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_get_skillVibeKit Get SkillA

Read-only

Inspect

Fetch the full content of a specific skill — implementation patterns, code examples, and best practices for a domain. Fetch on-demand when you need guidance on a specific topic.

ParametersJSON Schema

Name	Required	Description	Default
`id`	Yes	Skill ID from vibekit_list_skills (e.g. 'nextjs', 'trpc', 'auth')

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and openWorldHint=true, so description adds no extra behavioral context beyond the content type (patterns, examples, best practices). No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two efficient sentences: first defines purpose, second guides usage. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema exists, but description covers return content (implementation patterns, code examples, best practices) and parameter source, making it sufficiently complete for a simple read tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with a clear parameter description ('Skill ID from vibekit_list_skills'). Description does not add additional parameter meaning beyond what schema provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses a specific verb ('Fetch') and identifies the resource ('full content of a specific skill'), clearly distinguishing from sibling 'vibekit_list_skills' which lists skills without full content.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly advises 'Fetch on-demand when you need guidance on a specific topic', providing clear context for use. However, it does not explicitly state when not to use or mention alternative tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_get_taskVibeKit Get TaskA

Read-only

Inspect

Get the status and result of a task submitted via vibekit_submit_task.

ParametersJSON Schema

Name	Required	Description	Default
`taskId`	Yes	Task ID returned from vibekit_submit_task

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint. Description adds 'status and result' context, which is consistent. No contradictions, but no additional behavioral details beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence, no redundant information, front-loaded with purpose. Efficient and clear.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple retrieval tool with one parameter and no output schema, the description is sufficient. It covers purpose and input origin.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% and schema already describes taskId as returned from vibekit_submit_task. Description adds no new parameter details beyond that.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'get', the resource 'task', and ties it to 'vibekit_submit_task', distinguishing it from sibling tools like cancel or list.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

It explicitly says 'of a task submitted via vibekit_submit_task', indicating when to use it. It does not explicitly mention alternatives, but the context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_list_appsVibeKit List AppsA

Read-onlyIdempotent

Inspect

List all hosted apps in your VibeKit account, each with its id, subdomain, status, and live URL. Start here to get an appId for the other app tools.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnly and idempotent, and description adds specifics on what data is returned (id, subdomain, etc.). No contradictions; transparency is good beyond structured fields.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, no redundancy, front-loaded with core purpose. Every word earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Fully adequate for a simple list tool with no parameters and no output schema. Covers purpose, output fields, and usage hint, meeting all requirements for completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters exist (schema coverage 100%), so description need not add parameter details. Baseline 4 for zero-param tools; description provides no additional semantic info, which is acceptable.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description explicitly states 'List all hosted apps' and specifies returned fields (id, subdomain, status, live URL), distinguishing it from sibling tools like get_app or create_app. The phrase 'Start here to get an appId' clarifies its role.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Directly advises using it to obtain an appId for other tools, providing clear when-to-use context. No explicit exclusions or alternatives, but the context is sufficiently clear for a list tool.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_list_deploysVibeKit List DeploysA

Read-onlyIdempotent

Inspect

List an app's recent deploys, newest first, with status and commit info. Find a deployId here to pass to vibekit_rollback_deploy.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)
`limit`	No	Max deploys to return (default 20, max 50)

Tool Definition Quality

A4.3/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint and idempotentHint, indicating safe, idempotent operation. Description adds no further behavioral context, but does not contradict. Adequate given annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences that are front-loaded with the core purpose and a secondary use case. No extraneous information, every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given low complexity (2 parameters, no nested objects), full schema, and annotations, the description fully covers what the agent needs: purpose, parameter hints, return content (status, commit info), and linkage to rollback tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. Description adds value by cross-referencing appId source (vibekit_list_apps) and specifying limit defaults implicitly. This extra context raises the score.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states verb (list), resource (deploys), and scope (recent, newest first, with status and commit info). It also distinguishes itself by mentioning its use for finding deployId for rollback, differentiating from sibling tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Description implies when to use (to see recent deploys or get deployId for rollback) but does not explicitly state when not to use or provide alternatives. However, context signals and sibling names make it clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_list_schedulesVibeKit List SchedulesA

Read-onlyIdempotent

Inspect

List all of the account's scheduled recurring tasks.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint and idempotentHint. Description adds no extra behavioral traits (e.g., pagination, ordering, rate limits) beyond confirming read-only nature.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single sentence, front-loaded with verb and noun, no fluff. Every word earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Tool has no parameters, no output schema, and annotations cover safety. Description fully conveys purpose for a simple list operation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters; schema coverage 100%. Baseline 4 applies as description needs no parameter info.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description states verb 'List', resource 'scheduled recurring tasks', and scope 'account's'. Clearly distinguishes from siblings like vibekit_list_tasks and vibekit_create_schedule.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Description implies usage for listing schedules, but no explicit when-to-use, when-not-to-use, or alternative tools mentioned. Context signals show many sibling tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_list_skillsVibeKit List SkillsA

Read-only

Inspect

List available implementation skills (IDs, names, descriptions, tags) from the VibeKit skills registry. Discover skills here before fetching one with vibekit_get_skill.

ParametersJSON Schema

Name	Required	Description	Default
`tag`	No	Filter skills by tag (e.g. 'react', 'database', 'security')

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate readOnlyHint=true and openWorldHint=true. The description adds that it returns specific fields but does not disclose additional behaviors such as pagination, rate limits, or ordering. It is consistent with annotations but adds only minor value.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise with exactly two sentences, front-loaded with the key action and purpose. No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description lists the fields returned (IDs, names, descriptions, tags) compensating for lack of output schema. However, it does not mention pagination or limits, which could be relevant for a list operation. Still, for a simple filtered list, it is mostly complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with the 'tag' parameter already described in the schema. The description does not add new meaning or examples beyond what the schema provides, so baseline score of 3 applies.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists skills with specific fields (IDs, names, descriptions, tags) and distinguishes from the sibling tool vibekit_get_skill by indicating it's for discovery before fetching.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Discover skills here before fetching one with vibekit_get_skill', providing clear context on when to use this tool. However, it does not mention when not to use it or offer alternatives beyond the implied sequence.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_list_tasksVibeKit List TasksA

Read-onlyIdempotent

Inspect

List recent coding tasks, optionally filtered by status.

ParametersJSON Schema

Name	Required	Description	Default
`limit`	No	Max number of tasks to return (default 10)
`status`	No	Filter by task status

Tool Definition Quality

A3.7/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint and idempotentHint, so the safety profile is clear. The description adds 'recent' context but doesn't disclose ordering or pagination behavior. Adequate but minimal extra value.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The single sentence is extremely concise and front-loaded with key information (verb and optional filter). No wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With no output schema, the description should help the agent understand the return structure, which it does not (e.g., fields of tasks). However, the tool is simple and common, so minimal completeness is acceptable.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds the 'recent' temporal context not in the schema, but otherwise does not elaborate on parameter semantics beyond what the schema provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action (list), resource (recent coding tasks), and optional filtering by status. It effectively distinguishes from sibling tools like vibekit_get_task (single task) and vibekit_submit_task (submission).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage for retrieving a filtered list of tasks but lacks explicit guidance on when to use this vs alternatives (e.g., vibekit_get_task for a specific task). No when-not or alternative names mentioned.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_list_templatesVibeKit List TemplatesA

Read-onlyIdempotent

Inspect

List the starter templates available to vibekit_create_app (e.g. landing, dashboard, blog, saas, crud-api). Call this before vibekit_create_app to choose a valid template.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.3/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate readOnlyHint and idempotentHint, covering safety. The description adds value by specifying the relationship with create_app and example outputs, but does not disclose further behavioral details like caching or rate limits.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loaded with purpose and examples. Every sentence earns its place with no waste.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no parameters and no output schema, the description is complete: it explains the tool's purpose, its relationship to create_app, and provides example values. Annotations further assure safety.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

There are no parameters (schema coverage 100%). The description provides additional context via examples of template names, helping the agent understand the output format, which justifies the baseline of 4.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'list' and the resource 'starter templates available to vibekit_create_app', with concrete examples (landing, dashboard, blog, etc.). It distinguishes itself from sibling tools by explicitly linking to vibekit_create_app.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'Call this before vibekit_create_app to choose a valid template', providing clear when-to-use guidance. While it does not mention when not to use or alternatives, the context makes it sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_qa_statusVibeKit QA StatusA

Read-only

Inspect

Get the latest automated QA results and status for an app.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide 'readOnlyHint: true', so the description mainly adds 'latest', indicating it returns only the most recent results. No additional behavioral traits (e.g., rate limits, data freshness) are disclosed, but the annotation covers the read-only aspect sufficiently.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

A single sentence of 10 words, front-loaded with the essential purpose, no redundancy. Every word earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple read-only tool with one parameter, the description covers the core purpose. However, it lacks details about the output format or possible status values, though no output schema exists to compensate. Slightly more context would be beneficial.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema already describes the only parameter 'appId' with a clear note that it comes from 'vibekit_list_apps'. The description does not add extra meaning beyond the schema, and coverage is 100%, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb 'Get', the resource 'latest automated QA results and status', and the scope 'for an app'. It effectively distinguishes from the sibling tool 'vibekit_run_qa' which triggers QA runs.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage after running QA (by saying 'latest results'), but there is no explicit guidance on when to use this tool versus alternatives like 'vibekit_run_qa' or when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_redeployVibeKit RedeployAInspect

Redeploy an existing app to pull and ship the latest code. Use after pushing changes to the app's GitHub repo.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A3.8/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate mutability (readOnlyHint=false) and non-idempotency (idempotentHint=false). Description adds no behavioral details about consequences like downtime, cancellation, or restart. Minimal additional value.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences with no waste. Front-loaded with action and purpose. Efficient and to the point.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Simple tool with single parameter, no output schema. Description covers action and trigger but lacks post-redeploy outcome details (e.g., app restart, status). Still fairly complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers parameter appId with description. Description reiterates no additional semantics beyond schema. Baseline score of 3 applies given 100% schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states verb 'redeploy', resource 'existing app', and purpose 'pull and ship the latest code'. Distinguishes from sibling tools like vibekit_deploy and vibekit_rollback_deploy.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'Use after pushing changes to the app's GitHub repo', providing clear when-to-use guidance. Does not mention when not to use or alternatives, but context is sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_restart_appVibeKit Restart AppA

Idempotent

Inspect

Restart a hosted app's container. Use when an app is wedged, or to apply changed environment variables.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.3/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare idempotentHint=true and destructiveHint=false, so the description need not repeat safety profile. The description adds that it restarts the container, but does not disclose potential side effects like data loss or downtime, which would add value.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences with no wasted words. Action first, then conditions. Highly efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple tool with one parameter and no output schema, the description together with annotations provides complete guidance for an agent to select and invoke it correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The single parameter appId has schema description 'App ID (from vibekit_list_apps)', which adds guidance on where to obtain the value. With 100% schema coverage, this is a good addition.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action 'Restart a hosted app's container' and provides specific use cases ('when an app is wedged, or to apply changed environment variables'). This distinguishes it from sibling tools like start_app and stop_app.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use the tool (wedged app, apply env changes). Does not provide negative guidance (when not to use), but positive context is sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_rollback_deployVibeKit Rollback DeployA

Idempotent

Inspect

Roll an app back to a previous deploy (get the deployId from vibekit_list_deploys). The container restarts on the rolled-back build; reversible by rolling forward again.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)
`deployId`	Yes	Deploy ID to roll back to (from vibekit_list_deploys)

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations indicate destructiveHint=false and idempotentHint=true. The description adds behavioral context: 'The container restarts on the rolled-back build; reversible by rolling forward again.' This goes beyond annotations by explaining the restart and reversibility. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, front-loaded with the primary action, then side effects and reversibility. No wasted words; structure is optimal for quick comprehension.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description covers the core operation and side effects. It does not specify return values or error scenarios, but for a simple rollback action, this is adequate. The action is clearly defined for this context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for both params. The description adds value by explicitly stating the source of deployId (from vibekit_list_deploys) and implying appId from vibekit_list_apps, which enhances understanding beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action: 'Roll an app back to a previous deploy'. It is specific about the resource (an app's deploy) and provides the source for the deployId, distinguishing it from siblings like vibekit_deploy, vibekit_redeploy, and vibekit_restart_app.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly tells the user to get the deployId from vibekit_list_deploys, providing clear sequential guidance. It also notes that the action is reversible by rolling forward again. However, it does not explicitly state when NOT to use this tool or list alternatives, though the context is sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_run_qaVibeKit Run QAAInspect

Kick off an automated QA run (headless-browser checks) against a hosted app. Poll vibekit_qa_status for results.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate readOnlyHint=false and destructiveHint=false, so the description's contribution is limited. It adds context that the run is automated, uses headless browsers, and is non-blocking (poll required). However, it doesn't disclose potential side effects like logs or cost, but given annotations, a 3 is appropriate.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise with two sentences: the first clearly states the tool's purpose, and the second provides the next step (polling). No unnecessary words; every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (one parameter, no output schema, clear workflow), the description is complete. It explains the tool's purpose, the action, and the follow-up step. No additional context is needed.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with the only parameter appId described as 'App ID (from vibekit_list_apps)'. The description adds no additional parameter semantics beyond the schema, so baseline 3 is justified.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool initiates an automated QA run with specific verb 'Kick off' and resource 'automated QA run (headless-browser checks)'. It also differentiates from sibling tools like vibekit_qa_status by indicating it's the trigger step, and the sibling is for polling results.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use this tool (to kick off QA checks) and what to do next (poll vibekit_qa_status for results). It doesn't explicitly mention when not to use or alternatives, but the context is sufficient for an AI agent to understand the workflow.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_set_envVibeKit Set Env VarsA

Idempotent

Inspect

Set or update environment variables for an app (merges with existing). The app must be restarted or redeployed for changes to take effect.

ParametersJSON Schema

Name	Required	Description	Default
`vars`	Yes	Key-value pairs to set as environment variables, e.g. { "API_KEY": "...", "NODE_ENV": "production" }
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.4/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations indicate idempotent and non-destructive. Description adds that it merges with existing variables and requires restart/redeploy, which are key behavioral details beyond annotations. No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, no unnecessary words. First sentence defines the action, second adds a critical requirement. Highly efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Covers the main purpose and post-action requirement. No output schema, but the description does not specify what the tool returns (e.g., success message). However, given the simplicity, it is fairly complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline 3. Description does not add further meaning to parameters beyond what the schema already provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clear verb 'Set or update' and specific resource 'environment variables for an app'. Distinguishes from sibling tools as no other tool sets env vars.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly mentions that updates merge with existing vars and that restart/redeploy is required for changes to take effect. Provides clear usage context but does not state when not to use or compare to alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_start_appVibeKit Start AppA

Idempotent

Inspect

Start a stopped hosted app's container. Use to bring an app stopped via vibekit_stop_app back online.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate idempotent and non-destructive behavior. The description adds the key behavioral constraint that the app must be in a stopped state, complementing the annotations without contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise with two sentences, no redundant information, and front-loads the primary action. Every word earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple tool with one parameter and no output schema, the description covers the essential context: what it does, the prerequisite (stopped app), and the related stop action. It lacks detail on expected return values or effects on app state, but these are largely implied.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema covers the single parameter with a clear description ('App ID (from vibekit_list_apps)'). The tool description does not add additional meaning beyond what the schema provides, which is adequate given 100% schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('start a stopped hosted app's container') and the specific resource ('app's container'), distinguishing it from sibling tools like vibekit_stop_app and vibekit_restart_app.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description specifies that it is used to bring an app stopped via vibekit_stop_app back online, providing clear context. However, it does not explicitly contrast with vibekit_restart_app or mention when not to use it, leaving slight ambiguity for edge cases.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_stop_appVibeKit Stop AppA

Idempotent

Inspect

Stop a hosted app's container. The app stays in your account and can be brought back with vibekit_start_app. Use to take an app offline without deleting it.

ParametersJSON Schema

Name	Required	Description	Default
`appId`	Yes	App ID (from vibekit_list_apps)

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With annotations idempotentHint=true and destructiveHint=false, the description adds value by confirming the app persists and can be restarted. No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three concise sentences, each adding essential information: action, non-destructive nature, and use case. No extraneous content.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple tool with one parameter and no output schema, the description fully covers what the agent needs to know: purpose, side effect, and when to use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with parameter description already specifying the source. Tool description adds no new parameter info beyond what schema provides.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the action (stop a hosted app's container) and distinguishes from siblings like vibekit_delete_app and vibekit_start_app by emphasizing the app stays in account.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says when to use (take app offline without deleting) and mentions alternative (vibekit_start_app to bring back). Could be stronger on when not to use, but sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_submit_taskVibeKit Submit TaskAInspect

Submit an autonomous coding task. The AI writes the code, commits to GitHub, and (by default) deploys to .vibekit.bot. Returns a taskId — poll vibekit_get_task or block with vibekit_wait_for_task. Use for 'build X' or 'change X' work against a repo.

ParametersJSON Schema

Name	Required	Description
`repo`	No	GitHub repo as 'owner/repo'. Optional — uses the user's current repo if omitted.
`task`	Yes	What you want built or changed. Be specific about features, design, and behavior.
`branch`	No	Git branch to work on (default 'main')
`deploy`	No	Auto-deploy to <subdomain>.vibekit.bot when done (default true)
`callbackUrl`	No	Optional webhook URL to receive a task-completion notification.

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Description reveals that the AI writes code, commits to GitHub, and deploys by default, going beyond annotations. It mentions async pattern with taskId. No contradictions with readOnlyHint=false, openWorldHint=true, destructiveHint=false.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences covering action, outcome, async flow, and usage. No wasted words; highly efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Covers key aspects: what it does, async polling/blocking, return value (taskId), deployment behavior. Could elaborate on commit behavior or deploy=false case, but sufficient given parameter descriptions and sibling tools.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description adds value by clarifying defaults (repo defaults to current, deploy defaults true, branch defaults main) and behavior (callbackUrl for notifications).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool submits an autonomous coding task that writes code, commits to GitHub, and deploys. It distinguishes from siblings like vibekit_get_task and vibekit_wait_for_task by noting polling/blocking, and from vibekit_deploy as a separate step.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'Use for 'build X' or 'change X' work', which guides when to use. Does not explicitly state when not to use, but the context of sibling tools makes it clear. Could add more exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

vibekit_wait_for_taskVibeKit Wait for TaskA

Read-only

Inspect

Block until a submitted task finishes (or the timeout), polling every 5 seconds, then return its result. Use right after vibekit_submit_task when you want the final result inline.

ParametersJSON Schema

Name	Required	Description	Default
`taskId`	Yes	Task ID to wait for (from vibekit_submit_task)
`timeoutSeconds`	No	Max seconds to wait (default 300; the remote connector caps this at 120)

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Discloses blocking behavior, polling interval, and timeout. Consistent with readOnlyHint annotation. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two efficient sentences, front-loaded with action, no wasted words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Simple tool with no output schema; description adequately explains behavior and return. Works well with sibling tools.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Adds context beyond schema: taskId origin from submit_task, timeoutSeconds default and cap. Schema covers 100% of parameters.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states it blocks until a task finishes or timeout, polls every 5 seconds, and returns the result. Distinguishes from siblings like vibekit_submit_task and vibekit_cancel_task.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says to use after vibekit_submit_task when wanting inline result. Provides timing context but doesn't explicitly mention when not to use or alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Resources

Need Help?