ChiefLab

Server Details

Open execution contract for agents doing business work. 6 operators following one spec. Approval-gated, tenant-isolated, MIT. Reference at chieflab.io/spec/v0.1.

Status: Healthy
Last Tested: 2026-06-02 22:30
Transport: Streamable HTTP
URL

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client

Glama

MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A3.7/5.0

Tool DescriptionsA

Average 4.3/5 across 32 of 32 tools scored. Lowest: 2.8/5.

Server CoherenceA

Disambiguation4/5

Most tools have distinct purposes, e.g., approve_action vs execute_approved_action vs publish_approved_post. However, alias overloading (e.g., chieflab_launch_product and chieflab_get_users_after_build pointing to the same handler) introduces some ambiguity. The detailed descriptions mostly mitigate confusion, but an agent might still struggle to choose between near-identical aliases.

Naming Consistency4/5

Tools predominantly follow a 'chieflab_verb_noun' pattern (e.g., chieflab_approve_action, chieflab_connect_provider). A few exceptions exist (chieflab_help, chieflab_inbox, chieflab_boot) that are single nouns, but these are clearly distinct and the overall consistency is high.

Tool Count3/5

32 tools is on the high side for an MCP server, but the domain of a growth/marketing launch platform naturally requires many operations (launch, approve, execute, measure, iterate, connect providers, etc.). The count is borderline but still manageable; it doesn't reach the 50+ extreme.

Completeness4/5

The tool set covers the full launch lifecycle: create, approve, execute, measure, and iterate. It includes provider connections, manual fallback, brain summary, and work requests. Minor gaps exist (e.g., no explicit tool for deleting a launch or revoking approval), but core workflows are fully supported.

Available Tools

45 tools

chieflab_approve_actionAInspect

[chieflab_* alias of chiefmo_approve_action] Approve one ChiefMO publish/send action so its executor can fire. USE WHEN the user — in IDE chat — said 'approve ' (e.g. 'approve linkedin', 'approve hn'), 'approve all', 'ship it', 'go ahead', or otherwise greenlit a specific draft you rendered. Match the user's words to the channel, look up agentGuide.renderInChat[channel].actionId from the launch response, and call this tool with that actionId. This is the IDE-native approval path — no need to push the user to the reviewUrl. Pass actionId (preferred) or id (legacy alias). P74: pass autoExecute: true AND the connector inputs (platforms for social / recipients + subject for email) to have the approval chain directly into execution — approve and ship in one tool call. Without autoExecute (or when connector is manual_handoff / blocked), the response includes executionPlan and the caller is expected to invoke the suggestedTool next.

ParametersJSON Schema

Name	Required	Description
`id`	No	Legacy alias for actionId.
`subject`	No	P74. For autoExecute on email: subject line override.
`actionId`	No	The action id from publishActions[].id in a launch_product response. Preferred.
`mediaUrls`	No	P74. Optional media URLs for the social post — auto-uploaded to Zernio's CDN before publishing.
`platforms`	No	P74. For autoExecute on social channels: Zernio platform descriptors, e.g. [{platform: 'linkedin', accountId: '<id>'}]. Required when autoExecute=true and mode=oauth_connector.
`recipients`	No	P74. For autoExecute on email: recipient email addresses.
`autoExecute`	No	P74. When true AND the action's executionMatrix.mode is oauth_connector / api_key_connector / native_api AND the connector is ready, the approve handler immediately invokes chieflab_execute_approved_action with the inputs you supply. The response includes the executor result + the unified proof ledger. Defaults to false (preserves the prior contract — agent calls execute_approved_action explicitly).

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries burden. It explains the effect (allows executor to fire) and mentions alias. Could disclose more about side effects but adequate for a simple action.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Four sentences, front-loaded with alias and purpose. Slightly verbose in first sentence but each sentence adds unique guidance. Could be tightened slightly.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool with no annotations, no output schema, and simple parameters, description covers when, how, and what parameters to use. Includes alias clarification and workflow steps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers 100% of parameters; description adds value by specifying preferred parameter (actionId over legacy id) and provides context on how to obtain actionId from launch response.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the tool approves a ChiefMO publish/send action, using specific verbs 'Approve' and 'fire'. It distinguishes from related tools like publish_approved_post and send_approved_email by describing the approval step.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use: when user says 'approve <channel>', 'approve all', 'ship it', etc. Provides step-by-step instructions and notes when not to use (no need for reviewUrl).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_bootAInspect

Alias of chieflab_status. Use as the FIRST tool when an agent session starts on a workspace that already has activity — recovers all open business loops with literal user commands. Same response shape as chieflab_status, same handler. If the user asked to launch the current repo and a recovered open loop looks unrelated, do not blindly resume it; start a fresh launch for the current repo.

ParametersJSON Schema

Name	Required	Description
`projectName`	No	Optional current project/repo name.
`workspaceId`	No	Optional workspace id.
`currentRepoContext`	No	Optional current repo/project context. Helps agents compare recovered loops against the project the user is asking about.

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. It states same response shape and handler as chieflab_status, implying it is read-only (status-like). However, 'recovers' could imply mutation; more explicit safety info (e.g., non-destructive) would improve clarity.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences efficiently convey purpose and usage. Front-loaded with key info. Slightly verbose with 'same response shape...same handler' but acceptable.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool that is an alias with one optional parameter and no output schema, the description covers the essential use case and behavior. Could mention if any side effects occur, but as a status-like tool, it's sufficiently complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has one optional parameter with 100% description coverage. The description adds no additional meaning beyond the schema, so baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is an alias of chieflab_status, with a specific purpose: to be used as the first tool when an agent session starts on an active workspace, recovering business loops. This distinguishes it from siblings like chieflab_status through naming and usage context.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use: 'as the FIRST tool when an agent session starts on a workspace that already has activity'. Also specifies that it recovers open business loops. While it doesn't explicitly mention when not to use, the context implies it's for sessions with prior activity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_brain_summaryAInspect

USE WHEN the user asks 'what do you remember about my brand?' / 'show me my brain' / 'what have you learned?'. Returns a plain-English paragraph summarizing what the per-workspace brain has accumulated: launch count, top-performing channels, brand voice patterns from approved drafts, what's been rejected and why, channel-specific performance, recent proof points. The moat made visible. Pair with chieflab_brain_read for the raw structured data.

ParametersJSON Schema

Name	Required	Description	Default
`tenantId`	No	Optional tenant scope.
`workspaceId`	No

Tool Definition Quality

A3.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It describes the output format and content but does not disclose whether the tool has side effects, permissions required, or behavior when the brain is empty. It implies a read-only operation but lacks explicit safety guarantees.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is four sentences long, front-loaded with usage context, and each sentence adds value. No redundant or filler wording.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description adequately explains the tool's purpose and output, but lacks parameter documentation and behavioral details. For a simple summary tool, it is minimally complete, but the missing parameter semantics and absence of preconditions reduce completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema covers only 1 of 2 parameters with a description (tenantId), and the description adds no parameter-specific meaning. It mentions 'per-workspace brain' implying workspaceId is important, but does not explain tenantId or the relationship between the two.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description specifies exact user queries that trigger this tool and clearly states it returns a plain-English summary of accumulated brand memory, including specific content categories. It distinguishes from sibling chieflab_brain_read by indicating that tool provides raw structured data.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly lists trigger phrases ('what do you remember about my brand?', etc.) and advises pairing with chieflab_brain_read for raw data. It does not explicitly state when not to use, but the 'USE WHEN' pattern provides clear context.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_check_measurement_dueAInspect

P71 — measurement queue inspector. USE WHEN the agent wants to know which executed actions are ready for their 24h readback (regardless of how they were executed — native connector OR manual paste). Lists actions where metadata.proof.measurementDueAt <= now AND metadata.proof.measuredAt is unset. Returns: [{actionId, runId, channel, executedAt, measurementDueAt, artifactUrl, executionStatus, recommendedNextTool}]. Pairs with chiefmo_post_launch_review which the cron also calls automatically; this tool surfaces the same queue to a foreground agent so it can opportunistically pull metrics during a session instead of waiting for the next cron tick.

ParametersJSON Schema

Name	Required	Description	Default
`runId`	No	Optional. Restrict to one run.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. It reveals filtering criteria (measurementDueAt <= now, measuredAt unset), return fields, and relationship to cron tool. Does not mention permissions or destructiveness, but mutation is not implied. Adequate transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is a single paragraph, front-loaded with purpose. Contains multiple specific details without excessive verbosity. Could be slightly more structured but is concise.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, description lists return fields (actionId, runId, etc.). Explains condition for queue inclusion and relates to sibling tool. Covers both native connector and manual paste. Complete for agent understanding.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for both optional parameters. Description adds no further detail beyond the schema. Baseline of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states 'measurement queue inspector' and specifies when to use: to know which executed actions are ready for 24h readback. It distinguishes from sibling tool chiefmo_post_launch_review by explaining their relationship.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'USE WHEN the agent wants to know which executed actions are ready for 24h readback'. It pairs with another tool and explains the cron automation context. Lacks explicit 'when not to use', but positive guidance is strong.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_connector_statusAInspect

USE WHEN the user just completed a connector OAuth flow and you want to confirm it succeeded — 'did my connection work?', 'is my Zernio/HubSpot/Stripe connected?'. Returns status: pending | connected | failed for the given connectionId.

ParametersJSON Schema

Name	Required	Description	Default
`workspaceId`	No
`connectionId`	Yes

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses the return value format and that it's a read operation for checking status. However, it does not mention error handling, authentication requirements, or what happens if the connectionId is invalid, leaving some behavioral gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence with a clear usage prefix and example. No unnecessary words, front-loaded with key information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and few parameters, the description covers the essential purpose and usage. It lacks details on error cases or edge behaviors, but for a simple status check it is largely sufficient.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema has two parameters with 0% description coverage. The description mentions 'for the given connectionId' but does not explain workspaceId at all. Given the low coverage, the description should provide more clarity on workspaceId's role and usage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool checks connector OAuth status, gives example queries, and lists possible return values. It distinguishes itself from siblings like chieflab_connect_connector and chieflab_connector_setup_steps by focusing on post-OAuth confirmation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly begins with 'USE WHEN' and provides specific user queries that trigger this tool. It does not mention when not to use or alternatives, but the context is clear enough for an agent to decide.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_connect_providerAInspect

Connect a third-party provider (Zernio, Resend, GA4, Search Console, HubSpot, Stripe, Linear, Notion, Slack) to this workspace. USE WHEN the user wants to wire up publishing, email sending, or analytics readback. For OAuth providers (ga4 / search_console / hubspot) returns an authorizeUrl the agent surfaces to the user. For API-key providers (zernio / resend) returns instructions for the set-key tool. Without this, publish/send/measure tools return 'configure first' errors.

ParametersJSON Schema

Name	Required	Description	Default
`provider`	Yes	Provider id: zernio \| resend \| ga4 \| search_console \| hubspot \| stripe \| linear \| notion \| slack

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. Discloses that for OAuth providers it returns an authorizeUrl for the user, and for API-key providers it returns instructions for a separate set-key tool. Implies the tool is a prerequisite for other actions. Reasonably transparent, though could mention if it modifies workspace state permanently.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single paragraph, starts with the main action, then usage guidance, then behavioral specifics. All sentences earn their place. No fluff. Could be slightly more concise by splitting into bullet points, but appropriate for current format.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema, so description explains the two possible return types. Covers error scenario ('configure first' errors). Lists all provider options. Sufficient for an agent to use this tool correctly. Minor omission: doesn't specify if the tool checks connectivity or just saves configuration.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with a description for 'provider' parameter. The description adds value by grouping providers into OAuth vs API-key categories and explaining the behavioral difference, which helps the agent choose the right response path. This goes beyond the simple enum list in the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states 'Connect a third-party provider to this workspace' and lists specific providers (Zernio, Resend, GA4, etc.). Distinguishes from siblings by describing the overall purpose, though could explicitly contrast with chieflab_connect_connector. The USEWHEN clause helps clarify scope.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit use-case statement ('USE WHEN the user wants to wire up publishing, email sending, or analytics readback'). Categorizes providers by auth type (OAuth vs API-key) and tells agent what to do with the response. Also warns about prerequisite errors for other tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_continue_launch_loopAInspect

[chieflab_* alias of chiefmo_continue_launch_loop] Resume a ChiefLab launch loop from runId. USE WHEN an agent has already called chieflab_get_users_after_build / chiefmo_launch_product and needs the exact next action: surface reviewUrl, execute an approved action, wait for measurement, measure results, or prepare the next move. Default response is summary-sized: reviewUrl + action ids, not full draft bodies. Pass responseShape:"full" only for debug/export.

ParametersJSON Schema

Name	Required	Description
`runId`	Yes	The launchId/runId returned by chieflab_get_users_after_build or chiefmo_launch_product.
`workspaceId`	No	Optional workspace id. Hosted auth usually supplies this.
`responseShape`	No	summary (default, small — returns reviewUrl + openApprovalIds without draft bodies) \| full (inline full approval objects; large, debug/export only).
`includeFullArtifacts`	No	Alias for responseShape:"full". Use only for debug/export.

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so description carries full burden. It mentions statefulness but does not disclose whether the tool has side effects, is read-only, or any other behavioral traits. Basic context is given but lacks depth.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is a single paragraph but packs necessary information efficiently. Front-loaded with clear action. Slight room for improvement in structure (e.g., bullet points) but not overly long.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity of a launch loop resumption tool and absence of output schema or annotations, the description provides a reasonable overview but lacks details on return values, error conditions, and exact behavioral outcomes.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. Description does not add additional meaning beyond the schema; it merely restates the schema description for runId.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the tool resumes a launch loop from a runId, with specific verb and resource. It distinguishes from siblings by indicating it's the next step after chieflab_get_users_after_build or chiefmo_launch_product.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use: after certain prior tools, and what the tool can do (surface reviewUrl, execute approved action, etc.). Also provides rationale for existence (stateless agents).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_create_next_move_actionAInspect

P75 — turn a Next Move suggestion into an approval-gated draft action. USE WHEN you've called chieflab_suggest_next_move and the suggestion's kind is not 'wait' or 'noop'. Creates an actionStore entry with status='awaiting_approval', the suggested draft body inline, and an executionMatrix that points at the right next-execution path. The reviewer sees the new card in the Launch Room / IDE chat like any other approval card — same approve / revise / reject flow. Closes the loop: launch → measure → next move → approve → execute → repeat.

ParametersJSON Schema

Name	Required	Description
`runId`	Yes	Required. The originating run id (the next-move action is tagged with this so the brain can compound launch → measure → next-move chains).
`channel`	Yes	Channel the measurement was for (used to ground the suggestion).
`metrics`	Yes	Same metrics object passed to chieflab_suggest_next_move.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full responsibility for behavioral disclosure. It effectively explains that the tool creates a draft action with status 'awaiting_approval', includes the suggested draft body inline and an executionMatrix, and that the result appears in approval interfaces. While it doesn't cover error conditions or side effects, it sufficiently conveys the immediate outcome.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise and well-structured, with the core purpose front-loaded in the first sentence. It uses efficient language without redundancy, covering conditions, outcome, and workflow in a compact paragraph that is easy to parse.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (4 parameters, one nested) and the lack of output schema, the description provides sufficient information about what the tool does and produces. It explains the resulting actionStore entry, its status, and the review process. However, it omits details about error handling or the exact structure of the executionMatrix, which would enhance completeness.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, providing a baseline of 3. The description adds value by explaining the runId parameter's role in chaining ('so the brain can compound launch → measure → next-move chains') and explicitly noting that the metrics object is the same as passed to chieflab_suggest_next_move. This additional context enhances understanding beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: turning a Next Move suggestion into an approval-gated draft action. It specifies the condition for use (suggestion's kind not 'wait' or 'noop') and the outcome (actionStore entry with status='awaiting_approval'), effectively distinguishing it from sibling tools like chieflab_approve_action and chieflab_execute_approved_action.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use this tool: 'USE WHEN you've called chieflab_suggest_next_move and the suggestion's kind is not 'wait' or 'noop'.' It also contextualizes the tool within the broader loop: 'Closes the loop: launch → measure → next move → approve → execute → repeat.' This provides clear guidance on the tool's role and prerequisites.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_create_work_requestAInspect

USE WHEN your operator hits a gap it can't fulfill itself and wants to route the work to another operator — 'I need a demo video for this launch', 'request design help for the hero image'. Creates an open work request that other operators can offer against. Once an offer is submitted, the requester (or their human) reviews + accepts via chieflab_accept_offer (which then creates an approval-gated action for the spend).

ParametersJSON Schema

Name	Required	Description
`brief`	No	Structured brief — { goal, references, constraints, asset_links, etc. }. The more specific, the better the offers.
`budget`	No	Optional. { kind: 'max_usd', amount: 50 } \| { kind: 'quote' } \| { kind: 'free' }.
`deadline`	No	Optional ISO 8601 timestamp.
`sourceRunId`	No	Optional. The parent runId this work attaches to (e.g. the launch that needs the asset).
`workspaceId`	No
`capabilityNeeded`	Yes	Capability slug to match (e.g. 'demo_video'). Should match a published capability — call chieflab_search_capabilities first if unsure.
`sourceOperatorId`	No	Optional. The operatorId that's requesting (e.g. 'chieflab-launch').

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It explains that a work request is created and can be offered on, but does not disclose details like permission requirements, rate limits, or any side effects. Adequate for a creation tool but not comprehensive.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise, with four sentences that front-load the usage condition. Every sentence adds value without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description explains the workflow (create request, then offer, accept) but does not specify the tool's return value (e.g., the created request ID). Given no output schema, a bit more detail on the response would be helpful.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is high (86%) with detailed parameter descriptions, so the description adds limited value for parameters. It provides context for the tool's purpose but does not explain parameter semantics beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description begins with a clear 'USE WHEN' statement, specifying the condition (operator hits a gap) and providing concrete examples. It differentiates from siblings by outlining the workflow and mentioning chieflab_accept_offer.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description gives explicit conditions for use ('USE WHEN your operator hits a gap') and mentions the subsequent step (accept offer). It lacks an explicit 'do not use' guideline, but the positive guidance is strong.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_execute_approved_actionAInspect

P71 — closed-loop executor wrapper. USE WHEN chiefmo_approve_action just succeeded on an action whose executionMatrix.mode is oauth_connector or api_key_connector AND connectorReady is true. Routes to the right native executor (chiefmo_publish_approved_post for social / chiefmo_send_approved_email for email), runs the execute-time sanity gate (re-classifies the action's stored body to catch DEGRADED MODE / clarification / refusal text that slipped through draft-time classification), captures the executor's result, writes a standardized metadata.proof shape ({executedAt, artifactUrl, artifactId, executionStatus, failureReason, measurementDueAt}) on the action, and schedules the 24h followup via launchFollowupStore so chiefmo_post_launch_review fires automatically. Returns a uniform shape regardless of channel so the agent / Launch Room doesn't have to branch on connector type. For manual_handoff actions, returns guidance to call chieflab_record_manual_publish with the live URL instead. For blocked actions, returns the connector-wiring instruction.

ParametersJSON Schema

Name	Required	Description
`subject`	No	For email sends: subject line override. Defaults to the action's stored subject when present.
`actionId`	Yes	The publishAction.id whose executionMatrix you want to execute. The action must be in status='approved' (call chiefmo_approve_action first).
`mediaUrls`	No	Optional media URLs (image / video) for social posts. Zernio uploads them to its CDN before posting.
`platforms`	No	For oauth_connector (social): Zernio platform descriptors, e.g. [{platform: 'linkedin', accountId: '<id>'}]. Required for social posts unless the workspace has a default mapping; the tool refuses with reason='platforms_required' otherwise.
`recipients`	No	For api_key_connector (email): recipient email addresses. Required for email sends unless the action carries a stored recipient list.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. It details internal steps: sanity gate re-classifies body, captures executor result, writes metadata.proof shape, schedules 24h followup. It mentions uniform return shape. Lacks explicit statement about mutability, but overall thorough.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is long but information-dense. It front-loads the core purpose but includes many details in a single paragraph. Could be more concise by splitting into clear steps or using bullet points. Adequate but not exemplary.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 6 parameters, no output schema, and no annotations, the description covers preconditions, internal workflow, return shape, and edge cases (manual, blocked). It explains the uniform return shape without output schema. Minor gap: no explicit error behavior, but still quite complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%. The description contextualizes parameters (e.g., mediaUrls for social, platforms required for social, recipients for email) but does not add significant meaning beyond the schema's own descriptions. Baseline score of 3 applies.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is a closed-loop executor wrapper that routes to appropriate native executors for social/email, runs sanity gates, captures results, writes standardized metadata, and schedules follow-up. It distinguishes from siblings by specifying the exact conditions (oauth_connector, api_key_connector) and handling manual_handoff/blocked actions.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'USE WHEN chiefmo_approve_action just succeeded on an action whose executionMatrix.mode is oauth_connector or api_key_connector AND connectorReady is true.' Also provides guidance for manual_handoff (call chieflab_record_manual_publish) and blocked actions (returns connector-wiring instruction).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_force_measurement_dueAInspect

P93 — admin escape hatch. Forces a proof_ledger row's measurement_due_at into the past so the cron's next tick picks it up. USE WHEN smoke testing the launch → measure → next-move chain without waiting 24h, OR an ops user needs to retry a stuck measurement. Gated by CHIEFLAB_ADMIN_TOKEN header (same gate as /api-keys/issue). Refuses if the row is already measured. Pass dueAt (ISO string) to set a specific time; defaults to now - 60s.

ParametersJSON Schema

Name	Required	Description
`dueAt`	No	Optional ISO timestamp. Defaults to (now - 60s) so the next cron tick picks it up.
`proofId`	Yes	proof_ledger row id (from chieflab_record_manual_publish or chieflab_execute_approved_action response).
`adminToken`	Yes	Required. Must match CHIEFLAB_ADMIN_TOKEN env var.

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. It discloses the forced update behavior, refusal condition (already measured), authentication requirement (admin token), and default timing. Could mention rollback or side effects, but adequately transparent for a mutation tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three concise, well-structured sentences. First sentence states purpose and context, second gives use cases, third adds constraints. No wasted words, front-loaded with key information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, description explains core behavior (forces due date into past, refuses if already measured) and authentication. Lacks return value details but adequate for a simple forced-update tool with clear input semantics.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers all parameters with descriptions, but description adds valuable context: proofId origin sources, adminToken matching env var, dueAt default behavior (now - 60s). This goes beyond schema and enhances understanding.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description explicitly states it's an admin escape hatch that forces a proof_ledger row's measurement_due_at into the past, clearly differentiating from siblings by specifying when to use and referencing related tools like chieflab_record_manual_publish or chieflab_execute_approved_action for getting proofId.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states two use cases: smoke testing the launch-measure-next-move chain without waiting 24h, and retrying a stuck measurement. Also notes gated by admin token and refuses if already measured, providing clear guidance on when to invoke.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_get_capabilityAInspect

USE WHEN you have a specific operatorId + capability and want the full machine-readable profile (inputs, outputs, pricing, approval-required flag, endpoint). Pre-flight check before chieflab_create_work_request.

ParametersJSON Schema

Name	Required	Description	Default
`capability`	Yes
`operatorId`	Yes

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries the full burden. It discloses what the tool returns (inputs, outputs, pricing, approval-required flag, endpoint) and labels it a 'Pre-flight check', strongly indicating a read‑only operation. However, it does not explicitly state that it is non‑destructive, idempotent, or whether authentication is required, leaving minor gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, no wasted words. The first sentence immediately states the usage condition and what the tool returns ('full machine-readable profile'). The second sentence adds critical context about its role as a pre-flight check. Information is front‑loaded for quick parsing.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple retrieval tool with only two string parameters and no output schema, the description covers all essential aspects: when to use, what inputs are needed, what the output contains, and how it fits into the larger workflow (pre‑flight for create_work_request). No critical omissions given the tool's low complexity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 0% description coverage, but the description explicitly names both parameters ('operatorId + capability') and explains that they together identify which capability to fetch. It adds meaning beyond the raw schema by clarifying their joint purpose, though it does not detail expected formats or constraints (e.g., string length).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description explicitly states that the tool retrieves the full machine-readable profile for a specific operatorId+capability pair. It names the exact components returned and positions itself as a pre-flight check before chieflab_create_work_request, clearly distinguishing it from sibling tools like search or match.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description starts with 'USE WHEN you have a specific operatorId + capability', providing a clear condition for use. It also names a specific tool (chieflab_create_work_request) as the next step, implicitly telling the agent when not to use this tool (if the goal is creation without checking the profile first) and offering an alternative path.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_get_company_profileAInspect

P88 — read the workspace's stored company profile. Returns { profile, source, confidence, updated_at }. Use before launch to confirm what ChiefLab will ground in.

ParametersJSON Schema

Name	Required	Description	Default
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description bears full burden. States it's a read operation (non-destructive) and returns specific data, but lacks details on authorization requirements or rate limits.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two efficient sentences: first defines purpose and return structure, second provides usage context. No unnecessary words.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple read tool with one optional parameter and no output schema, the description sufficiently covers purpose, return format, and when to use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema already describes parameter as 'Optional workspace id.' with 100% coverage. Description adds no additional information about parameter behavior or constraints.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states it reads the workspace's stored company profile and specifies return fields. Distinguishes from siblings like set_company_profile (write) and launch_product (uses this data).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly recommends using before launch to confirm grounding. Does not mention when not to use or alternatives, but context is clear given siblings.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_get_publishing_targetsAInspect

P79 — read the workspace's stored publishing target defaults. Returns { channelTargets, store }. Useful before launch so the agent can surface 'You haven't connected an Instagram account yet — set it via chieflab_set_publishing_targets' instead of blocking on first auto-execute.

ParametersJSON Schema

Name	Required	Description	Default
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Describes read-only operation and return format, sufficient given no annotations. No side effects or special permissions noted.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Concise single sentence with example usage, but includes cryptic reference 'P79' that adds no value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Covers purpose, return format, and usage context; no output schema needed. Adequate for a simple read tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema provides 100% coverage for optional workspaceId parameter. Description adds no extra info beyond schema, so baseline 3.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description states 'read the workspace's stored publishing target defaults' with clear verb and resource, and returns specific structure. It is distinct from sibling chieflab_set_publishing_targets.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Indicates it is 'useful before launch' and suggests alternative action when targets missing, providing context for use vs. other tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_get_users_after_buildAInspect

Outcome-named alias for chiefmo_launch_product. USE WHEN a coding agent just built, shipped, or deployed something and the user asks 'get users', 'find customers', 'make money from this', 'get paying users', 'get eyes on this', 'what now?', or 'how do I launch this?' This is the agent-dependency tool: after build, call this to turn the product into a repo-grounded money/distribution loop with launchPack, signed reviewUrl, approval-gated publish/send actions, 24h measurement, memory, next move, moneyLoop, and agentDependency. Gather repoContext first. Route exactly like chiefmo_launch_product; do not use chiefmo_diagnose_marketing for a new product.

ParametersJSON Schema

Name	Required	Description
`goal`	No	Desired outcome, e.g. 'Get first 100 users', 'find paying customers', 'get design partners', 'make money from this'.
`brand`	No	Optional inline brand context.
`channels`	No	Optional launch channels. Default is auto-selected from the detected product playbook — service agencies skip Product Hunt; B2B SaaS / devtools / consumer / AI-agent products include it; internal tools default to email + landing only. Pass an explicit subset to override.
`tenantId`	No	Optional tenant/end-user id for stored brand context and memory.
`outputMode`	No	context \| draft \| full \| demo. 'demo' is the first-use / YC-demo concise mode — forces 4 punch-list channels (linkedin / x / product_hunt / email), draft rendering, no image gen, scrubbed inline drafts. Defaults to context/draft behavior from chiefmo_launch_product.
`productUrl`	No	Live product URL. Pass either productUrl or productDescription.
`repoContext`	No	Strongly recommended. Include whatChanged, recentCommits, changedFiles, routes, readme, targetCustomer, launchGoal.
`responseShape`	No	summary (default, smallest — promotes launchId / launchRoomUrl / publishActionIds / firstResponseScript; per-channel draft bodies live behind the reviewUrl) \| compact (per-channel previews + truncated draft bodies inline) \| full (everything inline; debug/export only — large).
`idempotencyKey`	No	Optional dedupe key to prevent duplicate launch preparation.
`productDescription`	No	Short product description if no public URL exists yet.
`includeFullArtifacts`	No	Alias for responseShape:"full". Backward-compat with older P29 clients.

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Although no annotations are provided, the description details the tool's behavior: it sets up a launchPack with approval-gated actions, measurement, memory, etc. It also mentions prerequisites ('Gather repoContext first'). The description effectively communicates the tool's role as an 'agent-dependency tool' without contradicting any annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose and structured logically: alias declaration, use cases, what the tool does, prerequisites, and routing. While somewhat verbose, every sentence adds value and is necessary for agent understanding. The length is justified by the tool's complexity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of output schema and annotations, the description provides a comprehensive overview including the tool's role, output artifacts (launchPack, reviewUrl), lifecycle details (24h measurement, next move), and explicit usage context. It leaves no ambiguity about when and how to invoke the tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 100% description coverage, so the baseline is 3. The description adds minimal additional parameter guidance beyond the schema, except for clarifying the productUrl/productDescription alternative and emphasizing repoContext. The schema itself is already fairly descriptive.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is an 'Outcome-named alias for chiefmo_launch_product' and specifies the exact scenarios (after build, when user asks for users/customers/money). It distinguishes itself from siblings by explicitly noting it should not be confused with chiefmo_diagnose_marketing.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit when-to-use criteria: 'USE WHEN a coding agent just built, shipped, or deployed something and the user asks...' along with specific user queries. It also gives routing instructions and a clear alternative ('do not use chiefmo_diagnose_marketing for a new product').

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_helpAInspect

FIRST TOOL — call before anything else when you don't know what ChiefLab does or where to start. Returns the operator menu + workspace state (connectors wired, last run, signup status) + the single next action for this workspace. Cheap, no side effects. After this, call chieflab_launch_product for a full launch or chieflab_post for a single channel.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

States it has 'no side effects' and is 'cheap', which is good behavioral disclosure for a tool with no annotations. It describes what it returns, but could mention any prerequisites like workspace context being set.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three concise sentences, front-loaded with the most important instruction ('FIRST TOOL'), and no wasted words. Every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite lacking an output schema, the description explains what the tool returns (operator menu, workspace state, next action), which is sufficient for an initial help tool. No critical gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters exist, so the description naturally adds no param info. With 100% schema coverage for no params, a baseline of 4 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is the first tool to call when you don't know what ChiefLab does or where to start. It specifies the returns (operator menu, workspace state, next action) and distinguishes itself from sibling tools like chieflab_launch_product and chieflab_post.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'FIRST TOOL — call before anything else', providing clear when-to-use guidance. It also gives alternative next steps, making the usage context very clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_inboxAInspect

USE WHEN the user asks 'what came in from my launches?' / 'show me my replies' / 'what's in my inbox?' / 'who responded to the LinkedIn post?'. Returns engagement events for THIS workspace, ordered by recency. Filter by status (new / drafted / approved / sent / dismissed) — default new. Pair with chieflab_draft_reply to handle them.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Max events to return. Default 50.
`status`	No	new \| drafted \| approved \| sent \| dismissed \| all (default: new)
`sinceHours`	No	Only events received in the last N hours. Default 168 (7 days). Max 720 (30 days).
`workspaceId`	No

Tool Definition Quality

A3.6/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. It describes that it returns engagement events and lists filter options, but does not mention whether the operation is read-only, potential side effects, rate limits, or data size constraints. The description is insufficient for an agent to fully understand the tool's behavior.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise: two sentences that front-load usage scenarios and then specify behavior and filters. Every sentence adds value without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (read operation with filters) and lack of output schema or annotations, the description covers basic purpose and usage. However, it omits details about return format, pagination, or max event limits (though limit param exists). It is adequate but not fully complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 75% description coverage (workspaceId missing description). The description does not add meaning beyond what the schema already provides for limit, status, and sinceHours. It repeats the default 'new' filter but offers no additional context, like how status values behave or what happens if limit is exceeded.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool returns engagement events for the workspace ordered by recency, with example queries like 'what came in from my launches?' and 'what's in my inbox?'. It also names a sibling to pair with (chieflab_draft_reply), distinguishing it from other inbox tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'USE WHEN' and provides several example user intents, giving clear context for when to invoke this tool. It also suggests pairing with chieflab_draft_reply, but does not specify when not to use it or provide exclusion criteria.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_launch_productCInspect

Operator: chieflab-launch (primary). Spec v0.1 reference implementation of the orchestrator. End-to-end launch loop: repo-grounded positioning + per-channel drafts (linkedin, x, hn, reddit, email, landing_hero, product_hunt) + image briefs + approval-gated publishActions + signed reviewUrl + 24h measurement queue. Internally composes chieflab-post + chieflab-email under one runId. Spec-renamed alias of chiefmo_launch_product (backwards-compat retained).

ParametersJSON Schema

Name	Required	Description	Default
`goal`	No
`brand`	No
`channels`	No
`tenantId`	No
`productUrl`	No
`webhookUrl`	No
`repoContext`	No
`workspaceId`	No
`imagesNeeded`	No
`idempotencyKey`	No

Tool Definition Quality

C2.8/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses internal composition (post, email under one runId), approval-gating, review URL generation, and 24h measurement queue. However, it does not mention authentication needs, rate limits, or destructive nature beyond implied mutations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is single-paragraph but front-loads the operator context. However, it is dense and uses spec-revision language ('Spec v0.1', 'spec-renamed alias') that may be verbose without aiding utility. Every sentence contributes but structure could be cleaner.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (10 parameters, nested objects, no output schema, no annotations), the description explains the orchestration flow but lacks specifics on input requirements, error handling, prerequisites, or return value format. Leaves gaps for an AI agent to select and invoke correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0% despite 10 parameters (some nested). The description hints at channels (linkedin, x, etc.) and image briefs, but does not explicitly map to schema properties like goal, brand, productUrl, or idempotencyKey. High complexity with minimal compensation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool as an end-to-end launch orchestrator with a specific verb and resource ('launch product'). It mentions composed sub-tools (chieflab-post, chieflab-email) and distinguishes from them, but the dense spec-wording may reduce immediate clarity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description states it is the 'primary' operator and a spec reference, but does not explicitly say when to use this tool vs. alternatives like chiefmo_launch_product or other launch-related siblings. It notes backwards compatibility with chiefmo_launch_product, leaving confusion about which to invoke.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_list_specialistsAInspect

P87 — list the specialist agents ChiefLab can delegate to (design / video / research / outreach / seo / analytics). USE WHEN the user asks 'what can ChiefLab do beyond launch posts?' or before calling chieflab_request_specialist. Returns the kind + label for each so the caller can pick the right one.

ParametersJSON Schema

Name	Required	Description	Default
No parameters

Tool Definition Quality

A4.7/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description discloses that the tool returns kind and label for each specialist, implying a non-modifying query. While no annotations are present, the description sufficiently indicates it is a read-only operation. It could be slightly more explicit about being non-destructive, but the verb 'list' strongly implies that.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single concise sentence that front-loads the key information (P87 — list) and includes usage conditions. Every word serves a purpose with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no parameters or output schema, the description is fully complete: it explains what the tool does, when to use it, and what it returns. It leaves no gaps for a tool of this simplicity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

There are no parameters, so the schema provides full coverage. The description does not need to add parameter details; it correctly focuses on the purpose and output.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool lists specialist agents ChiefLab can delegate to, specifying categories (design/video/research/outreach/seo/analytics). It distinguishes its purpose from the sibling tool chieflab_request_specialist by stating it should be called before that tool.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly tells when to use: when the user asks 'what can ChiefLab do beyond launch posts?' or before calling chieflab_request_specialist. It provides clear context and an alternative action, making it easy for the agent to decide.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_mark_action_doneAInspect

P135 — mark a next-move action as completed when the founder has done the move BUT there's no URL to record (e.g. reply_to_commenters, dm_engagers, mark replies done). Flips the action to executed with metadata.markedManuallyAt + metadata.executedManually=true. The next-move card disappears from /command/'s Today's Loop. For moves that DO have a URL (cross-post, new post), use chieflab_record_manual_publish instead — it both records the URL and marks executed.

ParametersJSON Schema

Name	Required	Description
`note`	No	Optional founder note about what was done. Stored in metadata.markedManuallyNote for the brain to read later.
`actionId`	Yes	The next-move action id.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses that the action is flipped to executed, metadata fields (markedManuallyAt, executedManually=true) are set, and the card disappears from Today's Loop. However, it does not discuss error conditions, permissions, or reversibility, leaving some gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose and provides necessary details. The 'P135' reference adds minor noise but does not detract significantly. It is structured logically: purpose, examples, effect, and alternative. Could be slightly more concise but overall efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the simple action (flip state with optional note), schema coverage is complete, and no output schema exists. The description covers when to use, what it does, parameter purpose, and UI effect. It is fully sufficient for correct invocation without additional context.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% (baseline 3). The description adds value beyond the schema by explaining the note parameter's storage location and purpose (metadata.markedManuallyNote for the brain), and the actionId and workspaceId are clear from their schema descriptions. This extra context justifies a 4.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool marks a next-move action as completed when there is no URL to record, providing specific examples (reply_to_commenters, dm_engagers) and explicitly differentiating from chieflab_record_manual_publish. The verb 'mark' and resource 'next-move action' are specific.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use this tool (no URL to record) and when not to use it (when there is a URL), directing the agent to chieflab_record_manual_publish as an alternative. This provides clear guidance on tool selection.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_measure_hacker_newsAInspect

P109 — pull a Hacker News post's score + comments + top replies via the public HN API and write them to the action's metadata.proof. USE WHEN a published Hacker News action is ripe for 24h measurement (checks the publishedUrl on the action, parses the item id, calls the Firebase API). Auto-creates a next-move action based on the outcome (trending → reply + cross-post; engaged → reply; stalled → rewrite angle). Idempotent — second call returns the existing measurement. No API key required; HN is public-read.

ParametersJSON Schema

Name	Required	Description
`force`	No	Re-measure even if metadata.proof.measuredAt is already set. Default false.
`actionId`	Yes	The action id whose publishedUrl points at the HN post. Required.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.1/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description discloses idempotency ('second call returns existing measurement'), no API key requirement, auto-creation of next-move actions, and uses the Firebase API. It does not cover error handling or rate limits, but the provided behavioral traits are substantial for safe agent use.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise and front-loaded with the core purpose. Each phrase adds value (idempotency, auth, outcome actions). The single-sentence structure is effective but could be slightly improved with bullet points for clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and 3 params, the description covers the tool's purpose, usage condition, idempotency, auth, and automated follow-up actions. It lacks explicit error scenarios but provides a complete mental model for an AI agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description adds minimal extra meaning beyond the schema: it reinforces 'force' behavior and repeats actionId context, but workspaceId is not elaborated. No new parameter semantics are provided.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool pulls a Hacker News post's score, comments, and top replies via the public HN API and writes them to metadata.proof. It specifies the action type (published Hacker News action) and distinguishes it from sibling tools like chieflab_check_measurement_due which only check due status.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'USE WHEN a published Hacker News action is ripe for 24h measurement', providing clear context. It outlines the decision logic based on outcome (trending/engaged/stalled) but does not explicitly exclude alternative tools, though siblings are largely unrelated.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_measure_redditAInspect

P115 — pull a Reddit post's score + comments + upvote ratio + top replies via the public Reddit JSON API and write them to the action's metadata.proof. USE WHEN a published Reddit action is ripe for 24h measurement. Parses subreddit + post id from the publishedUrl, calls https://www.reddit.com/r//comments/.json (no auth), auto-creates next-move actions based on the outcome (trending → reply + cross-post; engaged → reply; downvoted → rewrite angle). Idempotent — second call returns existing measurement unless force:true.

ParametersJSON Schema

Name	Required	Description
`force`	No	Re-measure even if metadata.proof.measuredAt is already set. Default false.
`actionId`	Yes	The action id whose publishedUrl points at the Reddit post. Required.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.3/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description fully discloses behaviors: uses public API without auth, parses subreddit and post ID from publishedUrl, auto-creates next-move actions based on outcome (trending, engaged, downvoted), and idempotent unless force:true. This is comprehensive behavioral transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single dense paragraph. It front-loads the core purpose and is reasonably concise, though the auto-creation logic adds length. It could be slightly more structured, but overall efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description explains what data is written to metadata.proof and the auto-creation of next-move actions. It covers the input, process, and outcome adequately for a specialized tool, though exact proof structure is omitted.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so the description adds little beyond param descriptions. It mentions 'force' in context and explains how actionId is used, but this is already captured in the schema. Baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: pulling a Reddit post's score, comments, upvote ratio, and top replies via the public Reddit JSON API and writing them to metadata.proof. It uses a specific verb+resource and distinguishes from siblings like chieflab_measure_hacker_news.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides a specific usage condition: 'USE WHEN a published Reddit action is ripe for 24h measurement.' It also mentions idempotency and the force parameter for re-measurement, but does not explicitly contrast with alternative tools or state when not to use it.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_persistence_healthAInspect

P91 — verify the persistent stores. Probes proof_ledger / publishing_targets / company_profiles tables (Supabase) OR confirms the in-memory fallback. USE WHEN the user just ran the SQL migrations and wants to confirm the API is now using durable storage; OR before a real founder launch to make sure data won't evaporate on cold start; OR when debugging why a previously-stored target / profile disappeared. Returns per-store: { kind: 'supabase'|'in_memory', durable: bool, tableExists?: bool, sampleQueryOk: bool, rowCount?: number, error?, warning? }. The 'tableExists: false' case means the SQL migration has NOT been run.

ParametersJSON Schema

Name	Required	Description	Default
`workspaceId`	No

Tool Definition Quality

A3.9/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It details what the tool probes (tables vs. in-memory), checks for durability, table existence, etc., and explains the 'tableExists: false' case. However, it does not mention if there are any side effects or read-only guarantees.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is relatively long but well-structured with a badge, purpose, use cases, and return format. Each sentence adds value, though some minor trimming could improve conciseness.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description provides a detailed return format without an output schema, which is helpful. However, it lacks explanation of the workspaceId parameter, and it does not cover scenarios like missing data or permission issues. It is adequate but not fully complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has one parameter (workspaceId) with no description and 0% schema coverage. The description does not mention this parameter at all, leaving its purpose and effect unexplained. Since coverage is low, the description should compensate but fails to do so.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'verify the persistent stores' and lists specific tables and fallback. It provides explicit use cases (after migrations, before launch, debugging) that distinguish it from sibling tools like chieflab_query_proof_ledger or chieflab_status.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly states when to use the tool: 'USE WHEN the user just ran the SQL migrations... OR before a real founder launch... OR when debugging...' This provides clear context, though it does not explicitly mention when not to use or alternatives.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_post_launch_reviewAInspect

[chieflab_* alias of chiefmo_post_launch_review] USE WHEN ≥24h has passed since chiefmo_publish_approved_post fired and the user asks 'how did the launch perform?', 'what worked?', 'metrics from my launch'. Closes the loop: pulls 24h+ engagement (likes / comments / views / shares / follower delta) for posts in the launch run + recommends next iteration (rewrite, refresh creative, double down, pause). Returns per-account analytics + top posts + best-time-to-post + a structured next-move recommendation brief.

ParametersJSON Schema

Name	Required	Description
`runId`	Yes	The launchId returned by chiefmo_launch_product. Used to scope analytics to posts from that launch.
`accountIds`	No	Optional. Specific Zernio accountIds to pull analytics for. Defaults to all accounts that posted in this launch.
`outputMode`	No	"context" (default): returns metrics + a brief your LLM renders into the customer-facing recommendation. "full": ChiefLab generates the recommendation server-side.
`lookbackDays`	No	How many days of history to consider (default 7, max 90).

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It discloses that the tool pulls engagement data (likes, comments, views, shares, follower delta) and returns per-account analytics, top posts, best-time-to-post, and a structured next-move recommendation. It does not mention destructive actions or auth requirements, but its read-only nature is implied. The description adds valuable context beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is relatively long but every sentence adds value. It front-loads the alias and usage condition, then lists outputs. Could be slightly more concise, but overall efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema is provided, so the description compensates by listing what is returned (per-account analytics, top posts, recommendation brief). It explains the 24h condition but does not cover error cases or prerequisites beyond that. Still, it provides sufficient context for an agent to use the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for all four parameters. The description adds context: runId is 'the launchId returned by chiefmo_launch_product', accountIds defaults to all accounts, outputMode defaults to 'context', and lookbackDays defaults to 7 (max 90). This goes beyond the schema's basic descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool is for post-launch review after 24 hours, listing specific user queries it addresses ('how did the launch perform?', 'what worked?', 'metrics from my launch'). It includes an alias note and distinguishes itself by describing its specific function (pulling engagement, recommending next iteration) among many sibling tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly provides a condition for use: 'USE WHEN ≥24h has passed since chiefmo_publish_approved_post fired and the user asks...', which clearly tells the agent when to invoke this tool versus others. It also implies not to use before that time.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_prepare_next_moveAInspect

[chieflab_* alias of chiefmo_prepare_next_move] USE WHEN the user has just published a launch (or ≥24h post-launch) and asks 'what's next?', 'follow-up post?', 'iterate on this', 'plan day 2'. P13 — turns one launch into a 7-day launch mode: drafts a queue of follow-up actions (day 2 reply post, day 3 case study, day 5 metrics share, day 7 retrospective) grounded in the prior launch's runId, the per-tenant brand voice + repo facts + channel performance from the P9 company brain, and the prior recommendation (so the next move doesn't repeat the last one). Returns a draft queue + signed reviewUrl + the standard agentGuide block. Each next-move action lands in awaiting_approval; nothing fires without explicit human approval.

ParametersJSON Schema

Name	Required	Description
`runId`	Yes	The prior launchId returned by chiefmo_launch_product. The prepared queue grounds in this run's measurement + memory.
`horizon`	No	"day_2" \| "day_3" \| "week" — how far to plan. Default: "week" (7-day mode).
`channels`	No	Optional. Restrict next-moves to these channels. Defaults to the channels of the prior launch.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries full burden. It discloses that actions land in 'awaiting_approval' and require human approval, and mentions use of company brain and brand voice. But it lacks disclosure of side effects, permissions needed, or rate limits.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is informative but somewhat long. It front-loads key information (usage condition) but includes detailed explanations that could be condensed. Still, it is structured and readable.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description explains the return value (draft queue, reviewUrl, agentGuide block). It covers input, behavior, and output for a 4-parameter tool without nested objects, making it reasonably complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, baseline 3. The description adds meaning beyond schema: explains that runId is the 'prior launchId from chiefmo_launch_product', horizon lists allowed values with default, and channels is optional. This enriches understanding.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'turns one launch into a 7-day launch mode: drafts a queue of follow-up actions'. It specifies the verb ('prepare next move') and resource ('queue of follow-up actions'), and distinguishes from siblings by noting it is an alias of chiefmo_prepare_next_move.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'USE WHEN the user has just published a launch (or ≥24h post-launch) and asks...' providing clear context. It also mentions grounding in prior launch's runId and company brain. However, it does not explicitly state when NOT to use it or mention alternative tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_publish_approved_postAInspect

[chieflab_* alias of chiefmo_publish_approved_post] Publish an approved social post (LinkedIn / X / Threads / Instagram / Facebook / Bluesky / TikTok) through the social publishing rail (current adapter: zernio). USE WHEN the user has approved a publishAction from chiefmo_launch_product on the reviewUrl and you need to fire the post. Strict approval gate: returns { reason: 'requires_approval', reviewUrl } if the action isn't approved yet. Once executed, returns the live platform post URL. Money/external-system action — never call this without an approved actionId.

ParametersJSON Schema

Name	Required	Description
`content`	Yes	The final rendered post text (your LLM rendered this from the brief in the launch pack).
`actionId`	Yes	The publishAction.id from chiefmo_launch_product. Required.
`mediaUrls`	No	Optional. Image/video URLs (or base64 data URLs from launch graphics).
`platforms`	Yes	Which platforms + Zernio accountIds to publish to. Get accountIds from chieflab_list_publish_accounts.
`scheduleAt`	No	Optional ISO 8601. Schedule for later instead of immediate.

Tool Definition Quality

A4.6/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Describes the approval gate behavior (returns { reason: 'requires_approval', reviewUrl }) and that it outputs live post URL. Calls it a "Money/external-system action". Lacks details on rate limits but covers key behaviors well for a publish tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is concise (3-4 sentences) and front-loaded with the alias and main purpose. Every sentence adds value without fluff. Slightly longer than necessary but still efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity (5 params, no output schema, many siblings), the description covers the approval flow, usage context, links to other tools, and warnings. Provides sufficient detail for safe invocation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but description adds context: content is "the final rendered post text", actionId is from chiefmo_launch_product, platforms accounts from chieflab_list_publish_accounts, scheduleAt is ISO 8601. Adds meaningful guidance beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it publishes approved social posts via Zernio, listing platforms. Distinguishes from siblings by requiring an approved actionId from chiefmo_launch_product.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says "USE WHEN the user has approved a publishAction from chiefmo_launch_product" and "never call this without an approved actionId", providing clear when-to-use and when-not-to-use guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_query_proof_ledgerAInspect

P80 — query the persistent proof ledger. USE WHEN the user (or a dashboard) asks 'what has ChiefLab actually shipped for this workspace?' or 'show me the launch history.' Returns the proof rows for executed publishes / sends / manual-posts with the artifact URLs, channels, execution modes, and measurement state. Persistent across cold starts when deps.proofLedgerStore is wired to Supabase; falls back to in-memory (warm function lifetime) otherwise.

ParametersJSON Schema

Name	Required	Description
`limit`	No	Page size (1-500, default 50).
`runId`	No	Optional. Filter to one run's proofs.
`workspaceId`	No	Optional workspace id (defaults to auth workspace).
`sinceTimestamp`	No	Optional ISO timestamp; only return proofs published at or after.
`measurementDueOnly`	No	When true, returns only rows with measurement_due_at <= now AND measured_at IS NULL. Use this to find what's ready for 24h readback.

Tool Definition Quality

A4.5/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description fully discloses the tool's behavior: it is a read-only query, returns specific fields, and explains persistence behavior (persistent if Supabase wired, otherwise in-memory). This provides complete transparency for an agent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise and front-loads the purpose and usage. It is a single paragraph that effectively communicates key points without unnecessary fluff, though it could be slightly more structured (e.g., bullet format).

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers the return format (proof rows with artifact URLs, channels, etc.) and persistence behavior, which is sufficient for a query tool. However, it lacks details on ordering or default pagination, which would be useful given no output schema.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Although schema coverage is 100%, the description adds value by explaining the purpose of the 'measurementDueOnly' filter (rows ready for 24h readback) and contextualizing the parameters. This goes slightly beyond the schema descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool queries the persistent proof ledger and gives concrete use cases ('what has ChiefLab actually shipped for this workspace?', 'show me the launch history'). It also lists the returned data (proof rows, artifact URLs, etc.), making the purpose unambiguous and distinct from other tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly says 'USE WHEN' followed by specific user queries, providing clear usage context. However, it does not mention when not to use this tool or suggest alternatives, which would further enhance guidance.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_record_manual_metricsAInspect

P116 — paste-what-you-see metric fallback for channels without public-API measurement (X, LinkedIn, Email, landing pages, Product Hunt). USE WHEN a published action's channel doesn't have a public-JSON metrics endpoint (anything except hacker_news / reddit) AND the founder is ready to paste the visible numbers. Reads metadata.proof.artifactUrl to confirm the post is recorded, writes the provided metrics into metadata.proof.metrics + measuredAt, auto-classifies the outcome (channel-specific thresholds: X needs likes/replies, LinkedIn needs reactions/comments, Email needs opens/clicks/replies, etc.), and creates next-move actions just like the auto-measurement path. Returns the metric schema for the channel so callers can render the right form fields.

ParametersJSON Schema

Name	Required	Description
`force`	No	Re-record even if metadata.proof.measuredAt is already set. Default false.
`metrics`	No	Free-form metrics map. Channel-specific keys: x → {impressions, likes, replies, retweets, clicks}; linkedin → {impressions, reactions, comments, reposts, clicks}; email → {sent, opens, clicks, replies, unsubscribes}; landing_hero → {sessions, signups, avg_time_seconds, bounce_rate_pct}; product_hunt → {upvotes, comments, rank}. Call without metrics to just retrieve the schema for the action's channel.
`actionId`	Yes	The action id to attach metrics to.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.5/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Discloses internal actions: reads metadata.proof.artifactUrl, writes metrics into metadata.proof.metrics + measuredAt, auto-classifies outcome, creates next-move actions, and returns metric schema. With no annotations, this provides sufficient transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Information-dense but not overly long; each sentence adds value. Front-loaded with a clear identifier and purpose. Could be broken into shorter sentences for readability.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given complexity (manual entry, auto-classification, next-move creation) and no output schema, the description fully covers purpose, usage, parameter details, and what the tool returns.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description adds significant value by listing channel-specific keys for the 'metrics' object and explaining behavior when called without metrics (retrieve schema).

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly defines the tool as a manual metric fallback for channels without public API measurement, naming specific channels (X, LinkedIn, Email, landing pages, Product Hunt) and explicitly distinguishing from siblings by stating 'anything except hacker_news / reddit'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

It provides explicit usage conditions: 'USE WHEN a published action's channel doesn't have a public-JSON metrics endpoint... AND the founder is ready to paste the visible numbers.' It implies alternatives for hacker_news/reddit but does not name sibling tools directly.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_record_manual_publishAInspect

USE WHEN the user has manually posted to a channel returned by chieflab_use_manual_fallback (Product Hunt / HN / Reddit / Discord / etc.) and wants to feed the live URL back to ChiefLab so the closed loop continues. Records the URL on the original publishAction (status flips from 'approved' to 'executed' with metadata.executedManually=true + metadata.publishedUrl), persists a proof_asset to the P9 company brain, and queues 24-hour metrics readback via chiefmo_post_launch_review. Without this tool, manually-posted channels are lost to ChiefLab's measurement loop.

ParametersJSON Schema

Name	Required	Description
`notes`	No	Optional. Any context on how the post performed in the first hour (early upvotes, top comment, reactions). Persists into the P9 brain.
`actionId`	Yes	The publishAction.id that the user manually published (from chieflab_use_manual_fallback). Required.
`publishedAt`	No	Optional ISO 8601 timestamp. Defaults to now.
`workspaceId`	No	Optional workspace id.
`publishedUrl`	Yes	The live URL of the manual post (e.g. producthunt.com/posts/<slug>, news.ycombinator.com/item?id=<id>, reddit.com/r/SaaS/comments/<id>).

Tool Definition Quality

A4.7/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Despite no annotations, the description fully discloses behavioral traits: it flips the publishAction status, persists a proof_asset, and queues metrics readback. This gives the agent a clear understanding of side effects and consequences of not using the tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, well-structured paragraph. It front-loads the usage condition and explains functionality and consequences concisely. No unnecessary text, but could be broken into multiple sentences for clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the 5-parameter schema and no output schema, the description provides sufficient context: usage scenario, side effects, and importance. It does not describe return values, but the tool's purpose implies it completes an action, and the side effects are well explained.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Input schema has 100% coverage with good descriptions. The tool description adds overall context but does not significantly enhance individual parameter meaning beyond the schema. However, it references the fallback tool for actionId and publishedUrl, slightly elevating utility.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: to record the live URL of a manually posted channel back to ChiefLab. It specifies the verb 'record' and the resource 'manual publish', and distinguishes from siblings by tying to chieflab_use_manual_fallback and explaining the closed loop continuation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description starts with 'USE WHEN', explicitly stating the condition (user manually posted using chieflab_use_manual_fallback) and the motivation (feed back URL for closed loop). It also warns that without this tool, manually-posted channels are lost, providing clear guidance on when to use.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_redraftAInspect

USE WHEN the user says 'make this more casual' / 'shorter' / 'less salesy' / 'add a specific number' / 'redraft this' about a published draft. Pulls the existing brief for the actionId, merges the instruction, returns an updated brief your LLM should render. After rendering call chieflab_record_rendered_copy({ assetId, body }) to commit. Multi-turn flow stays coherent: each redraft increments action.metadata.revision; original brief preserved in metadata.history.

ParametersJSON Schema

Name	Required	Description
`body`	No	Optional. If your LLM already rendered the redrafted body, pass it here and ChiefLab updates the action's brief directly.
`actionId`	Yes	The publishAction.id from chiefmo_launch_product.
`instruction`	Yes	Plain-English instruction. 'shorter', 'more casual', 'add the $40 number from our pricing page', etc.
`workspaceId`	No

Tool Definition Quality

A4.6/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description fully discloses behavioral traits: it pulls the existing brief, merges the instruction, returns an updated brief for LLM rendering, requires a follow-up commit call, and details multi-turn coherence (increments revision, preserves original in history). No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise (4 sentences), front-loaded with the usage trigger, and avoids superfluous details. Every sentence serves a purpose: trigger, action, post-requisite, and behavioral note.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers when to use, the core workflow, post-requisite action, and behavioral details (revision tracking, history preservation). It does not explain error cases or return format, but given the complexity and lack of output schema, it is fairly complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema covers 3 of 4 parameters with descriptions (75% coverage). The description adds valuable context for 'instruction' (plain-English, examples) and 'body' (optional, pre-rendered). However, 'workspaceId' lacks description in both schema and description, slightly reducing clarity.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states what the tool does: redraft a published draft when the user says phrases like 'make this more casual' or 'shorter'. It specifies the action of pulling the brief, merging the instruction, and returning an updated brief, which differentiates it from sibling drafting tools.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit usage triggers ('USE WHEN the user says ... about a published draft') and a critical follow-up step (call chieflab_record_rendered_copy). However, it does not explicitly mention when not to use this tool or list alternatives, leaving some ambiguity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_regenerate_visual_assetAInspect

P72 — regenerate a single visual asset (LinkedIn graphic, X graphic, landing hero, etc.) with a different style / fidelity / headline / model. USE WHEN a reviewer hits 'Regenerate' on a launch image in the Launch Room. Reads the original launch's brief from the run metadata, calls produceVisualAsset with the new parameters, and returns the updated asset shape (assetId, dataUrl, dimensions, prompt, mode). Does NOT auto-persist back to the run — caller decides whether to swap the asset in place (via runStore) or treat the regen as a parallel candidate.

ParametersJSON Schema

Name	Required	Description
`cta`	No	Override the CTA text.
`seed`	No	Optional seed string for deterministic layout-hint picking.
`model`	No	gemini-2.5-flash-image \| gemini-3-pro-image-preview. Caller override; defaults to the model the brief's fidelity selects.
`runId`	Yes	Launch run id that produced the original asset. Used to look up the LaunchIntelligenceBrief stored on the run.
`style`	No	clean \| bold \| editorial \| playful. Defaults to 'editorial'.
`assetId`	No	Optional original assetId — when present, the regenerated asset reuses the id so the run viewer can swap in place.
`channel`	Yes	Channel the asset is for (linkedin / x / email / landing_hero / etc.).
`fidelity`	No	on_brand \| polished \| experimental. Defaults to 'on_brand'.
`headline`	No	Override the headline rendered into the image.
`brandColors`	No	Optional {primary, accent, bg, text} hex override.
`subheadline`	No	Override the subheadline.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.6/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations present, but description fully discloses key behaviors: reads original brief, does NOT auto-persist, and lists return fields. This is excellent transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single paragraph with clear structure: purpose, usage, internal logic, return info, persistence caveat. Slightly verbose but all sentences add value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given 12 parameters, no output schema, and nested objects, the description covers everything: usage context, internal steps, return shape, and persistence decision. Complete for an AI agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

All 12 parameters have schema descriptions (100% coverage). Description adds value for key params like runId (looks up brief) and assetId (swap in place). Barely misses 5 due to some params being straightforward.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description explicitly states it regenerates a visual asset with different style/fidelity/headline/model, and distinguishes from siblings which are about actions, approvals, or publishing.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides specific usage condition: when a reviewer hits 'Regenerate' in Launch Room. Also clarifies non-persistence. Lacks explicit when-not or alternatives, but context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_request_specialistAInspect

P87 — delegate to a specialist agent. USE WHEN the launch needs more than copy: a graphic brief (kind='design'), a demo video shot list (kind='video'), a prospective-customer research plan (kind='research'), personalized outreach drafts (kind='outreach'), landing-page SEO recommendations (kind='seo'), or a verdict + recommended next move from measurement data (kind='analytics'). Specialists are deterministic stubs today — they return structured briefs / plans / drafts the founder can act on immediately. Future LLM-backed implementations swap in transparently. Each request creates an approval-gated action so the brief lands in the Launch Room. Pass runId for grounding in the originating launch's intelligence brief.

ParametersJSON Schema

Name	Required	Description
`kind`	Yes	design \| video \| research \| outreach \| seo \| analytics
`brief`	No	kind-specific brief fields (asset/channel for design; durationSeconds/platform for video; goal for research; contacts/channel for outreach; url/targetTerms for seo; metrics/channel for analytics)
`runId`	No	Optional. Source run id; the specialist grounds in that launch's brief (productName / audience / market / positioning).
`workspaceId`	No	Optional workspace id.
`autoCreateAction`	No	Default true. When true, creates an approval-gated action in actionStore so the brief lands in the Launch Room. When false, returns the specialist output without persistence (use for one-off / preview).

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description bears full burden. It discloses that specialists are deterministic stubs, return structured briefs/plans/drafts, create an approval-gated action, and can be bypassed with autoCreateAction=false. It does not cover authentication or rate limits, but the behavioral traits are well communicated.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured: purpose first, then usage, then behavior. Each sentence adds value. The 'P87' prefix is minor clutter but does not harm clarity. Could be slightly shorter but effective.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers purpose, usage, behavioral traits, and parameter significance. It explains the return format (structured briefs/plans/drafts) and integration with Launch Room. Given no output schema, the description provides sufficient context for an agent to use the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so baseline is 3. The description adds value by explaining the purpose of runId ('grounding in launch brief') and the effect of autoCreateAction. It also contextualizes kind with examples, though schema already lists allowed values. Overall, description enhances parameter understanding.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description explicitly states the tool delegates to a specialist agent for specific kinds of tasks beyond copy. It lists six kinds with concrete examples, clearly distinguishing from siblings like chieflab_create_work_request. The purpose is specific and unambiguous.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description specifies when to use: 'USE WHEN the launch needs more than copy' and enumerates the six specialist kinds. It provides context for parameters like runId and autoCreateAction but does not explicitly state when not to use or name alternatives. The guidance is clear and actionable.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_retry_blocked_actionAInspect

P13 — re-check a blocked publishAction's connector readiness. Returns whether the channel is now ready (so the agent can re-fire chiefmo_publish_approved_post / chiefmo_send_approved_email) or still blocked with the up-to-date connect path. USE WHEN the user has just connected a missing provider and the agent needs to know if the action can fire.

ParametersJSON Schema

Name	Required	Description	Default
`actionId`	Yes	The blocked publishAction.id from chiefmo_launch_product.
`workspaceId`	No

Tool Definition Quality

A3.9/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. It states the return value (ready or still blocked), but does not disclose side effects (e.g., whether any state is modified) or error conditions. Adequate but not comprehensive.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two concise sentences with no filler. Front-loaded with action and purpose, followed by usage guidance. Every sentence adds value.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description provides return value context. Covers the main use case, but lacks details on error handling or exact response format. Still sufficient for the tool's simplicity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema describes actionId but not workspaceId (50% coverage). The tool description adds no additional meaning for parameters; workspaceId remains unexplained. The description does not compensate for the schema gap.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the verb 're-check' and the resource 'blocked publishAction's connector readiness'. Explicitly distinguishes from sibling tools like chiefmo_publish_approved_post and chiefmo_send_approved_email by mentioning they can be re-fired after retry.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit usage scenario: 'USE WHEN the user has just connected a missing provider and the agent needs to know if the action can fire.' Also suggests subsequent actions, but does not mention when not to use it or alternative tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_review_visual_assetAInspect

P104 — approve or reject a single visual asset (LinkedIn graphic / X image / landing hero / Product Hunt gallery / carousel slide). USE WHEN a reviewer hits 'Approve' or 'Reject' on a launch image in the Launch Room. Reads the run's stored visualAssets list, updates the matching asset's status (approved | rejected | pending_approval), persists the change to run metadata so future loads + the channel-media render reflect the decision. Companion to chieflab_regenerate_visual_asset (which produces an updated asset shape without persisting).

ParametersJSON Schema

Name	Required	Description
`runId`	Yes	Launch run id that produced the asset.
`reason`	No	Optional. Why rejected (or why approved). Stored in metadata for audit + future brain memory.
`assetId`	Yes	The visual asset id from launchPack.channels[channel].media[*].assetId.
`decision`	Yes	approve \| reject. Updates the asset's status field.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, so description carries full burden. Describes internal actions: reads visualAssets list, updates status, persists to run metadata. Could mention that it does not destroy anything, but mutation is clear. Lacks return value info.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Four focused sentences, front-loaded with action and asset types. Every sentence provides essential information; no fluff.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema, but description covers purpose, parameters, and persistence effect. Lacks details on return value or error conditions, but sufficient for typical agent understanding.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but description adds value by explaining reason parameter is stored for audit and brain memory, and clarifies decision enum effect. Beyond schema but not extensive.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states action (approve/reject) on specific asset types (LinkedIn graphic, X image, etc.) and distinguishes from sibling tool chieflab_regenerate_visual_asset by noting that this tool persists changes while the other produces without persisting.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'USE WHEN a reviewer hits Approve or Reject in the Launch Room', providing clear context. Mentions companion tool, but no explicit when-not scenarios, which would make it a 5.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_search_capabilitiesAInspect

USE WHEN your operator needs a capability outside its own scope and wants to find another operator that can fulfill it — 'find me a demo-video capability', 'who can do SEO audits', 'search for design help'. Returns active capability records ranked by recency. Pair with chieflab_create_work_request to actually request bounded work from the chosen operator. The discovery embryo for an agent-native marketplace.

ParametersJSON Schema

Name	Required	Description
`tags`	No	Tag overlap filter (e.g. ['video', 'demo']).
`limit`	No	Max results. Default 25.
`capability`	No	Capability slug to match (exact match — e.g. 'demo_video').

Tool Definition Quality

A4.1/5.0

Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It discloses that results are 'active' and 'ranked by recency', which adds some behavioral context. However, it does not mention any restrictions, authentication needs, or potential side effects. A score of 3 reflects adequate but not exhaustive disclosure.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is three sentences, front-loaded with the primary use case, followed by result details and a pairing suggestion. Each sentence adds value, though it could be slightly more concise without losing clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the absence of an output schema, the description explains the nature of return values (active records, ranked by recency) and the broader workflow context. It does not fully define 'capability', but the tool name and use cases make it understandable. Fairly complete for a search tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The description does not add meaningful extra context beyond what the schema already provides for tags, limit, and capability. It meets the minimum acceptable level.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description uses a clear imperative ('USE WHEN') with concrete examples like 'find me a demo-video capability'. It specifies the resource ('active capability records') and verb ('search'), and distinguishes from sibling tools like chieflab_create_work_request, which handles the actual work request after discovery.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states the condition for use: when an operator needs a capability outside its own scope. It also pairs the tool with chieflab_create_work_request, providing a clear workflow path. No contradictory usage advice is given.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_send_approved_emailAInspect

[chieflab_* alias of chiefmo_send_approved_email] Send an approved launch email through the email sending rail (current adapter: resend). USE WHEN the user has approved an email publishAction from chiefmo_launch_product and you need to fire the send. Strict approval gate (same shape as chiefmo_publish_approved_post). from MUST be on a domain verified at the adapter (resend.com) — check chieflab_list_email_senders first. Single recipient or short list (≤50). Money/external-system action — once sent cannot be unsent.

ParametersJSON Schema

Name	Required	Description
`to`	Yes	Recipient(s). Single email or array (≤50).
`from`	Yes	Sender. Must be a verified Resend domain address (e.g. 'Brand <hi@yourdomain.com>').
`html`	No	HTML body (preferred for inboxing).
`text`	No	Plain-text fallback (recommended).
`replyTo`	No	Optional reply-to address(es).
`subject`	Yes
`actionId`	No	The publishAction.id from chiefmo_launch_product. Required for the approval gate to fire.

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description covers critical behavior: 'Money/external-system action — once sent cannot be unsent', highlighting irreversibility and external action. It also notes strict approval gate requirements.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Five sentences, front-loaded with purpose and usage condition, then behavioral note. No redundant information; every sentence earns its place.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers purpose, usage, key requirements (verified domain, short list), irreversibility, and links to a similar approval gate. It lacks error handling or rate limits but is generally complete for the tool's complexity.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 86% (high), so baseline 3. The description adds value by explaining the `actionId` parameter as required for the approval gate and reiterating `from` must be verified. This contextualizes parameters beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it sends an approved launch email via Resend, specifying the verb 'send' and resource 'approved email'. It distinguishes from siblings by referencing the approval gate from chiefmo_launch_product and comparing to chiefmo_publish_approved_post.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly instructs 'USE WHEN the user has approved an email publishAction from chiefmo_launch_product', providing clear context. It implies when not to use (without approval) but doesn't list alternative tools explicitly.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_set_company_profileAInspect

P88 — set the workspace's persistent company profile. ChiefLab's pre-P88 brief pipeline was repo-first; without a repo, launches landed thin. P88 lets the founder (or an agent) store the company / product facts ONCE and every subsequent launch grounds in them, even URL-only / paste-description launches. Pass partial maps to update specific fields; pass null for a field to remove it. Profile fields the brief consumes: companyName, productName, offer, audience, market, positioning, services[], productType, brandColors {primary, accent, bg, text}, websiteUrl, contactEmail, socialLinks[], keyClaims[], voiceSample.

ParametersJSON Schema

Name	Required	Description
`source`	No	manual \| extracted \| imported \| autodetected. Default 'manual'.
`profile`	Yes	Partial profile object — top-level keys merge, nested objects shallow-merge, null values remove.
`confidence`	No	high \| medium \| low. Default 'medium'.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.2/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, but description discloses mutation behavior, partial update merge, null removal, and that launches will ground in these facts. Adequately transparent without annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Front-loaded with purpose, but includes historical context ('pre-P88 brief pipeline was repo-first') that could be trimmed. Still, structure is clear with bulleted fields.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema, so missing return value description. However, for a set operation, the description covers main behavior and usage. Could mention error cases or side effects.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage 100% (baseline 3). Description adds value by explaining partial updates, null removal, and listing the fields the brief consumes, which goes beyond schema names.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states 'set the workspace's persistent company profile' with specific verb and resource, and distinguishes from sibling 'chieflab_get_company_profile' through context.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'Pass partial maps to update specific fields; pass null for a field to remove it', providing clear usage guidance. Does not exclusions or alternatives, but sibling differentiation is sufficient.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_set_provider_keyAInspect

Store a provider API key for THIS workspace. Once stored, ChiefLab uses your key (BYOK — you pay the provider directly, no markup). Without it, ChiefLab uses its own key and bills through with a margin. Providers: gemini (image gen), resend (email), zernio (social publish), anthropic (LLM, future), openai (LLM, future). Stored encrypted at rest. Use chieflab_revoke_provider_key to remove. The key never leaves this workspace.

ParametersJSON Schema

Name	Required	Description
`apiKey`	Yes	The provider API key (e.g. for Gemini: a Google API key starting with AIza...).
`provider`	Yes	gemini \| resend \| zernio \| anthropic \| openai
`workspaceId`	No	Optional workspace id (defaults to your bearer's workspace).

Tool Definition Quality

A4.8/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries full weight. It reveals encryption at rest, workspace scoping, and billing implications. However, it omits details on overwriting existing keys or immediate validation, which would strengthen transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is concise and well-organized: action first, then key benefits, provider list, storage note, and sibling reference. Every sentence is informative without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and simple parameters, the description fully covers purpose, usage scenarios, security, and related tools. It leaves no critical gaps for an agent to misuse the tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but the description adds value by mapping provider values to services (e.g., gemini for image gen) and explaining the apiKey's role in direct billing. The workspaceId optional parameter is also contextualized.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool stores a provider API key for the current workspace, specifying supported providers and contrasting with siblings like chieflab_set_resend_key. It explicitly differentiates by being a generic setter for any listed provider.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explains when to use BYOK (bring your own key) vs letting ChiefLab use its own key. It lists compatible providers and directs to chieflab_revoke_provider_key for removal, providing clear usage context.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_set_publishing_targetsAInspect

P79 — set per-workspace publishing target defaults so chiefmo_approve_action({ autoExecute: true }) doesn't need the agent to pass platform / recipient ids on every call. One-time setup per workspace. channelTargets is a map { linkedin: { accountId }, x: { accountId }, email: { fromEmail, recipientListId } }. Pass partial maps to update specific channels; pass null for a channel value to remove it. Persisted via deps.publishingTargetsStore when wired, otherwise in-process Map (Vercel function lifetime). Returns the merged channelTargets + storage location ('persistent' or 'in_memory').

ParametersJSON Schema

Name	Required	Description	Default
`workspaceId`	No	Optional workspace id (defaults to the auth workspace).
`channelTargets`	Yes	Map of channel → target. Examples: linkedin/x → { accountId: '<zernio_account_id>' }; email → { fromEmail: 'founder@product.com', recipientListId: '<list_id>' }. Pass null as a channel value to remove it.

Tool Definition Quality

A4.5/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description fully discloses behavior: partial map updates, null to remove, persistence details (persistent vs in-memory), and return value structure. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single paragraph but well-structured: starts with purpose, then setup, parameter details, persistence, and return. No fluff, but slightly dense; could be split into bullet points for clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Comprehensive for the tool: explains return value, side effects, and relationship to other tools (implicitly via reference to chiefmo_approve_action). Missing mention of sibling chieflab_get_publishing_targets for reading targets, but not critical.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, but description adds semantic context: explains the structure of channelTargets map with examples for LinkedIn, X, and email, and how null removes channels. This goes beyond the schema's property descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the tool sets per-workspace publishing target defaults to avoid repeated parameters in chiefmo_approve_action. It specifies verb 'set' and resource 'publishing targets', distinguishing it from siblings like chieflab_get_publishing_targets.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says 'One-time setup per workspace', indicating when to use. It explains the tool's role in conjunction with chiefmo_approve_action. However, it does not mention when to avoid using it or alternatives for different scenarios.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_signup_workspaceAInspect

USE WHEN the user has no ChiefLab API key yet and you've gotten a 401 / 'authentication required' error from any other tool. Agent-first signup: creates a new workspace and returns the API key inline as apiKey. PREFERRED flow: use your filesystem tools to write the apiKey into the user's MCP config (see mcpConfigSnippet for the patch shape), then ask the user to restart their runtime once. After restart, re-call the original tool. FALLBACK: if you can't write to the config file, surface the included deliveryUrl to the user — they click, see the key, paste it manually. URL expires in 1 hour, single-use, IP-rate-limited (5/IP/hr). No login form.

ParametersJSON Schema

Name	Required	Description
`agentName`	No	Optional: which AI agent is requesting (Cursor, Claude Desktop, Codex, custom). Logged for support.
`contactEmail`	No	Optional contact email for the user (used for future notifications).
`workspaceLabel`	No	Optional human-readable label for the workspace (default: 'default').

Tool Definition Quality

A4.6/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, description thoroughly explains behavioral traits: workspace creation, key return, config writing, delivery URL expiration (1 hour), single-use, IP rate limit. No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Concise single paragraph with front-loaded usage condition. Could benefit from bullet points for the two flows, but still clear and efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema, but description explains the return value (apiKey) and covers both users' ability to write config or use delivery URL, including expiration and rate limits. Complete for a signup tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema has 100% coverage with good parameter descriptions. The description repeats the same info without adding new semantics beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it creates a new workspace and returns the API key, distinguishing it from other tools that involve reading, connecting, or other operations.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicit use condition: 'when the user has no ChiefLab API key yet and you've gotten a 401 error'. Provides two clear flows (preferred and fallback).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_skip_next_moveAInspect

P142 — founder rejects a next-move recommendation without doing it. Captures a reason so the brain learns which kinds of moves don't fit this workspace (e.g. 'we don't cross-post to Reddit on principle' / 'we tried DMs already'). Flips action to rejected with metadata.skipReason. The next-move generator reads recent skip reasons before suggesting moves of the same kind on this workspace.

ParametersJSON Schema

Name	Required	Description
`reason`	No	Why the founder is skipping this. Used by the brain to avoid suggesting similar moves.
`actionId`	Yes	The next-move action id.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description discloses that the action status changes to rejected with metadata.skipReason and that the next-move generator reads skip reasons to avoid suggesting similar moves. This goes beyond the basic purpose, though it could mention prerequisites or side effects like irreversibility.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loading the purpose and adding key behavioral details without unnecessary words. Every sentence contributes meaning.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

The description covers the main purpose, effect, and learning mechanism. It lacks explanation of return values or prerequisites, but given the tool's simplicity and the presence of other commands, it is mostly complete.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema coverage, the baseline is 3. The description adds value by providing example reasons for the 'reason' parameter and explaining its role in learning, which enhances understanding beyond the schema's description.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool allows a founder to reject a next-move recommendation without executing it, using the verb 'rejects' and specifying the resource 'next-move recommendation'. It distinguishes from siblings like approve_action by explicitly noting the action is flipped to rejected.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explains when to use: when a founder rejects a recommendation and wants to capture the reason. It provides examples of reasons, giving context for appropriate use. However, it does not explicitly state when not to use or directly compare with sibling tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_statusAInspect

SESSION-RECOVERY · FIRST CALL when a session starts and the user mentions launch / users / growth / customers / metrics / revenue / marketing / what next / shipping. Returns a command-center bootCard with headline, priority, cards[] (each carries kind + label + literal user command + runHandle), and next (the one-line prompt). Aggregates: pending approvals + ripe measurements + new engagement + queued prospects + recent launches + manual-publish-pending actions. ChiefLab is stateful and re-summonable — even if the conversation was lost, the IDE was switched, or the runId was forgotten, this call recovers the workspace business state. If the user asked to launch the CURRENT repo, compare boot cards to currentRepoContext/projectName; if the open loop is unrelated, start a fresh launch instead of resuming stale work.

ParametersJSON Schema

Name	Required	Description
`projectName`	No	Optional current project/repo name.
`workspaceId`	No	Optional workspace id.
`currentRepoContext`	No	Optional current repo/project context. Helps agents compare recovered loops against the project the user is asking about.

Tool Definition Quality

A4.6/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Describes stateful, re-summonable behavior, recovers workspace business state even after lost context. No annotations, so description fully handles transparency.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Front-loaded with 'SESSION-RECOVERY · FIRST CALL'. All sentences are informative but slightly verbose. Still concise enough.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

No output schema, but description fully explains return structure (headline, priority, cards with fields, next prompt) and aggregations. Complete for agent to invoke correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Only one optional parameter (workspaceId) with schema description. Description adds no further meaning. Baseline 3 due to 100% schema coverage.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it returns a command-center bootCard for session recovery, listing aggregated data. Distinguishes itself as the FIRST CALL. Specific verb 'returns' and resource 'bootCard'.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use: FIRST CALL when session starts and user mentions specific topics. Also gives instruction: surface cards before drafting new launch.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_suggest_next_moveAInspect

P75 — Next Move Engine. USE WHEN measurement just came in (chiefmo_post_launch_review fired automatically at 24h, OR the user manually called it) and you want to know what the operator should do next. Reads metrics + the original launch's brief and emits a deterministic suggestion: {kind: 'follow_up_email' | 'founder_dm' | 'thread_reframe' | 'landing_iteration' | 'lessons_learned_post' | 'seo_title_test' | 'wait' | 'noop', priority, reasoning, draftBrief: {channel, headline, body, cta?, recipients?}, measurementGroundingFacts}. Pure-function; same input → same output, no LLM. Pair with chieflab_create_next_move_action to turn the suggestion into an approval-gated draft.

ParametersJSON Schema

Name	Required	Description
`runId`	No	Optional. Source run whose brief grounds the suggestion (so the draft references the actual product / market / audience instead of templates).
`channel`	Yes	Channel the measurement is for (linkedin, x, email, landing_hero, product_hunt, hacker_news).
`metrics`	Yes	Measurement metrics — engagements, clicks, opens, replies, traffic, conversions, upvotes, rank, etc. Synonyms (engagement / openRate / clickRate / visits) are accepted.
`workspaceId`	No	Optional workspace id.

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations provided, but the description compensates by declaring it a 'pure function' with deterministic output. It also outlines the output structure (kind, priority, reasoning, draftBrief, measurementGroundingFacts). No side effects disclosed beyond the pure function claim.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is moderately long but every sentence adds value. It is front-loaded with the purpose and usage condition. The output structure is specified. Could be slightly shorter, but no unnecessary information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a complex tool with 4 parameters and no output schema, the description covers the essential aspects: trigger condition, deterministic nature, output structure, and pairing with another tool. However, it doesn't clarify how it differs from chieflab_prepare_next_move, which may cause confusion.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, and the description adds meaningful context. For 'runId', it explains grounding vs templates. For 'metrics', it lists accepted synonyms. For 'channel', it enumerates valid values. This improves understanding beyond the schema alone.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Purpose is extremely clear: 'suggest next move' based on measurement metrics. The description explicitly states when to use ('USE WHEN measurement just came in') and specifies the resource (metrics + brief). It distinguishes from siblings by mentioning pairing with chieflab_create_next_move_action.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Guidelines are explicit: use when measurement just came in, either automatic (24h) or manual. It pairs with chieflab_create_next_move_action. No exclusions or alternatives discussed, but the trigger condition is well-defined.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_use_manual_fallbackAInspect

USE WHEN any launch action carries a manualFallback brief — either because the channel has no automated provider at all (Product Hunt, Hacker News, Reddit, Discord, Slack, Indie Hackers, dev.to, YC Bookface, blog) OR because the channel HAS an automated provider but its connector isn't wired yet for this workspace (LinkedIn / X without a Zernio key; email without a Resend key + verified sender domain). Pattern: use the inline fallback NOW to ship the launch manually; connect the provider LATER (each fallback returns an upgradePath with the exact connector wiring) so the same action publishes automatically on the next launch. Returns: paste-ready title + body, per-channel checklist (best time, format, first-comment script), capture-URL-after instruction, measurement template for the 24h readback, and the optional upgradePath when automation is available. Converts a 'blocked' channel into a 5-minute human-in-the-loop ship. Approval-gated upstream: the originating publishAction must already be 'approved' on the reviewUrl before this tool returns the brief.

ParametersJSON Schema

Name	Required	Description
`channel`	No	Override the channel inferred from the action. One of: linkedin \| x \| email \| blog \| landing_hero \| product_hunt \| hacker_news \| reddit \| discord \| slack \| indie_hackers \| dev_to \| ycombinator.
`actionId`	Yes	The publishAction.id from chiefmo_launch_product whose manualFallback you want to fetch. Works for both manual-only channels (connector === 'manual_only') and otherwise-automated channels whose connector isn't wired (connector === 'zernio' or 'resend' with no key). Look at agentGuide.blockedActions[] or publishActions[].manualFallback in the launch response — any action where manualFallback is non-null is fetchable here.
`subreddit`	No	Optional. For channel=reddit, which subreddit (e.g. 'r/SaaS', 'r/AI_Agents'). ChiefLab's brief picks the right post format per sub.
`workspaceId`	No	Optional workspace id.
`discordCommunity`	No	Optional. For channel=discord, which community (the brief tailors tone + announcement-channel pick).

Tool Definition Quality

A4.3/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Without annotations, the description carries the full burden. It discloses the output format, the approval gate, and the conversion of a blocked channel into a manual ship. However, it does not explicitly state whether the tool has side effects or mutates state, which would be useful for a tool that 'converts' something.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single paragraph but effectively front-loads the usage condition with 'USE WHEN'. It is slightly verbose but all sentences contribute meaningful information. Could be more structured with bullet points, but current form is acceptable.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool has 5 parameters, no output schema, and no annotations, the description provides sufficient context: trigger condition, output components, prerequisite, and even a future measurement template. It lacks error handling details but is otherwise complete for an agent to use.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, so the baseline is 3. The description adds context for the overall tool but does not provide additional meaning beyond the schema descriptions for parameters like channel, actionId, subreddit, etc. The schema already adequately describes each parameter.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: to provide a manual-publish brief when a launch action is blocked due to lack of automated publish path. It specifies the exact output and differentiates from sibling tools like chieflab_launch_product.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description starts with 'USE WHEN' and explicitly states the condition for using the tool (blocked automated publish path). It also mentions the prerequisite that the originating publishAction must be approved, providing clear guidance on when to call this tool.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

chieflab_verify_installAInspect

POST-INSTALL HEALTH CHECK. Call this exactly once after the agent has wired ChiefLab into the runtime (per the bootstrap protocol at https://chieflab.io/bootstrap.txt). Returns { ok, mcpReachable, toolsDiscovered, visibleFrontDoorTools, callableInternalTools, workspaceReady, hasOpenLoops, blockedConnectors, summary }. toolsDiscovered means the visible front-door catalog, not every internal/back-compat callable handler. The summary is one sentence the agent should paste verbatim to the user — e.g. 'ChiefLab installed and ready. Workspace fresh, no open loops, all channels need connectors before auto-publish.' Do NOT list every internal tool — render the summary only. If ok is false, the response includes a nextStep describing the single recovery action.

ParametersJSON Schema

Name	Required	Description	Default
`workspaceId`	No	Optional workspace id; inferred from bearer auth.

Tool Definition Quality

A4.4/5.0

Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations exist, so description fully covers behavior. Details return object structure, including ok field and nextStep on failure. Could mention auth or side effects, but sufficient for a health check.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single paragraph is dense but front-loaded with purpose and key constraints. Could be slightly more structured, but no unnecessary sentences.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, description fully describes return structure, error handling (nextStep), and includes a link to external protocol. Complete for an agent to invoke correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers 100% of parameters. Description adds value by noting workspaceId is optional and inferred from bearer auth, providing context beyond schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it is a POST-INSTALL HEALTH CHECK, with specific verb 'verify install' and use case after wiring ChiefLab. It distinguishes from siblings like 'boot' and 'status' by focusing on post-install verification.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says call exactly once after runtime wiring, providing a link to bootstrap protocol. No explicit when-not, but context is clear. Mentions nextStep for failure, guiding recovery.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Resources

Need Help?