nucleus
Server Details
Sovereign Agent OS — Persistent Memory, Governance & Compliance for AI Agents.
- Status
- Healthy
- Last Tested
- Transport
- Streamable HTTP
- URL
- Repository
- eidetic-works/nucleus-mcp
- GitHub Stars
- 2
- Server Listing
- Nucleus MCP
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Tool Definition Quality
Average 3.5/5 across 17 of 17 tools scored. Lowest: 2.2/5.
Each tool targets a distinct domain (agents, audit, engrams, features, federation, governance, infra, orchestration, relay, relay_subscribe, route, sessions, slots, sync, tasks, telemetry, ccr_arm) with clear boundaries. No two tools have overlapping purposes; an agent can easily select the correct tool based on the task at hand.
All tool names follow a consistent 'nucleus_<domain>' pattern in snake_case, with no mixing of conventions. The naming is predictable and hierarchical, making it easy for an agent to infer tool purpose from the name alone.
With 17 tools, the surface is well-scoped for a comprehensive agent OS. Each tool encapsulates a coherent set of functionality without being overly granular. The count is slightly high but justified by the breadth of capabilities (agent management, audit, memory, etc.).
The tool surface covers major lifecycle aspects: agent creation, task management, memory, governance, federation, telemetry, and orchestration. Minor gaps exist (e.g., no direct file I/O or external API integration beyond what's listed), but the core workflows are well-supported.
Available Tools
17 toolsnucleus_agentsAgent SpawningBInspect
Agent spawning, critic, swarm, memory, ingestion & dashboard tools.
Actions: spawn_agent - Spawn Ephemeral Agent. params: {intent, execute_now?, persona?, confirm?}. HITL: requires confirm=true. apply_critique - Apply critique fixes. params: {review_path} orchestrate_swarm - Start multi-agent swarm. params: {mission, agents?} search_memory - Search long-term memory. params: {query} read_memory - Read memory category. params: {category} respond_to_consent - Respond to respawn consent. params: {agent_id, choice?} list_pending_consents - List agents awaiting consent critique_code - Run Critic review. params: {file_path, context?} fix_code - Auto-fix code. params: {file_path, issues_context} session_briefing - Get session briefing. params: {conversation_id?} register_session - Register session focus. params: {conversation_id, focus_area, role?, tier?, charter_path?, parent_session?}. Tier ∈ {opus,sonnet,haiku}; role must start with tier prefix. handoff_task - Hand off task. params: {task_description, target_session_id?, priority?} ingest_tasks - Ingest tasks. params: {source, source_type?, session_id?, auto_assign?, skip_dedup?, dry_run?} rollback_ingestion - Rollback ingestion. params: {batch_id, reason?} ingestion_stats - Get ingestion statistics dashboard - Enhanced dashboard. params: {detail_level?, format?, include_alerts?, include_trends?, category?} snapshot_dashboard - Create dashboard snapshot. params: {name?} list_dashboard_snapshots - List snapshots. params: {limit?} get_alerts - Get active alerts set_alert_threshold - Set alert threshold. params: {metric, level, value}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations declare readOnlyHint=false, openWorldHint=true, destructiveHint=false. The description adds HITL requirement for spawn_agent and lists mutation actions, but does not detail side effects, permissions, or rate limits. Some behavioral context is provided, but not comprehensive.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Organized as a bulleted list with a summary header. Each action is described in one line, making it scannable. No redundant sentences, though the list format is lengthy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity (16 sub-actions, output schema exists), the description provides a high-level overview but lacks details on return values, error handling, and when to use specific sub-actions. Adequate but not fully complete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is generic (action string + params object) with 0% parameter description coverage. The description compensates by listing each action's parameters with brief notes (e.g., 'require confirm=true'). This adds meaningful context beyond the schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states the tool covers 'Agent spawning, critic, swarm, memory, ingestion & dashboard tools.' This provides a broad category but lacks a single specific verb-resource statement. It lists many sub-actions, making the overall purpose somewhat clear but not sharply distinguished from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus sibling tools like nucleus_audit or nucleus_governance. The description does not provide context or alternatives, leaving the agent to infer usage from the action names alone.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_auditAudit LogBRead-onlyInspect
W8 Team-tier tamper-evident audit log (SHA-256 hash chain).
Actions: log_event - Append an audit event to the chain. params: {event_type, actor, resource, outcome, metadata?, team_id?, ts?} query - Single-tenant query. Rejects team_id=''. params: {team_id, since?, until?, actor?, event_type?, limit?, offset?} admin_query - Cross-tenant query (team_id='' allowed). Requires NUCLEUS_AUDIT_ADMIN_TOKEN env match. Logs every successful call to the synthetic 'admin' chain. params: {admin_token, team_id?, since?, until?, actor?, event_type?, limit?, offset?} verify - Verify SHA-256 chain integrity for a team. params: {team_id}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description contradicts the readOnlyHint annotation with the log_event action being a write operation. It does not acknowledge this or clarify side effects aside from admin_query logging.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is well-structured with bullet points and code formatting. It is relatively concise given the multiple actions, though some redundancy exists.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the main actions and their parameters but omits return values and error handling. The output schema exists but its content is unknown, so completeness is moderate.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has only two high-level fields with minimal constraints, but the description lists action-specific parameters, compensating for the 0% schema coverage. However, it lacks detail on parameter types and formats.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is an audit log with a hash chain, listing the available actions. It distinguishes the tool from siblings by specifying its team-tier and tamper-evident nature.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides some usage context by explaining admin_token requirement for admin_query and rejection of team_id='*' in query. However, it does not explicitly explain when to choose this tool over siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_ccr_armCCR ArmAInspect
One-shot convenience: resolve canonical inbox + arm long-poll subscription.
Per PR #2 (CCR server-side auto-arm) — IDE-agnostic relay-arrival arming.
This is the RECOMMENDED entry point for SessionStart auto-arming across all MCP clients (Claude Code, Antigravity, Cursor, Windsurf, etc.). Equivalent to: 1. resolve_canonical_inbox_name(role) → canonical inbox name 2. nucleus_relay_subscribe(inbox_filter=, timeout_seconds=...)
Why this exists vs nucleus_relay_subscribe + inbox_filter: nucleus_relay_subscribe + inbox_filter requires the caller to KNOW the canonical inbox name for their role. nucleus_ccr_arm hides that step. Agent just calls nucleus_ccr_arm() with no args; server detects role from CC_SESSION_ROLE / NUCLEUS_SESSION_ROLE env OR detect_session_role().
| Name | Required | Description | Default |
|---|---|---|---|
| role | No | explicit role override (e.g., "antigravity", "cc_tb"). If omitted, server detects via env vars / registry ancestry / provider heuristics (per detect_session_role). | |
| timeout_seconds | No | max subscription duration (60..1800, default 270 under FastMCP context-cache TTL). |
Output Schema
| Name | Required | Description |
|---|---|---|
No output parameters | ||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations are basic (readOnlyHint=false, etc.), but the description adds valuable behavioral context: it's a one-shot convenience that creates a long-poll subscription, server detects role from environment variables, and timeout_seconds has a default and range. No contradictions with annotations. Slight deduction for not discussing potential side effects like duplicate subscriptions.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is front-loaded with the main purpose. It includes some internal references (PR #2, IDE-agnostic) that may not be universally helpful, but overall every sentence adds value. Slightly verbose but justified by complexity.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (combining two operations), the presence of an output schema, full schema parameter coverage, and sibling context, the description is remarkably complete. It covers purpose, usage, parameter semantics, and differentiation from alternatives, leaving no major gaps for the agent.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, providing baseline 3. The description adds context beyond the schema: role can be explicitly overridden but server auto-detects if omitted, and timeout_seconds has a default tied to FastMCP context-cache TTL. This adds meaningful usage guidance.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's purpose: 'resolve canonical inbox + arm long-poll subscription'. It distinguishes itself from the sibling tool nucleus_relay_subscribe by explaining that it hides the step of knowing the canonical inbox name, making it a convenience entry point.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description explicitly states why this tool exists vs nucleus_relay_subscribe + inbox_filter, and recommends it as the 'RECOMMENDED entry point for SessionStart auto-arming'. It provides clear context for when to use this tool over its alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_engramsMemory & EngramsAInspect
Engrams, health, observability, DSoR & tier system tools.
Actions: health - Get system health status version - Get Nucleus version info export_schema - Export MCP toolset as JSON Schema performance_metrics - Get perf metrics. params: {export_to_file?} prometheus_metrics - Get Prometheus metrics. params: {format?} audit_log - View cryptographic interaction log. params: {limit?} write_engram - Write engram to memory. params: {key, value, context?, intensity?}. context: Feature|Architecture|Brand|Strategy|Decision. intensity: 1-10. (alias: "add") query_engrams - Query engrams. params: {context?, min_intensity?, limit?}. limit default 50, max 500. search_engrams - Search engrams. params: {query, case_sensitive?, limit?}. limit default 50, max 500. (alias: "search") governance_status - Get governance status morning_brief - Daily Nucleus Morning Brief hook_metrics - Monitor auto-write engram hooks compounding_status - Compounding Loop status end_of_day - Capture EOD learnings. params: {summary, key_decisions?, blockers?} session_inject - Session-start context injection weekly_consolidate - Weekly consolidation. params: {dry_run?} list_decisions - List DecisionMade events. params: {limit?} list_snapshots - List context snapshots. params: {limit?} metering_summary - Token metering summary. params: {since_hours?} ipc_tokens - List IPC auth tokens. params: {active_only?} dsor_status - Comprehensive DSoR status pulse_and_polish - God Combo: automated health check pipeline. params: {write_engram?}. Runs prometheus→audit→brief→engram. self_healing_sre - God Combo: SRE diagnosis pipeline. params: {symptom, write_engram?}. Runs search→metrics→diagnose→recommend. fusion_reactor - God Combo: self-reinforcing memory loop. params: {observation, context?, intensity?, write_engrams?}. Compounds knowledge. context_graph - Build engram relationship graph. params: {include_edges?, min_intensity?}. Returns nodes, edges, clusters. engram_neighbors - Get neighborhood of an engram. params: {key, max_depth?}. BFS traversal of context graph. billing_summary - Usage cost tracking from audit logs. params: {since_hours?, group_by?}. group_by: tool|tier|session. render_graph - ASCII visualization of engram context graph. params: {max_nodes?, min_intensity?}. federation_dsor - Federation DSoR status routing_decisions - Query routing decision history. params: {limit?} list_tools - List tools at current tier. params: {category?} tier_status - Get tier configuration status dsor_query_decisions- Query the DSoR decision ledger. params: {limit?} dsor_get_trace - Get full provenance trace for a decision. params: {decision_id} heartbeat_check - Proactive context-triggered check-in. params: {notify?, brain_path?}. Checks stale blockers/decisions, velocity drops, session gaps. heartbeat_status - Get heartbeat daemon installation status. params: {brain_path?}. Shows install state + recent check history.
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already indicate readOnlyHint=false and destructiveHint=false. The description details each action's behavior (e.g., 'write_engram' is a write operation, 'health' is read-only) but does not go beyond what the actions imply. No contradiction with annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is excessively long (over 30 lines) listing many sub-actions. While it is structured as a bullet list, it lacks conciseness. Key details are buried, and a more compact summary with links to separate definitions would be better.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (32 sub-actions) and the existence of an output schema (not shown), the description covers each action's purpose and most parameters. Missing operational context like error handling or rate limits prevents a perfect score.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is minimal (action + params object), but the description compensates fully by listing and explaining parameters for each action (e.g., write_engram: key, value, context?, intensity?). Schema description coverage is 0%, so the description provides essential information that the schema lacks.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool covers 'Engrams, health, observability, DSoR & tier system tools' and lists many specific actions, each with a concise one-line purpose. It distinguishes itself from siblings (e.g., nucleus_agents) by focusing on engram/memory operations and system status.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage by listing actions but does not explicitly state when to use this tool over alternatives or provide exclusion criteria. For a meta-tool with many sub-actions, more explicit guidance would help an agent decide.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_featuresFeature FlagsAInspect
Feature tracking, proof generation & MCP server mounting.
Actions: add - Add a feature. params: {product, name, description, source, version, how_to_test, expected_result, status?, tags?} list - List features. params: {product?, status?, tag?} get - Get feature by ID. params: {feature_id} update - Update feature fields. params: {feature_id, status?, description?, version?} validate - Mark feature validated. params: {feature_id, result} search - Search features. params: {query} mount_server - Mount external MCP server. params: {name, command, args?} thanos_snap - Trigger Instance Fractal Aggregation unmount_server - Unmount MCP server. params: {server_id} list_mounted - List mounted MCP servers discover_tools - Discover tools from mounted servers. params: {server_id?} invoke_tool - Invoke tool on mounted server. params: {server_id, tool_name, arguments?} traverse_mount - Recursively mount downstream servers. params: {root_mount_id} generate_proof - Generate proof document. params: {feature_id, thinking?, deployed_url?, files_changed?, risk_level?, rollback_time?} get_proof - Get proof for a feature. params: {feature_id} list_proofs - List all proof documents
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations indicate readOnlyHint=false and destructiveHint=false, but the description does not elaborate on behavioral traits like mutability or side effects. It merely lists actions, which implies mutability for some but lacks explicit disclosure.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is front-loaded with a summary and organized as a list of actions. It is somewhat verbose but structured for readability. Minor redundancy exists (e.g., 'Actions:' header is helpful).
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers all sub-actions and their parameters, but lacks return value explanations (though output schema exists). It does not address error handling, prerequisites, or per-action usage context, which is notable given the tool's complexity.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is generic (action + params), providing no parameter details. The description compensates fully by enumerating each action and its specific parameters, which is critical for correct invocation.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Feature tracking, proof generation & MCP server mounting' as the purpose, and distinguishes it from sibling tools like nucleus_agents or nucleus_governance. However, the tool is multi-action, which slightly dilutes focus.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus alternatives. The description lists sub-actions but does not explain the context or prerequisites for choosing this tool over siblings.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_federationFederationBInspect
Federation management for multi-brain coordination.
Actions: status - Get comprehensive federation status join - Join a federation via seed peer. params: {seed_peer} leave - Leave the federation gracefully peers - List all federation peers with details sync - Force immediate synchronization with all peers route - Route a task to the optimal brain. params: {task_id, profile?} health - Get federation health dashboard
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already declare readOnlyHint=false and destructiveHint=false, indicating potential mutation without destruction. The description lists actions like join, leave, and sync, which are consistent with these traits. However, it does not disclose additional behavioral details (e.g., whether route requires authorization, or what happens during sync). With annotations present, the description adds minimal extra transparency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise, starting with a clear purpose statement followed by a bulleted list of actions. Each action has a one-line description. There is no extraneous information, though the list could benefit from slightly more context.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (multiple sub-actions) and a vague input schema, the description lacks completeness. It does not explain return values, error scenarios, or dependencies (e.g., must be in a federation to leave). With an output schema available but not described, the agent is left without guidance on expected responses.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is generic (action + params object), with 0% description coverage. The description partially compensates by listing expected parameters for join (seed_peer) and route (task_id, profile?). However, it does not fully define parameter structures or types, leaving significant ambiguity for the agent.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The title 'Federation' and description 'Federation management for multi-brain coordination' clearly indicate the tool's domain. The action list (status, join, leave, peers, sync, route, health) provides specific verbs and resources. However, it does not explicitly differentiate from sibling tools like nucleus_route or nucleus_sync, which could have overlapping functionality.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives. It merely lists actions without explaining context, prerequisites, or when not to use it. For a multi-action tool, this is a significant gap.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_governanceGovernanceADestructiveInspect
Governance, Hypervisor & security tools for the Nucleus Agent OS.
Actions: auto_fix_loop - Auto-fix loop: Verify->Diagnose->Fix->Verify (3 retries). params: {file_path, verification_command} lock - [HYPERVISOR] Lock a file/dir immutable (chflags uchg). params: {path} unlock - [HYPERVISOR] Unlock a file/dir. params: {path} set_mode - [HYPERVISOR] Switch IDE context: "red" or "blue". params: {mode} list_directory - [GOVERNANCE] List files in a directory. params: {path} delete_file - [GOVERNANCE] Delete a file (governed by Hypervisor). params: {path, confirm?}. HITL: requires confirm=true. watch - [HYPERVISOR] Monitor a file/folder for changes. params: {path} status - [HYPERVISOR] Report current security state of Agent OS curl - [EGRESS] Proxied HTTP fetch for air-gapped agents. params: {url, method?} pip_install - [EGRESS] Proxied pip install for air-gapped agents. params: {package} validate_strategic_plan - [PROTOCOL] Validate Strategic mode PLAN has Big Bang [BB##] refs. params: {plan_text, mode?} comply_list - [COMPLIANCE] List available regulatory jurisdictions comply_apply - [COMPLIANCE] Apply jurisdiction config. params: {jurisdiction, brain_path?} comply_report - [COMPLIANCE] Generate compliance status report. params: {brain_path?} audit_report - [COMPLIANCE] Generate audit-ready report. params: {report_format?, since_hours?, brain_path?} kyc_review - [COMPLIANCE] Run KYC demo review. params: {application_id?, brain_path?} sovereign_status - [STATUS] Get sovereignty posture report. params: {brain_path?} trace_list - [DSoR] List decision traces. params: {trace_type?, brain_path?} trace_view - [DSoR] View specific trace. params: {trace_id, brain_path?}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations provide destructiveHint true, and the description adds behavioral details: e.g., delete_file requires confirm=true (HITL), lock/unlock change file immutability, curl/pip_install are proxied. This adds significant context beyond annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is well-structured as a bullet list with concise lines per action. It starts with a summary line. Though lengthy due to many actions, each line is informative and free of fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (15+ sub-actions) and minimal schema, the description covers each action's purpose and parameters. With an output schema present, return values need not be detailed. Adequately complete for a dispatcher tool.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is generic with only action and params, but the description lists each sub-action with its specific parameters, fully compensating for the lack of schema coverage. Each action's parameters are explicitly named, providing essential semantics.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Governance, Hypervisor & security tools for the Nucleus Agent OS' and lists sub-actions with brief explanations. While purpose is clear, it's broad as a dispatcher; the tool aggregates many operations, making it distinct from sibling tools.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Tags like [HYPERVISOR] and [GOVERNANCE] provide context, but no explicit guidance on when to use this tool vs. alternatives (e.g., nucleus_tasks). The sub-action descriptions imply usage for specific tasks but lack when-not suggestions.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_infraInfrastructureCDestructiveInspect
Infrastructure: file changes, cloud, marketing & strategy tools.
Actions: file_changes - Get pending file change events gcloud_status - Check GCloud auth status gcloud_services - List Cloud Run services. params: {project?, region?} list_services - List Render.com services scan_marketing_log - Scan marketing log for failures synthesize_strategy - Analyze marketing & update strategy. params: {focus_topic?} status_report - Generate State of the Union. params: {focus?} optimize_workflow - Self-optimize workflow cheatsheet manage_strategy - Read/Update strategy doc. params: {action, content?} update_roadmap - Read/Update roadmap. params: {action, item?} growth_pulse - Full growth pipeline: brief→metrics→streak→compound. params: {write_engrams?} capture_metrics - Refresh GitHub+PyPI metrics + gate evaluation. params: {write_engram?}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations indicate destructiveHint=true and readOnlyHint=false, but the description adds no further behavioral context (e.g., auth needs, side effects, state changes). For example, 'manage_strategy' and 'update_roadmap' imply writes, but no specifics on what gets modified.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is well-structured with a clear intro and bullet-like list of actions. It is concise given the number of sub-actions, with each line providing a short description. A minor improvement would be grouping related actions.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite having an output schema (from context), the description does not explain return values or how results are structured. For a complex dispatcher tool with multiple sub-actions, more detail on behavior and outcomes is needed.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is generic with only 'action' and 'params'. The description adds value by listing valid actions and mentioning parameters for some (e.g., 'params: {project?, region?}' for gcloud_services). However, coverage is incomplete—many actions lack parameter hints, and the schema does not define action enum values.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the tool's domain: 'Infrastructure: file changes, cloud, marketing & strategy tools.' It lists sub-actions, giving a specific scope. However, it does not explicitly describe that it is a dispatcher for these sub-actions, which slightly affects clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance on when to use this tool versus sibling tools like nucleus_agents or nucleus_audit. The description lists actions but does not provide conditions for choosing this tool or its sub-actions over alternatives.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_orchestrationOrchestrationCInspect
Satellite view, commitments, loops, patterns & metrics.
Actions: satellite - Unified satellite view. params: {detail_level?} scan_commitments - Scan artifacts for new commitments archive_stale - Auto-archive commitments older than 30 days export - Export brain to zip list_commitments - List open commitments. params: {tier?} close_commitment - Close a commitment. params: {commitment_id, method} commitment_health - Get commitment health summary open_loops - View all open loops. params: {type_filter?, tier_filter?} add_loop - Add a new open loop. params: {description, loop_type?, priority?} weekly_challenge - Manage weekly challenge. params: {action?, challenge_id?} patterns - Manage learned patterns. params: {action?} metrics - Get coordination metrics pr_watch - Enumerate stale PRs (>N days), classify (auto-mergeable / billing-stuck / needs-verdict), fire one relay per stale PR to the integration coord. params: {threshold_days?, dry_run?}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations provide no hints (readOnlyHint=false, destructiveHint=false). The description lists actions like 'close_commitment' and 'archive_stale' which imply side effects, but does not disclose behavioral details such as required permissions, idempotency, or consequences of actions. This is a significant gap.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is structured as a bullet list with action names and parameter hints, which aids scanning. However, it is verbose due to the many sub-actions and could be shortened. The front-loaded summary line is good, but the list dominates.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Despite having an output schema (not shown), the description does not mention return values or behavior for each action. It lacks information on how to interpret results, error handling, or action-specific success conditions. For a tool with many sub-actions, this is incomplete.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is minimal (action string, params object) with 0% parameter description coverage. However, the description lists each sub-action and its informal parameters (e.g., 'params: {detail_level?}', 'params: {commitment_id, method}'). This adds meaningful context beyond the bare schema, partially compensating for the lack of formal definitions.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description lists many sub-actions under a broad header 'Satellite view, commitments, loops, patterns & metrics.' It gives a general sense of orchestration activities but lacks a specific verb-resource description for the tool as a whole. The purpose is vague and not clearly differentiated from siblings.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus the many sibling tools. There are no context trigger, prerequisites, or alternative tool mentions. The agent must guess usage from the action list alone.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_relayRelay MessagingAInspect
Relay-substrate facade — post / read / ack / status.
Actions: post - Send a relay envelope. params: {to, subject, body, sender?, priority?, in_reply_to?, context?, id?, from_session_id?} sender? auto-fills from your session role when omitted. to accepts short role aliases: main, peer, tb, ops, agy, board. "recipient" is accepted as an alias for "to". Returns {sent: bool, id: str (server message_id), error?: str}. inbox - List inbox messages. params: {role?, unread_only?, limit?} role auto-fills from CC_SESSION_ROLE env. Returns {messages: [...], role: str}. ack - Mark messages seen. params: {message_ids: [str], role?} Returns {acked: int, failed: int}. status - Diagnostic (no server call). params: {role?} Returns {is_http_mode, relay_url_set, bearer_set, canonical_role, resolved_inbox_dir}.
Bearer resolves per-role at call-time (~/.tb/relay_token_, falling back to NUCLEUS_RELAY_BEARER env) — never passed via this tool's prompt surface. Per ADR-0036 amendment c068abc1 + v0.2.1 Layer A.
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description discloses behavioral traits beyond annotations, such as auto-fill of sender and role, bearer token resolution without exposing secrets, and return formats. No contradiction with annotations (readOnlyHint false, destructiveHint false, openWorldHint true).
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is well-structured with clear action headings and detailed parameter lists. It is slightly verbose but every sentence adds value. Could be more concise, but front-loads the main purpose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity (4 actions, many optional parameters, security details), the description is extremely complete. It covers all actions, parameters, return values, and security configuration. The output schema exists and is described, so return values are clear.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has only two generic parameters (action, params) with 0% description coverage. The description compensates fully by detailing all possible parameters for each action (e.g., to, subject, body, sender?), including optional fields and aliases, making the tool usable.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it is a 'Relay-substrate facade — post / read / ack / status' and lists each action with specific verbs and resources. It distinguishes from sibling tools like nucleus_relay_subscribe by focusing on core relay operations.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides clear context for when to use this tool (relay operations) and lists all actions. However, it does not explicitly exclude alternatives or mention when to use sibling tools like nucleus_relay_subscribe, leaving some ambiguity.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_relay_subscribeRelay SubscriptionARead-onlyInspect
Long-poll subscription that pushes ctx.info() on each new inbox file.
Replaces bash polling daemons (watch-relay-*.sh) with server-initiated push. Call once at session start (e.g. via SessionStart hook). Server holds the subscription, watches the calling agent's role-specific inbox dir, and fires info-level notifications on each new relay file arrival. Client re-calls this in a loop for persistent coverage.
Per PR #1 (CCR-inversion-for-relay-pickup): inbox_filter parameter added
to BYPASS role-based dir resolution. Use when role detection is unreliable
OR when subscribing to a specific canonical inbox (e.g., 'cc_tb').
Closes 3-week-old feedback_relay_arrival_invisible_midsession HARD RULE.
| Name | Required | Description | Default |
|---|---|---|---|
| role | No | override calling agent's role (default: env-detected from CC_SESSION_ROLE or detect_session_role()) | |
| inbox_filter | No | explicit inbox dir NAME (e.g., "cc_tb", "claude_code_main"). When provided, BYPASSES role detection entirely. Recommended for SessionStart hook deterministic arming. Resolves the bug where cc-tb's role detection mapped to wrong inbox. | |
| timeout_seconds | No | max subscription duration (60..1800, default 270 = under FastMCP context-cache TTL so re-subscribe doesn't burn cache) |
Output Schema
| Name | Required | Description |
|---|---|---|
No output parameters | ||
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already declare readOnlyHint=true and destructiveHint=false. The description adds context about the subscription holding, polling replacement, and role detection bypass. It does not contradict annotations and provides behavioral context beyond them.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is front-loaded with purpose, then usage, then parameter details. It is slightly verbose but every sentence adds value. Could be tightened but overall well-structured.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the complexity (long-poll, role detection, inbox filter) and the presence of annotations and output schema, the description covers all necessary behavioral and usage aspects. It also includes background references (PR #1, HARD RULE) for full context.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, but the description adds value: for inbox_filter it explains bypassing role detection and recommends for SessionStart hook; for timeout_seconds it ties to FastMCP TTL; for role it mentions default detection. Enhances understanding beyond schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states it's a 'Long-poll subscription that pushes ctx.info() on each new inbox file.' The verb-subscribe and resource-inbox file arrival are specific. It distinguishes from sibling tools like nucleus_relay by mentioning it replaces bash polling daemons.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description gives explicit when-to-use guidance: 'Call once at session start (e.g. via SessionStart hook)' and 'Use when role detection is unreliable OR when subscribing to a specific canonical inbox.' It implies a loop for persistence but does not explicitly list alternatives for when not to use.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_routeCost RouterARead-onlyInspect
W5 tier routing — route a prompt to the optimal model tier.
Actions: route - Route a prompt to cheapest capable model. params: {prompt, complexity?, context?, estimated_output_tokens?} complexity ∈ 'routine' | 'complex' | 'sovereign' (default 'routine') Returns: provider, model, cost estimates, sovereignty tier.
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations already provide readOnlyHint=true and destructiveHint=false. The description adds behavioral context by specifying the return structure (provider, model, cost estimates, sovereignty tier) and the complexity parameter enum. No contradictions with annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is concise and front-loaded with the core purpose. It uses a structured list to present parameters efficiently, with no extraneous information.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers the main 'route' action adequately, but there is no mention of other possible actions (given the generic action parameter). The output schema is hinted at but not fully described. More detail would improve completeness.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With schema coverage at 0%, the description compensates by detailing the sub-parameters of the 'route' action (prompt, complexity, context, estimated_output_tokens). However, it only documents one action, leaving other potential actions undocumented.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'route a prompt to the optimal model tier' and provides a specific example action 'route' with its parameters. The tool name and title align, and the purpose is distinct from sibling tools like nucleus_agents and nucleus_audit.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description implies usage when routing a prompt to the cheapest capable model, but it does not provide explicit guidance on when to use this tool versus alternatives, nor does it mention any exclusions or prerequisites.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_sessionsSession ManagementBInspect
Session management, events, state & checkpoint tools.
Actions: save - Save session for later. params: {context, active_task?, pending_decisions?, breadcrumbs?, next_steps?} resume - Resume a saved session. params: {session_id?} list - List all saved sessions check_recent - Check for recent session to resume (alias: "current") end - End work session. params: {summary?, learnings?, mood?} start - Mandatory session start protocol archive_resolved - Archive .resolved.* backup files propose_merges - Detect redundant artifacts, generate merge proposals garbage_collect - Archive stale tasks. params: {max_age_hours?, dry_run?} emit_event - Emit event to brain ledger. params: {event_type, emitter, data, description?} read_events - Read recent events. params: {limit?} get_state - Get brain state. params: {path?} update_state - Update brain state. params: {updates} checkpoint - Save task checkpoint. params: {task_id, step?, progress_percent?, context?, artifacts?, resumable?} resume_checkpoint - Resume from checkpoint. params: {task_id} handoff_summary - Generate handoff summary. params: {task_id, summary, key_decisions?, handoff_notes?} ingest_conversations - Ingest Claude Code JSONL transcripts. params: {mode?: "incremental"|"batch"|"single", session_id?, limit?, dry_run?} search_conversations - Search ingested conversations. params: {query, limit?, session_id?, date_from?, date_to?} list_conversations - List ingested sessions. params: {limit?, offset?, sort?: "recent"|"size"|"turns"} conversation_stats - Aggregate conversation corpus statistics register - [T3.11] Register agent session envelope. params: {session_id, agent, role, provider, worktree_path?, pid?, heartbeat_interval_s?} heartbeat - [T3.11] Touch last_heartbeat on an envelope. params: {session_id} unregister - [T3.11] Delete a session envelope. params: {session_id} list_agents - [T3.11] List registered agent envelopes. params: {worktree_path?, role?, alive_only?} detect_splits - [T3.11] Report (worktree, role) buckets with >1 alive session. params: {worktree_path?}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations are minimal (readOnlyHint=false, destructiveHint=false), so the description bears more responsibility. While the action names imply behavior (e.g., 'save' is a write, 'list' is read), the description does not disclose side effects, required permissions, or error behaviors. The per-action comments like 'Mandatory session start protocol' provide some transparency but overall lacks depth.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description begins with a clear one-sentence summary, which is good. However, the subsequent list of 20+ sub-actions is lengthy and somewhat repetitive. It could be condensed by grouping similar actions or using bullet points. The structure is clear but not minimal; it earns its space but is verbose.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (many sub-actions, nested params, 15+ siblings), the description covers actions and their parameters well. However, it omits return values (even though an output schema exists), error conditions, and practical usage context for each sub-action. For a dispatcher tool, more guidance on action interactions would be beneficial.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is bare (only action and params with no descriptions), and schema description coverage is 0%. The description compensates by listing each action's expected parameter keys (e.g., for save: context, active_task?, etc.). This adds significant meaning beyond the schema, helping the agent know what to include. However, it does not specify types or constraints within those nested objects.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The title 'Session Management' and the opening line 'Session management, events, state & checkpoint tools.' clearly indicate the tool's domain. The detailed list of sub-actions (save, resume, list, etc.) provides specific verbs and resources, distinguishing it from siblings like nucleus_tasks or nucleus_agents. However, the primary purpose is somewhat buried in a long list; a single concise sentence would improve clarity.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance is provided. The description is purely a list of sub-actions without explaining in what scenarios to prefer this tool over its siblings (e.g., nucleus_tasks for task-specific operations). An agent would have to infer usage from the action names alone, which is insufficient.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_slotsSlots & SprintsBInspect
Orchestration slots, sprints & mission management.
Actions: orchestrate - THE GOD COMMAND. params: {slot_id?, model?, alias?, mode?} slot_complete - Mark task complete. params: {slot_id, task_id, outcome?, notes?} slot_exhaust - Mark slot exhausted. params: {slot_id, reset_hours?} status_dashboard - ASCII dashboard. params: {detail_level?} autopilot_sprint - Sprint command. params: {slots?, mode?, halt_on_blocker?, halt_on_tier_mismatch?, max_tasks_per_slot?, budget_limit?, dry_run?} force_assign - Force assign task. params: {slot_id, task_id, acknowledge_risk?} autopilot_sprint_v2 - Enhanced sprint V3.1. params: {slots?, mode?, halt_on_blocker?, halt_on_tier_mismatch?, max_tasks_per_slot?, budget_limit?, time_limit_hours?, dry_run?} start_mission - Start mission. params: {name, goal, task_ids, slot_ids?, budget_limit?, time_limit_hours?, success_criteria?} mission_status - Get mission status. params: {mission_id?} halt_sprint - Halt sprint. params: {reason?} resume_sprint - Resume sprint. params: {sprint_id?}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations indicate it is not read-only and not necessarily destructive. The description lists mutating actions (e.g., slot_complete, force_assign) but does not disclose side effects, permission requirements, rate limits, or whether operations are reversible. It adds little behavioral context beyond the action names.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is dense and lengthy, listing all actions with parameters in a pseudo-bullet format. While structured, it is verbose and could be more concise by grouping or abbreviating redundant information. Still, it is organized and readable.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
The description covers all actions and their parameters, but lacks context on return values (despite an output schema being present), error handling, workflow sequencing, and when to prefer this tool over siblings. For a complex multi-action tool, more contextual guidance would be beneficial.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema has 0% description coverage, so the description fully compensates by enumerating each action's parameters with names, optional indicators (?), and brief descriptions (e.g., 'THE GOD COMMAND'). This provides complete semantic meaning that the schema lacks.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description states the tool is for 'Orchestration slots, sprints & mission management.' This gives a clear domain. The list of actions further clarifies specific capabilities. However, it does not explicitly distinguish from sibling tools like nucleus_orchestration or nucleus_tasks, relying on the action names to imply scope.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit guidance on when to use this tool versus alternatives. The description lists actions but does not state prerequisites, exclusions, or comparative advantages over sibling tools. An agent would have to infer usage from action names alone.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_syncCross-Agent SyncCInspect
Sync, artifact, trigger & deploy management for multi-agent coordination.
Actions: identify_agent - Register agent identity. params: {role, provider, session_id} (per ADR-0005 §D1) OR legacy {agent_id, environment, role?} (coerced per §D5 until end of Cycle C+2) sync_status - Check current multi-agent sync status sync_now - Manually trigger sync. params: {force?} sync_auto - Enable/disable file watching. params: {enable} sync_resolve - Resolve a file conflict. params: {file_path, strategy?} read_artifact - Read an artifact file. params: {path} write_artifact - Write to an artifact file. params: {path, content} list_artifacts - List artifacts. params: {folder?} trigger_agent - Trigger an agent via event. params: {agent, task_description, context_files?} get_triggers - Get all defined neural triggers evaluate_triggers - Evaluate triggers for an event. params: {event_type, emitter} start_deploy_poll - Start monitoring a Render deploy. params: {service_id, commit_sha?} check_deploy - Check deploy poll status. params: {service_id} complete_deploy - Mark deploy complete. params: {service_id, success, deploy_url?, error?, run_smoke_test?} smoke_test - Run a smoke test. params: {url, endpoint?} shared_read - Read shared state. params: {key} shared_write - Write shared state. params: {key, value, agent_id?} shared_list - List all shared state keys notify - Send notification to all channels. params: {title, message, level?} list_channels - List configured notification channels add_channel - Add a channel. params: {channel_type, webhook_url?} test_channel - Test a channel. params: {channel_name?} relay_post - Post message to another session type (Cowork↔Claude Code). params: {to, subject, body, priority?, context?, sender?, to_session_id?, from_session_id?, in_reply_to?} relay_inbox - Read messages for current session type. params: {unread_only?, limit?, recipient?, session_id?} relay_ack - Mark a relay message as read. params: {message_id, recipient?, session_id?} relay_status - Get relay mailbox status across all session types relay_clear - Clean up old relay messages. params: {recipient?, older_than_hours?} relay_log_event - Log a fire/skip event. params: {event, side, subject, tags?, match_reason?, priority?, message_id?, in_reply_to?} relay_skip_review - List recent unclassified skips. params: {limit?} relay_classify_skip - Classify a skip event. params: {ts, subject, classification, note?} relay_event_stats - Compute override + skip rates from event_log.jsonl marketplace_search - Search registered capability cards. params: {tags?, min_tier?, limit?} marketplace_whoami - Get caller's address, tier, reputation. params: {role?} marketplace_can_call - Pre-flight permission check. params: {caller, target} marketplace_recommend - Recommend agents by task description. params: {task, top_k?} marketplace_dashboard - Aggregated health snapshot. params: {} marketplace_history - Reputation event timeline for an address. params: {address, limit?} marketplace_promote - Admin: manually set address tier. params: {address, new_tier, caller?} marketplace_quarantine - Admin: flag address quarantined. params: {address, caller?, reason?} marketplace_audit - Replay admin_actions.jsonl with filters. params: {caller?, target?, action_type?, since_timestamp?, limit?, offset?} marketplace_compare - Head-to-head comparison of two addresses. params: {a, b} marketplace_trends - Tier distribution trend over N days. params: {days?} marketplace_alert - Subscribe to alert rules. params: {subscriber, target, event_types?} marketplace_export - Full registry snapshot (read-only). params: {} marketplace_diff - Diff two registry snapshots. params: {snapshot_a, snapshot_b} marketplace_subscribe - Subscribe to tier-change events. params: {subscriber, target?, event_types?} marketplace_unsubscribe - Remove subscription. params: {subscriber, target?} marketplace_subscriptions - List subscriptions. params: {subscriber?} marketplace_federation_proxy - Proxy an action to a remote federation brain. params: {target_brain, action, payload?} marketplace_federation_register - Register local brain as a federated capability card. params: {address, capabilities?, display_name?, tags?} marketplace_federation_sync - Force federation sync and reconcile marketplace registry. params: {}
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations are minimal (readOnlyHint=false, destructiveHint=false), and the description adds little behavioral context. Many sub-actions imply state changes (e.g., write_artifact, trigger_agent), but the description does not disclose side effects, authorization requirements, or safety considerations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is excessively long (30+ lines) and poorly structured as a monolithic list. It lacks an executive summary or categorization, making it difficult for an agent to quickly grasp the tool's capabilities.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
While the description enumerates many sub-actions, it fails to cover usage context, error handling, return value interpretation (despite an output schema), or integration with siblings. The tool's complexity demands richer guidance than a mere list.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Despite 0% schema description coverage, the description embeds parameter hints for specific sub-actions (e.g., identify_agent, resolve_deploy), partially compensating for the generic {action, params} schema. However, many sub-actions lack detail, and the open-ended params object remains vague.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The title 'Cross-Agent Sync' suggests synchronization, but the description covers a wide range of unrelated features including artifacts, triggers, deploy, relay, and marketplace operations. This diffuseness obscures the core purpose and fails to distinguish nucleus_sync from sibling tools like nucleus_relay or nucleus_federation.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description provides no guidance on when to use this tool versus alternatives. It simply enumerates sub-actions without explaining the overall use case, prerequisites, or conditions that would lead an agent to select nucleus_sync over other tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_tasksTask ManagementAInspect
Task management, depth tracking & ADHD context-switch tools.
Actions: list - List tasks. params: {status?, priority?, skill?, claimed_by?} get_next - Get highest-priority unblocked task. params: {skills} claim - Atomically claim a task. params: {task_id, agent_id} update - Update task fields. params: {task_id, updates} add - Create a new task. params: {description, priority?, blocked_by?, required_skills?, source?, task_id?, skip_dep_check?} (alias: "create") import_jsonl - Import tasks from JSONL. params: {jsonl_path, clear_existing?, merge_gtm_metadata?} escalate - Escalate task for human help. params: {task_id, reason} depth_push - Go deeper into subtopic. params: {topic} depth_pop - Come back up one level depth_show - Show current depth state depth_reset - Reset depth to root depth_set_max - Set max safe depth. params: {max_depth} depth_map - Generate exploration map context_switch - Record context switch / ADHD drift check. params: {new_context} context_switch_status - Get context switch metrics context_switch_reset - Reset context switch counter
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description lists actions like 'claim', 'update', 'add', 'import_jsonl' which are clearly mutating, but it does not explicitly disclose side effects, destructive potential, or permission requirements. The annotations (readOnlyHint=false, destructiveHint=false) are not contradicted but the description could add more behavioral context.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is well-organized with a clear title and bullet-like action list, but it is lengthy (17 actions). The most important information (action names and parameters) is front-loaded, though some repetition could be trimmed.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (multiple sub-actions) and the generic input schema, the description thoroughly enumerates all actions and their parameters. It does not cover return values or error conditions, but the presence of an output schema likely fills that gap. Overall, it provides sufficient context for an agent to select and invoke the right action.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema is generic (action + params object with additionalProperties:true), so the description compensates by fully detailing each action's parameters (e.g., 'list - params: {status?, priority?, skill?, claimed_by?}'), which is essential for correct invocation. Schema coverage is 0%, making this description critical.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states 'Task management, depth tracking & ADHD context-switch tools,' providing a specific verb-resource combination that distinguishes it from sibling tools like nucleus_agents or nucleus_audit.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
The description lists many actions with their purposes, implying when to use the tool for task, depth, or context-switch operations. While it does not explicitly compare with siblings, the tool's domain is clear and distinct from the sibling tools.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
nucleus_telemetryTelemetryCInspect
LLM tiers, telemetry, PEFS notifications & protocol tools.
Actions: set_llm_tier - Set default LLM tier. params: {tier} get_llm_status - Get LLM tier configuration record_interaction - Record user interaction timestamp value_ratio - Get Value Ratio metric check_kill_switch - Check Kill Switch status pause_notifications - Pause PEFS notifications resume_notifications - Resume PEFS notifications record_feedback - Record notification feedback. params: {notification_type, score} mark_high_impact - Mark loop closure as high-impact check_protocol - Check protocol compliance. params: {agent_id} request_handoff - Request agent handoff. params: {to_agent, context, request, priority?, artifacts?} get_handoffs - Get pending handoffs. params: {agent_id?} agent_cost_dashboard - Get agent cost tracking dashboard dispatch_metrics - Get dispatch telemetry (per-action timing, error rates) rate_limit_status - Get dispatch rate limiter status (calls per facade, window)
| Name | Required | Description | Default |
|---|---|---|---|
| action | Yes | ||
| params | No |
Output Schema
| Name | Required | Description |
|---|---|---|
| result | Yes |
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations indicate readOnlyHint=false and destructiveHint=false, and the description mentions mutation actions like record_interaction, but it does not disclose side effects, authorization requirements, rate limits, or error handling beyond the action names.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description is structured as a bullet list with actions and brief descriptions, making it easy to scan. The initial sentence provides a high-level summary. However, the list is long and could be condensed by grouping similar actions.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the tool's complexity (many actions, nested objects in output schema) and lack of structured parameter descriptions, the description is incomplete. It does not explain return values, error conditions, or action dependencies, leaving an agent underinformed.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
The input schema provides no descriptions (0% coverage) and the description only lists parameter names for some actions (e.g., {tier}, {notification_type, score}) without types, formats, or defaults. This is insufficient for an agent to construct valid parameters.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description opens with a clear summary 'LLM tiers, telemetry, PEFS notifications & protocol tools' and lists specific actions, which collectively define the tool's domain. However, the purpose could be more explicitly stated as a single coherent function rather than a collection.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No guidance is provided on when to use this tool versus sibling tools such as nucleus_governance or nucleus_agents. The description simply enumerates actions without context for selection.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
{
"$schema": "https://glama.ai/mcp/schemas/connector.json",
"maintainers": [{ "email": "your-email@example.com" }]
}The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
Control your server's listing on Glama, including description and metadata
Access analytics and receive server usage reports
Get monitoring and health status updates for your server
Feature your server to boost visibility and reach more users
For users:
Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
Centralized credential management – store and rotate API keys and OAuth tokens in one place
Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently
For server owners:
Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to successfully connect to the server. This can happen for several reasons:
The server is experiencing an outage
The URL of the server is wrong
Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.
Discussions
No comments yet. Be the first to start the discussion!
Your Connectors
Sign in to create a connector for this server.