nexus-agents
Integrates with arXiv as one of 9 research discovery sources for the research system, enabling literature review, state-of-the-art analysis, and auto-cataloging of academic papers.
Provides sandboxing capabilities for security through Docker/policy-based isolation, enabling secure execution of AI-generated code and workflows.
Enables GitHub issue tracking and PR review capabilities, allowing the system to analyze codebases, review pull requests, and integrate with GitHub workflows.
Provides GitLab issue tracking integration similar to GitHub, supporting repository analysis and issue management within GitLab projects.
Routes tasks to Google's Gemini CLI for long-context and multimodal AI capabilities as part of the intelligent routing system to multiple AI providers.
Integrates with OpenAI's Codex CLI and Codex MCP for code generation and reasoning tasks, plus supports custom OpenAI-compatible endpoints through the opencode adapter.
Connects to Semantic Scholar as one of 9 research discovery sources, enabling academic research, paper analysis, and knowledge synthesis for the research system.
Nexus Agents
Governance substrate for your AI coding agents — adversarial review, drift-detected rules, immutable audit, closed-loop telemetry
Why Nexus Agents?
Nexus-agents is a governance layer that sits above your AI coding agents — Claude Code, Codex, Gemini, and OpenCode. The agents do the engineering; nexus-agents enforces the rules they have to follow, reviews their work adversarially before it ships, audits everything they touch, and routes the next task based on what actually worked.
What it gives you:
Adversarial PR review —
pr_reviewruns 5 voter roles (architect, security, devex, catfish, scope_steward) with a 4-point verification gate. On the v5 evaluation set (10 PRs): 100% bug-catch and 50% raw false-positive rate; manual triage reclassified most "FPs" as legitimate findings the dataset had mislabeled. Full numbers: docs/research/pr-review-experiment-results-v5.mdDrift-detected charter —
CLAUDE.md+governance:check+ blocking CI gates fail the build when documented rules drift from registered behavior (model registry, MCP tools, expert types, skills)Immutable audit trail — every tool call, every voter decision, every routing choice flows through
AuditTrailwith structured logging and hash-chained append-only storage; integrity is verifiable via theverify_audit_chainMCP toolClosed-loop routing —
OutcomeStorefeeds production telemetry back into LinUCB + TOPSIS scoring so the system actually learns from what shipped vs what regressedMulti-voter consensus —
consensus_voteruns a default 7-role panel (architect, security, devex, ai_ml, pm, catfish, scope_steward;--quickuses 3). Six strategies: simple/super-majority, unanimous, higher-order Bayesian, opinion-wise, proof-of-learning
You: "Review this PR / orchestrate this task / vote on this proposal"
↓
nexus-agents: enforce rules → route → adversarial review → audit → learn from outcome
↓
Engineering agents: Claude Code · Codex · Gemini · OpenCode
↓
Code: actual edits, tests, PRs, issuesWhat this is NOT:
Not another autonomous coding agent. OpenHands, SWE-agent, AutoGen, Devin, Factory — those are agents. Nexus-agents is the layer above them. Use whichever agents fit; we govern them
Not a chat framework. Nothing here orchestrates conversations. It orchestrates real CLI tool invocations with real file I/O and outcome tracking
Not a model API proxy. The value is the rules, the gates, the audit, and the learning. Routing is a side effect of the governance work, not the product
Where nexus-agents sits in your stack
Human / IDE / CLI
(Claude Code, Cursor, VS Code, terminal)
│ MCP Protocol
▼
┌─────────────────────────────────────────────────────┐
│ GOVERNANCE SUBSTRATE — what nexus-agents provides │
│ │
│ Charter (drift-checked) Adversarial PR review │
│ Role registry Multi-voter consensus │
│ Immutable audit trail Closed-loop telemetry │
│ │
│ 34 MCP tools · multi-stage CompositeRouter │
└────────────────────────┬────────────────────────────┘
│
▼ delegates execution to
┌─────────────────────────────────────────────────────┐
│ ENGINEERING AGENTS — what does the actual work │
│ │
│ Claude Code · Codex · Gemini · OpenCode │
└────────────────────────┬────────────────────────────┘
│
▼ produces
Code, tests, PRs, issuesThe governance substrate is the layer that catches the mistakes engineering agents would otherwise make — bad code shipped, rules drifting from intent, audit gaps, telemetry-free routing — and routes the next task based on what actually worked the last time.
Quick Start (2 minutes)
1. Install
npm install -g nexus-agentsOr as a Claude Code plugin (single-command install from the official marketplace):
/plugin install nexus-agentsSee docs/getting-started/PLUGIN_INSTALL.md for plugin-specific setup, or llms-install.md for the short install guide an AI agent can follow.
2. Verify
nexus-agents doctor3. Use
With Claude Code (recommended):
nexus-agents setup # Auto-configures MCP serverThen in Claude: "orchestrate: Review this PR for issues"
Standalone CLI:
export ANTHROPIC_API_KEY=your-key
nexus-agents orchestrate "Explain the architecture of this codebase"Security: In default MCP mode, the server communicates only via stdio with the parent process (no network exposure). The REST API (opt-in) auto-generates an API key on first start. For network-exposed deployments, set
NEXUS_AUTH_ENABLED=true. See SECURITY.md.
Capabilities
Category | Details |
Adversarial PR Review |
|
Consensus Voting | 6 strategies: simple_majority, supermajority, unanimous, higher_order (Bayesian correlation-aware), opinion_wise, proof_of_learning |
Drift-Detected Charter |
|
Audit Trail | Structured logging for every tool call, voter decision, and routing choice. Hash-chained immutable storage; integrity verifiable via |
Closed-Loop Telemetry |
|
Security Pipeline | Sandboxing (Docker/policy), trust-tiered input handling, SARIF parsing, red-team patterns, ClawGuard access policies (audit/enforce) |
Multi-Expert Orchestration | 12 built-in expert types coordinated by Orchestrator. Roles bind prompt + tools + memory |
Development Pipeline | Research → Plan → Vote → Decompose → Implement → QA → Security. Three modes: autonomous, harness (caller implements), dry-run |
Memory & Learning | 5 user-facing backends (session, belief, agentic, adaptive, typed). Cross-session persistence feeds routing decisions |
Research System | 9 discovery sources (arXiv, GitHub, Semantic Scholar, etc). Auto-catalog, quality scoring, synthesis into topic clusters |
Graph Workflows | DAG-based workflow execution with checkpoint/resume, state reduction, and event hooks |
34 MCP Tools | Agent management, workflow execution, research, memory, codebase intelligence, repo analysis, consensus, operations |
Available Experts
Expert | Specialization |
Code | Implementation, debugging, optimization |
Architecture | System design, patterns, scalability |
Security | Vulnerability analysis, secure coding |
Testing | Test strategies, coverage, test generation |
QA | Acceptance criteria, regression checks |
Documentation | Technical writing, API docs |
DevOps | CI/CD, deployment, infrastructure |
Research | Literature review, state-of-the-art analysis |
PM | Product management, requirements, priorities |
UX | User experience, usability, accessibility |
Infrastructure | Server management, bare metal, networking |
Data Viz | Charts, dashboards, visual data presentation |
Supported CLIs & Providers
Nexus-agents routes tasks through 5 CLI adapters, each connecting to major AI providers:
CLI | Provider | Best For |
claude | Anthropic (Claude) | Complex reasoning, analysis |
gemini | Google (Gemini) | Long context, multimodal |
codex | OpenAI (Codex CLI) | Code generation, reasoning |
codex-mcp | OpenAI (Codex MCP) | MCP-native Codex integration |
opencode | Custom OpenAI-compat | Custom endpoints, local models |
CLI Commands
nexus-agents # Start MCP server (default)
nexus-agents doctor # Check installation health
nexus-agents setup # Configure Claude CLI integration
nexus-agents orchestrate "..." # Run task with experts
nexus-agents vote "proposal" # Multi-agent consensus voting
nexus-agents review <pr-url> # Review a GitHub PR
nexus-agents expert list # List available experts
nexus-agents workflow list # List workflow templates
nexus-agents config init # Generate config file
nexus-agents init --portable # Create workspace-local .nexus-agents/ for sandboxes
nexus-agents init --portable --mcp-config # Also emit .mcp.json wiring Claude Code to it
nexus-agents init --portable --install --mcp-config # …and install the binary into the workspace
nexus-agents fitness-audit # Run fitness score audit
nexus-agents research query # Query research registry
nexus-agents --help # Full command listSee docs/ENTRYPOINTS.md for the complete CLI reference (28+ commands).
MCP Tools
When running as an MCP server, the following tools are available:
Tool | Description |
| Task orchestration with Orchestrator coordination |
| Create a specialized expert agent |
| Execute a task using a created expert |
| Execute a workflow template |
| Route task to optimal model |
| List available expert types |
| List available workflow templates |
| Multi-model consensus voting on proposals |
| Query research registry (status, overlap, stats, search) |
| Add paper to registry by arXiv ID |
| Add non-paper source (GitHub repo, tool, blog) |
| Discover papers/repos from external sources |
| Analyze registry for gaps, trends, coverage |
| Review auto-cataloged research references |
| Synthesize registry into topic clusters with themes |
| Query across all memory backends |
| Memory system statistics dashboard |
| Write to typed memory backends |
| Multi-CLI performance weather report |
| Triage GitHub issues with trust classification |
| Execute graph-based workflows with checkpointing |
| Execute AI software factory spec pipeline |
| Generate draft model registry entry |
| Query execution traces for observability |
| Query the structured task-state log for a task ID |
| Verify hash chain of a FileAuditStorage audit log directory |
| Analyze GitHub repository structure |
| Generate security scanning pipeline for a repo |
| Extract code symbols from source files for analysis |
| Search codebase for patterns, symbols, or text |
| Full dev pipeline: research, plan, vote, implement, QA |
| Execute a pipeline plugin by name with typed input |
| Multi-voter PR review with verification gate (experimental) |
| Per-axis tradeoff vote for build-vs-buy / supply-chain decisions |
Configuration
Environment Variables:
Variable | Description |
| Claude API key |
| OpenAI API key |
| Gemini API key |
| Log level (debug/info/warn/error) |
Generate config file:
nexus-agents config init # Creates nexus-agents.yamlDocumentation
Topic | Link |
Full CLI Reference | |
Architecture | |
Contributing | |
Coding Standards | |
Quick Start Guide |
Development
git clone https://github.com/williamzujkowski/nexus-agents.git
cd nexus-agents
pnpm install
pnpm build
pnpm testRequirements: Node.js 22.x LTS, pnpm 9.x
Contributing
Fork the repository
Create a feature branch (
git checkout -b feat/amazing-feature)Commit with conventional commits (
feat(scope): add feature)Open a Pull Request
See CONTRIBUTING.md for details.
License
MIT - See LICENSE
Built with Claude Code
This server cannot be installed
Maintenance
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/williamzujkowski/nexus-agents'
If you have feedback or need assistance with the MCP directory API, please join our Discord server