trinity-local
Integrates with Google's Gemini models (via Gemini web and CLI) to run councils for answer synthesis and disagreement analysis.
Integrates with Ollama to automatically discover and include local models as council members for decentralized answer synthesis.
Integrates with OpenAI's GPT models (via ChatGPT and CLI) to run councils for answer synthesis and disagreement analysis.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@trinity-localRun a Trinity council on whether to use SQLite or DuckDB for this analytics workload."
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
class: live
⠕ Trinity Local
Related MCP server: Councly MCP Server
Own your taste.
Stop copy-pasting prompts between tabs like an animal. Ask once. Trinity fans it out to Claude, GPT, and Gemini. Shows you where they split. Makes the call the way your taste would.
Install — just an MCP and a Chrome extension. No new app, no cloud, no API key.

Inside Claude Code (or Codex CLI / Antigravity / Cursor) — just ask:
Run a Trinity council on whether to use SQLite or DuckDB for this analytics workload.
The agent calls mcp__trinity-local__run_council for you. Claude, Codex, and Gemini answer in parallel; the chairman synthesizes and returns the verdict inline:
Winner: DuckDB — all three agree it wins on analytical scan speed. Where they split: Claude flags SQLite's simpler ops story; Codex and Gemini don't. Why it matters for you: you've shipped solo before and kept picking the lower-ops option — so the chairman weights that split toward "SQLite if you'll operate it alone."
That last move — the chairman knowing which split matters to you — is the part no single chat tab can do. It reads your lens: the pattern in how you rephrase, judge, and decide, distilled from your own transcripts. The launchpad above is the same surface in a browser tab — open it from the Chrome extension to scan recent councils, your lens, and the topic graph.
The Chrome extension does two things. As you chat on claude.ai / chatgpt.com / gemini.google.com, it captures each conversation to ~/.trinity/conversations/ on your machine — no listening port, no upload; Chrome's Native Messaging spawns a local capture host on demand. And it hosts the launchpad you click open from the toolbar. Together with the CLI sessions on disk (~/.claude/, ~/.codex/, ~/.gemini/), the extension's captures are what your lens distills from.
You'll want at least Claude + Codex CLI installed. The magic is the disagreement — a council needs a second voice. One provider runs, but the "where they split" payoff needs two.
Own your taste. The council gives you a synthesized answer now; the lens makes the next one more yours. Every council, every rejected answer, every rephrase sharpens a profile of your judgment that lives only on your machine (Anthropic can't read your ChatGPT; OpenAI can't read your Claude). The longer you use it, the more the synthesized answer is the one you would have written.
No new app. No service. No API key. Captures flow to your machine; Trinity uploads nothing — your transcripts and lens stay on disk. Everything else is an MCP server inside the harnesses you already use. Free for individuals, forever — MIT, local. Running it across a team? Same product, with support: Trinity for teams.
Install
Recommended — one line. Clones the repo (you can read it end-to-end), installs the runtime deps, registers Trinity's MCP server in every harness it detects (Claude Code, Codex CLI, Antigravity, Cursor), and pre-wires the Chrome-capture host:
<!-- canonical:install_command -->curl -fsSL https://raw.githubusercontent.com/keepwhatworks/trinity/main/scripts/install.sh | bash<!-- /canonical -->No PyPI, no npm, no API key — just git clone + a couple of shell wrappers in ~/.local/bin/. Verify with trinity-local status. To remove: trinity-local uninstall --yes.
In Claude Code? One-command plugin install. Trinity ships as a Claude Code plugin that registers the MCP server and adds native slash commands (/trinity-local:council, :ask, :lens) — no manual install-mcp:
/plugin marketplace add keepwhatworks/trinity
/plugin install trinity-local@trinityYou still install Trinity itself once (the curl line above — there's no PyPI package) — the plugin's launcher finds it. No Stop-hooks / review-gate: the plugin only adds commands + the MCP server, so it never gates your responses or runs away with your quota. Details: plugins/trinity-local/README.md.
Not comfortable in a terminal? Paste that one line into Claude Code — it runs inside your terminal and in the Claude Desktop app — and let Claude run the install for you. That's the easiest path if you arrived via the Chrome extension and have never opened a shell.
Manual MCP config — if the bootstrap missed a harness, or you want to wire one by hand, that's exactly what trinity-local install-mcp writes. Substitute PYTHON with your interpreter (which python3, or the absolute path the bootstrap printed).
For Claude Code (~/.claude.json), Cursor (~/.cursor/mcp.json), Antigravity (~/.gemini/settings.json), and other JSON harnesses — merge into the top-level mcpServers object:
{
"mcpServers": {
"trinity-local": {
"command": "PYTHON",
"args": ["-m", "trinity_local.main", "--mcp"]
}
}
}For Codex CLI, append to ~/.codex/config.toml:
[mcp_servers.trinity-local]
command = "PYTHON"
args = ["-m", "trinity_local.main", "--mcp"]For Antigravity (agy CLI) — model selection happens inside agy itself, not via MCP. Run /model and pick your Gemini (e.g. Gemini 3.1 Pro); Trinity's launchpad reads the persisted selection from ~/.gemini/antigravity-cli/settings.json.
Then ask any of these agents: "Run a Trinity council on …" — the MCP tools appear inline. Free, local, MIT. The CLI (trinity-local status, trinity-local dream, etc.) is the engine; the MCP tools are the agent surface.
Requirements: Python 3.10+ and at least one of the claude / codex / agy CLIs authenticated — Trinity works with just one (chairman synthesis + your lens), gets stronger with two (real disagreement), full canonical council with three. Ollama / MLX models you've pulled locally are auto-discovered and join the routing pool as free council members (ollama:<model> / mlx:<model>) — no config edit, no extra MCP tools. To remove: trinity-local uninstall --yes.
How it works
Trinity reads the transcripts on your machine — CLI sessions on disk (Claude Code, Codex CLI, Antigravity), web chats the Chrome extension auto-captures locally (claude.ai, chatgpt.com, gemini.google.com), and any manual exports you've imported (claude.ai exports, ChatGPT exports, Gemini Takeout) — and distills the pattern in how you rephrase, push back, and decide into a taste lens. The chairman reads that lens on every council, so the synthesis comes back in your voice, not in the voice of a generic model.
Anthropic can't recommend ChatGPT. OpenAI can't recommend Claude. Google can't recommend either. The competitive constraint is structural, not technical. The labs that built the models you trust are commercially blocked from helping you use a competitor — so the cross-provider memory layer has to come from outside the labs. That's what Trinity is.
And — when a new model lands, score it against your taste
trinity-local eval-build # one-time: build from your rejection signal (~/.trinity/me/preference_acts.jsonl)
trinity-local eval-run --target claude # re-target whenever a new model lands (provider name: claude / codex / antigravity)
trinity-local eval-show # per-axis bars: REFRAME / COMPRESSION / REDIRECT / SHARPENINGWhen Claude 5 lands: "Claude provider scored 0.88 on my taste — beats last release by 0.05." A headline number no lab can produce — because only the layer above the labs sees your transcripts across all three.
Your lens, generated from your prompts.
trinity-local dream is the consolidation pass. Like sleep: it
reweights old facts in light of everything that's come in since,
resolves memories that contradicted each other, and connects
memories that were just sitting there with their neighbors — turning
a corpus of raw prompts into a hierarchical lens (identity → paired
tensions → subject basins → vocabulary) that the chairman reads top-down
on every council.
Traceability is non-negotiable. If it can't show its work, it
doesn't get to claim the thought. Every lens entry carries
tension_decisions — backreferences to the specific rejection pairs
that justify it. Open the launchpad's lens card and each claim links
back, clickable, to the model-said-vs-you-substituted moments it was
extracted from. No hidden inference, no "trust me." Inspect any claim;
walk the chain to the source.
The folder is the API. ~/.trinity/ is a CC0 JSON-Schema-validated
on-disk contract — memories/lens.md, memories/topics.json,
memories/vocabulary.md, core.md, scoreboard/picks.json. Any tool
(Aider / Cline / Continue / your own) can read or write through that
folder without going through Trinity's process. Schema in
docs/lens.md + docs/PREFERENCE_CORPUS_SPEC.md.
For tool builders
~/.trinity/ is the API surface. CC0, JSON-Schema-validated, adoptable
by Aider / Cline / Continue / anything else. Schema:
docs/PREFERENCE_CORPUS_SPEC.md.
Privacy by default
Trinity uploads nothing. Your transcripts, prompt history, and lens stay on disk. (Councils dispatch your question to the labs through the CLIs you already authenticated — the same path as typing it there yourself — never to a Trinity server.)
Anonymous categorical telemetry is on by default (Google Analytics 4). Two payloads, both categorical/numeric only: the per-council event (
task_type,winner,member_count,mode) and, from the launchpad, an anonymous provider win-rate snapshot (per-provider Elo / wins / total games — no task text). No prompt content, no lens text, no user_substitute strings ever. Disable any time withtrinity-local telemetry-disable; the data immediately stops flowing. Sending also requires GA4 credentials that the public build does not ship — withoutTRINITY_GA4_MEASUREMENT_ID+TRINITY_GA4_API_SECRETset, both the CLI and the launchpad silently no-op (nothing leaves your machine).No hosted controller, no per-call billing. Trinity dispatches via the CLIs you already use — nothing to meter, nothing to bill. The taste signal you build stays yours.
Objections (the ones I had)
"I don't want to learn another UI — I just use Claude Code."
You don't. Trinity is an MCP server inside your existing harness (Claude Code, Codex CLI, Antigravity, Cursor). /trinity walks installation in one step. After that, your existing UI is the UI.
"I don't want a daemon running on my machine."
Trinity isn't a daemon. The MCP server spawns when your harness opens, exits when it closes. ~62 MB resident while connected. lsof -i | grep LISTEN shows nothing — no listening port, no background process.
"I don't want my data sent to a server."
Transcripts never leave your machine. Council fan-out goes from your laptop directly to the CLIs you already authenticated. No hosted controller. Anonymous categorical telemetry (the four discrete labels above — no prompt content) is on by default to close the feedback loop; turn it off any time with trinity-local telemetry-disable.
"I want my subscriptions actually used."
Trinity dispatches via your existing claude / codex / agy CLIs — using the tokens you've already paid for. Every council uses what you have. No new bill.
"I'm tired of copy-pasting between Claude / GPT / Gemini tabs." That's the whole point. Every council runs all three in parallel from one prompt.
"I want to know if a new model release is actually better for me."
trinity-local eval-run --target <provider> (claude / codex / antigravity — the provider you want to benchmark; the underlying model is whatever that provider currently ships) scores it against the prompts you've already rejected — your actual taste, not a synthetic benchmark.
"I want the right model picked for the right task, automatically."
Every council teaches Trinity which model wins for which kind of question — automatically. The chairman's pick (lens-governed) is the signal; compute_personal_routing_table() aggregates it per task type. No human rating step. The launchpad surfaces the personal routing table; a deterministic pass places each council into its nearest lens basin and tallies the chairman-winner there (scoreboard/picks.json), so the next call routes on the model that's been winning your questions in that basin.
"How is this different from Anthropic's Dreaming?"
Same verb, different domain. Dreaming consolidates Claude sessions inside Anthropic's runtime — single-lab. Trinity dreams across the labs: ~/.claude/ + ~/.codex/ + ~/.gemini/ + claude.ai + ChatGPT + Gemini exports, on your machine. Even if Anthropic moves Dreaming server-side tomorrow, the server-side version still can't see OpenAI or Google transcripts — the labs are commercially prevented from reading each other. Cross-lab dreaming has to come from outside the labs, by definition. Dreaming makes Claude smarter at being Claude; Trinity learns which model wins which kind of YOUR question.
"Won't Anthropic just build cross-provider memory themselves?" They literally can't. Anthropic can't recommend ChatGPT; OpenAI can't recommend Claude; Google can't recommend either. The competitive constraint is structural, not technical. The cross-provider layer has to come from outside the labs — that's the whole point.
"Who's behind this? Why trust a random repo with my transcripts?"
Single developer, MIT, public source — small enough to audit in an evening. Trinity reads transcripts on your machine — written there either by your CLI sessions or by the Chrome extension's local capture host. Nothing leaves the machine. If you stop using it, ~/.trinity/ is plain JSON you can cat | jq without us.
"What happens if you abandon this project?"
The folder is the API. ~/.trinity/memories/lens.md is Markdown; council outcomes are human-readable JSON; the schema is at docs/PREFERENCE_CORPUS_SPEC.md. Your taste capture survives Trinity disappearing.
How is this different from [X]
Trinity Local | LMArena | promptfoo / Claude evals | OpenRouter | Karpathy LLM Council | |
Data source | Your own prompts | Crowd votes | Test fixtures | n/a (router) | Yours, but no persistence |
Cost basis | Your own subscriptions | Hosted | Per-call API | Per-call API | Per-call API |
Output | Structured Routing JSON + your | Win-rate ranking | Pass/fail per case | Cheapest route | Three answers + summary |
Privacy | Corpus stays on disk | n/a | n/a | Prompts route through their servers | Hosted |
Personalization | Personal routing table improves with use | One global ranking | Per-test-suite | None | None |
Personal benchmarks |
| Synthetic prompts | Static fixtures | n/a | n/a |
Council reads through your lens | Chairman synthesizes in your voice — distilled from past transcripts | n/a | n/a | n/a | Generic synthesis |
Shareable artifact |
| Leaderboard link | Eval report | n/a | Per-prompt summary |
If you want "which model is best in general," LMArena. If you want "which model handles this codebase / this voice / this trade-off you keep making," Trinity.
Demo
A real council outcome — verbatim from ~/.trinity/council_outcomes/<id>.json after the council ran "name the single biggest remaining launch risk" against itself:
{
"winner": "claude",
"runner_up": "codex",
"confidence": "high",
"agreed_claims": [
"The #1 risk is the /trinity skill not installing by the pip path.",
"install-mcp must drop SKILL.md into ~/.claude/skills/trinity/ before ship."
],
"disagreed_claims": [{
"claim": "Post-validator must check for skill cache-staleness.",
"providers_for": ["claude"],
"providers_against": ["antigravity", "codex"],
"why_matters": "install-mcp can succeed on disk but /trinity stays invisible to the open Claude Code session."
}],
"routing_lesson": "For launch_readiness_decision, prefer claude — surfaces second-order failure modes."
}That's the payoff: agreed claims you can lean on, disagreed claims with the why, and a routing lesson that makes the next council pick the right chairman automatically. Trinity ran this against itself to ratify what would ship.
Architecture
Chairman synthesizes member outputs into structured Routing JSON; members run in
parallel (or chain mode for sequential refinement); lens-discovery is a 5-stage
pipeline (Stage 0 turn-pair rejections + Stages 1-4 basins→decisions→pair-mining→post-filter) ratifying tensions across ≥3 topical basins.
Want the full picture? docs/how-trinity-works.md walks the pipeline end-to-end — transcripts → embeddings → dream → lens → runtime. Wire diagram + design rationale in docs/architecture.md.
What's next
Current repo state: v1.7 line, exact package v<!-- canonical:version -->1.7.389<!-- /canonical -->. The shipped surface is MCP-first: lens, council, dream, status, and install are the advertised CLI verbs; the older lens-build / council-launch names remain as compatibility aliases for launchpad dispatch and existing scripts. The most recent arc collapsed routing into the lens — consolidate now places each council into its nearest lens basin and tallies the chairman-winner there, so the learned routing can never drift into a stale embedding space. Earlier work tightened the launch path: extension auto-wiring, schema migrations, real ModernBERT embeddings, TF-IDF abstain-gates for semantic flows, corpus-purity guards, personal eval integrity, no-PII telemetry gates, and install-wrapper Python fallback.
Help
Command | What it does |
| Health + scoreboard + recent councils (absorbed |
| Run a council from the terminal |
| Build your lens from prompt history |
| Rebuild the broader local memory layer |
| Install or repair MCP / extension wiring |
| Render your strongest lens as a PNG |
| Open the launchpad |
| Mobile-safe review links |
| Full command list |
License
MIT — see LICENSE.
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/keepwhatworks/trinity'
If you have feedback or need assistance with the MCP directory API, please join our Discord server