Skip to main content
Glama
bitmaster162

continuityos

by bitmaster162

ContinuityOS

πŸ›‘οΈ ContinuityOS β€” AI Agent Governance Gateway

No dangerous tool runs unless ContinuityOS approves it. A local-first, MCP-native hard-boundary that AI coding agents (Claude Code, Cursor, Codex CLI) must pass through: every risky shell/file/git action gets a preflight decision β€” ALLOW Β· WARN Β· HOLD Β· DENY Β· REQUIRE_CONFIRMATION Β· DRY_RUN_ONLY β€” with reasons, an append-only tamper-evident audit ledger, and a rollback plan. Apache-2.0.

continuity run shell -- rm -rf /     # β›” BLOCKED β€” command was NOT executed
continuity run shell -- npm test     # βœ“ ALLOW β€” runs

ContinuityBench v0: 100% decision accuracy, 9/9 dangerous actions stopped (vs 0/9 with no gateway). What makes it smarter than a static policy engine: it decides with continuity context (your canon/rules/state), not just regex. See BUILD_GATE_STATUS.md.

The memory + continuity layers below are the context engine that powers those decisions.


Durable memory + continuity layer for AI agents and humans. Local-first, zero external services, Apache-2.0.

Not just a vector store β€” ContinuityOS keeps the thread between sessions and between versions of you and the model: memory (hybrid recall) + continuity (canon, frontiers, loops, checkpoints, anti-drift doctor, handoff) + a multi-agent council (many agents + you on one memory, authority levels & roles) + a digital twin (a behavioral model built from your own memory β€” the human↔AI co-evolution / dyad layer) + an operator control plane (correct, redact, rollback, export).

Your Claude / ChatGPT / agent forgets everything between sessions. ContinuityOS is a small local memory layer that stores what matters β€” who you are, your projects, your rules, decisions you've made β€” and gives it back when it's relevant. It recalls both structurally (folder-like namespaces + keyword search) and semantically (vector similarity), so the right memory surfaces whether you match the words or just the meaning.

Nothing leaves your machine. One SQLite file. No cloud, no account, no telemetry.


Related MCP server: Logica Context

Why

  • Agents forget. Every new session starts cold. ContinuityOS persists context across sessions and tools.

  • Hybrid recall. Keyword-only memory misses paraphrases; pure-vector memory misses exact facts and structure. ContinuityOS blends both.

  • Structure like folders. Memories live in namespaces β€” identity, projects, rules, facts, events, notes (or your own) β€” so recall can be scoped and a human can browse it.

  • For agents and humans. Use it from your code, from the CLI, from an MCP-capable client (Claude Desktop / Claude Code), or over a tiny HTTP API.

  • Local-first & private. Core is stdlib-only β€” no required dependencies, no services. Drop-in to anything.


Install

pip install continuityos          # core (stdlib-only)
# optional, for production-grade embeddings:
pip install "continuityos[embeddings]"

Requires Python 3.10+.


Quick start

From the CLI

cos remember "Robert prefers Apache-2.0 licenses" -n rules -t license
cos remember "ContinuityOS = hybrid memory: FTS + vectors" -n projects
cos recall  "which license should I pick?"
# 0.54 [rules] Robert prefers Apache-2.0 licenses  (semantic 0.22 + keyword)
cos namespaces

From Python

from continuityos import Memory

m = Memory("memory.db")
m.remember("The grid lab K=0.04 cohort led at +$1405 / 3 days", namespace="facts", tags=["trading"])

for hit in m.recall("best grid setup", k=3):
    print(hit.score, hit.namespace, hit.text)

# inject straight into an agent prompt:
print(m.context("what do I know about grid trading?"))

As an MCP server (Claude Desktop / Claude Code)

ContinuityOS ships an MCP stdio server so an agent can remember and recall on its own. Add to your MCP client config:

{
  "mcpServers": {
    "continuityos": {
      "command": "cos",
      "args": ["--db", "~/.continuityos/memory.db", "serve"]
    }
  }
}

Tools exposed: remember, recall, context, forget, list_namespaces, checkpoint, handoff, doctor, set_frontier, predict, alignment, preflight_action β€” 12 tools. Now the agent pulls relevant memory automatically before answering β€” and writes new facts back as it learns it.

Recommended: use the cross-platform bridge instead of cos serve:

{
  "mcpServers": {
    "continuityos": {
      "command": "python",
      "args": ["/path/to/mcp_bridge.py"]
    }
  }
}

See docs/MCP_INTEGRATION.md for Hermes, Claude Desktop, and Cursor setup.

Over HTTP (optional)

cos api --port 8077
curl -s "localhost:8077/recall?q=license&k=3"
curl -s -XPOST localhost:8077/remember -d '{"text":"hello","namespace":"notes"}'

The default embedder is offline & dependency-free. For real semantic quality (synonyms, paraphrases), switch in one line:

from continuityos import Memory
from continuityos.embedders import FastEmbedEmbedder   # pip install "continuityos[fast]"
m = Memory("memory.db", embedder=FastEmbedEmbedder())  # bge-small, ONNX, no torch

Benchmark (see BENCHMARKS.md): recall@5 0.50 β†’ 1.00, MRR 0.38 β†’ 0.58. Real LoCoMo harness ready in bench/locomo_bench.py.

With Docker

docker compose up -d        # HTTP API on :8077, memory persisted in ./cos-data

More than memory β€” the continuity layer

A chat is a terminal, not memory. ContinuityOS persists the operating state that keeps work coherent across sessions:

  • Canon β€” slow, non-negotiable truths (who you are, rules you don't break).

  • Frontiers β€” 1 trunk + 1 cash + 1 lab focus discipline; classify every idea.

  • Open loops β€” what's still unfinished, bounded so it can't sprawl.

  • Checkpoints β€” every session ends with delta + next irreversible action + proof.

  • Doctor β€” an anti-drift check: is a cash frontier set? loops bounded? checkpoint fresh? proof attached?

  • Handoff pack β€” one block (canon + frontiers + loops + last checkpoint) to resume in a new session or hand to another agent.

cos frontier trunk continuityos
cos frontier cash  inner-circle
cos loop "ship v0.2 to GitHub"
cos checkpoint --summary "built continuity layer" --next "update sites" --proof continuity.py
cos doctor       # βœ… healthy 5/5  (or flags drift)
cos handoff      # paste this into the next session
from continuityos import Continuity
c = Continuity(db="memory.db")
c.add_canon("Proof beats explanation. Closure beats branching.")
c.set_frontier("cash", "inner-circle")
c.checkpoint(summary="...", next_action="...", proof="path/to/artifact")
print(c.doctor())     # anti-drift report
print(c.handoff())    # resume-context block

Over MCP the agent gets checkpoint, handoff, doctor, set_frontier tools too β€” so it maintains its own continuity, not just its recall.


How it works

            remember(text, namespace, tags)
                        β”‚
                        β–Ό
        β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
        β”‚            Store               β”‚   one local SQLite file
        β”‚  items  +  FTS5  +  vectors    β”‚
        β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                        β–²
          recall(query) β”‚  HYBRID rank
            β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
   structural / keyword       semantic / vector
   (FTS5 + namespace)         (cosine over embeddings)
            β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                  blended score β†’ top-k
  • Structural layer β€” namespace (folder-like) + tags + FTS5 full-text index.

  • Semantic layer β€” each memory is embedded to an L2-normalized vector; recall ranks by cosine similarity.

  • Hybrid score β€” semantic_weight Β· semantic + (1 βˆ’ semantic_weight) Β· keyword (tunable; default 0.6).

  • Embeddings are pluggable β€” the default HashingEmbedder is deterministic and fully offline (great for privacy and tests). For best semantic quality, pass any str β†’ list[float] callable (e.g. a sentence-transformers model):

    from sentence_transformers import SentenceTransformer
    enc = SentenceTransformer("all-MiniLM-L6-v2")
    m = Memory("memory.db", embedder=lambda t: enc.encode(t, normalize_embeddings=True).tolist())

Privacy

ContinuityOS never sends your data anywhere. Memory is a single SQLite file on your disk. .gitignore is pre-configured to keep *.db, data/, and takeout/ out of version control by construction.


Standards & competitive position

The 2026 consensus is that guardrails belong at the gateway, not embedded in application code β€” a control point that intercepts every tool invocation, scores its risk, and approves or blocks before execution. ContinuityOS is exactly that control point, and maps onto the frameworks enterprises are now audited against:

  • OWASP LLM Top 10 β€” preflight classifies and gates the agentic risks directly: prompt-injection-driven destructive commands, tool poisoning (D3 schema/forbidden-pattern checks), excessive agency (SAP capability passports), and missing audit (append-only ledger).

  • NIST AI RMF / EU AI Act (high-risk obligations, in force Aug 2026) β€” the tamper-evident decision ledger + rollback plan provide the record-level traceability and human-oversight hooks these frameworks require. Every decision is logged with reasons, severity, and a restore command.

  • MCP-native β€” runs as an MCP server; the same preflight governs MCP tool calls, the layer competitors (Cisco AI Defense, Lasso) now target.

What no one else has. The crowded 2026 field (Galileo Agent Control, Maxim Bifrost, Palo Alto Prisma AIRS, Lasso, Defend AI) enforces generic policy β€” regex, ML classifiers, org rules. ContinuityOS decides with your continuity context: the same engine that remembers your canon and non-negotiable rules uses them to judge each action (_canon_check). That makes it the only gateway whose verdicts are personalized to the operator, not just the org. Plus two things detection-only tools skip: an instant local rollback module (snapshot β†’ continuity rollback <id>) and sovereign-local execution (zero data leaves the disk β€” no SaaS egress, which is itself the top enterprise blocker: only 14.4% of agents reached production with full security sign-off in 2026).

Honest scope: rollback covers local files only; it cannot undo irreversible external side effects (network, prod, third-party APIs). The gateway raises the floor β€” it is not a guarantee.

Two-tier memory & cost-aware routing

The strongest 2026 agents don't win on a bigger context window β€” they win on how they handle the finiteness of context. ContinuityOS implements the two-tier pattern Anthropic and OpenAI both converge on:

  • Session memory β€” the auto-compactible state of the current run (goal, live hypotheses, found IDs, tool outcomes, unresolved blockers). Carried forward instead of re-derived each turn.

  • Long-term memory β€” durable lessons, stable user preferences, recurring patterns, anti-patterns, domain facts. One lesson per file; update the existing note, don't spawn duplicates β€” the same discipline this repo's memory files follow.

context(query, k, max_tokens=…, compact=…) packs the most relevant long-term memories until a token budget is hit, so recall stays cheap, and its output order is deterministic β€” which matters for prompt-cache stability.

Cache-friendly memory rules (preserve the prompt-cache hash; cache miss = paying full price every turn):

  1. Never put volatile values (datetime.now(), random IDs, per-turn counters) in the system prompt or any cached prefix β€” they reset the cache every call. Put them in the body of the last user message.

  2. Keep tool definitions and the memory block in a stable, sorted order so the cached prefix is byte-identical across turns (compact=True + deterministic packing does this).

  3. The cache threshold on Opus-4.8 is ~1024 tokens β€” keep the cached prefix above it to actually benefit.

  4. To change instructions mid-run without busting the cache, inject a role:"system" message into the history rather than editing the cached system prompt.

Cost-aware routing. estimate_cost(text, model_id, output_tokens) prices a context block against a built-in MODEL_REGISTRY (Fable 5, Mythos 5, Opus 4.8, Haiku 4.5, GPT-5.5, Gemini 3.1 Pro / 3.5 Flash, Grok 4.3, DeepSeek V4 Pro β€” mid-2026 pricing). Same block costs ~28Γ— more on Fable 5 than DeepSeek V4 Pro, so callers can route commodity β†’ interactive β†’ high-stakes tiers instead of always paying frontier price for trivial work.

Status

v0.7.0 β€” 6 layers, 12 MCP tools, 18/18 tests, full audit passed. Unified core, all tested (FastEmbed-accelerated recall, session rituals boot/close/compress, recall benchmark in bench/): L1 Memory (hybrid FTS+vector, WAL mode) Β· L2 Continuity (canon/frontiers/loops/checkpoints/doctor/handoff) Β· L3 Council (multi-agent, authority levels + roles) Β· L4 Twin (digital twin: profile/predict/alignment β€” now in CLI too) Β· L5 Control Plane (correct/redact/rollback/export) Β· L6 Autopoiesis (self-maintenance doctor). CLI (cos + continuity), MCP server (12 tools, cross-platform mcp_bridge.py), HTTP API, Docker. CI via GitHub Actions.

Audit fixes applied: WAL crash resilience Β· FastEmbed default + auto-fallback Β· Git-backed DB with daily backup cron Β· Gate enforcement via Hermes shell hooks Β· CANONICAL_TRUTH.md (3-store hierarchy) Β· predict/alignment in CLI Β· docs/ + quickstart example.

Roadmap: incremental vector index for large stores, optional reranking, import adapters (chat exports, notes), web memory browser.

License

Apache-2.0. See LICENSE.

A
license - permissive license
-
quality - not tested
B
maintenance

Maintenance

–Maintainers
–Response time
–Release cycle
–Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/bitmaster162/continuityos'

If you have feedback or need assistance with the MCP directory API, please join our Discord server