CodeCortex
Persistent codebase knowledge layer for AI agents. Pre-builds architecture, dependencies, coupling, and risk knowledge so agents skip the cold start and go straight to the right files.
The Problem
Every AI coding session starts with exploration — grepping, reading wrong files, re-discovering architecture. On a 6,000-file codebase, an agent makes 37 tool calls and burns 79K tokens just to understand what's where. And it still can't tell you which files are dangerous to edit or which files secretly depend on each other.
The data backs this up:
AI agents increase defect risk by 30% on unfamiliar code (CodeScene + Lund University, 2025)
Code churn grew 2.5x in the AI era (GitClear, 211M lines analyzed)
The Solution
CodeCortex eliminates the cold start. It pre-builds codebase knowledge — architecture, dependencies, risk areas, hidden coupling — and injects it directly into your agent's context (CLAUDE.md, .cursorrules, etc.) so agents have project knowledge from the first prompt.
Not a middleware. Not a proxy. Just knowledge your agent loads on day one.
Tested on a real 6,400-file codebase (143K symbols, 96 modules):
Without CodeCortex | With CodeCortex | |
Tool calls | 37 | 15 (2.5x fewer) |
Total tokens | 79K | 43K (~50% fewer) |
Answer quality | 23/25 | 23/25 (same) |
Hidden dependencies found | No | Yes |
What makes it unique
Three capabilities no other tool provides:
Temporal coupling — Files that always change together but have zero imports between them. You can read every line and never discover this. Only git co-change analysis reveals it.
Risk scores — File X has been bug-fixed 7 times, has 6 hidden dependencies, and co-changes with 3 other files. Risk score: 35. You can't learn this from reading code.
Inline context injection — Project knowledge is injected directly into CLAUDE.md, .cursorrules, and other agent config files with architecture, risk map, and editing directives. Agents use it without any setup.
Example from a real codebase:
schema.help.tsandschema.labels.tsco-changed in 12/14 commits (86%) with zero imports between themWithout this knowledge, an AI editing one file would produce a bug 86% of the time
Quick Start
Requires Node 20 or 22. Node 24 is not yet supported (tree-sitter native bindings need an upstream update).
# Install (--legacy-peer-deps needed for tree-sitter peer dep mismatches)
npm install -g codecortex-ai --legacy-peer-deps
# Initialize knowledge for your project
cd /path/to/your-project
codecortex init
# Regenerate inline context in CLAUDE.md and agent config files
codecortex injectConnect to Claude Code
CLI (recommended):
claude mcp add codecortex -- codecortex serveOr add to MCP config manually:
{
"mcpServers": {
"codecortex": {
"command": "codecortex",
"args": ["serve"],
"cwd": "/path/to/your-project"
}
}
}Connect to Cursor
Add to .cursor/mcp.json:
{
"mcpServers": {
"codecortex": {
"command": "codecortex",
"args": ["serve"],
"cwd": "/path/to/your-project"
}
}
}What Gets Generated
All knowledge lives in .codecortex/ as flat files in your repo, plus inline context is injected into agent config files:
.codecortex/
cortex.yaml # project manifest
constitution.md # project overview for agents
overview.md # module map + entry points
graph.json # dependency graph (imports, calls, modules)
symbols.json # full symbol index (functions, classes, types...)
temporal.json # git coupling, hotspots, bug history
hotspots.md # risk-ranked files (static, always available)
AGENT.md # tool usage guide for AI agents
modules/*.md # per-module structural analysis
decisions/*.md # architectural decision records
sessions/*.md # session change logs
patterns.md # coding patterns and conventions
CLAUDE.md # ← inline context injected here
.cursorrules # ← and here (if exists)
.windsurfrules # ← and here (if exists)Six Knowledge Layers
Layer | What | File |
1. Structural | Modules, deps, symbols, entry points |
|
2. Semantic | What each module does, data flow, gotchas |
|
3. Temporal | Git behavioral fingerprint — coupling, hotspots, bug history |
|
4. Decisions | Why things are built this way |
|
5. Patterns | How code is written here |
|
6. Sessions | What changed between sessions |
|
MCP Tools (5)
Five focused tools that provide capabilities agents can't get from reading code:
Tool | Description |
| Architecture, modules, risk map. Call this first. |
| Import/export graph filtered by module or file. |
| Precise symbol lookup with kind and file path filters. |
| Files that must change together. Hidden dependencies flagged. |
| Pre-edit risk: co-change warnings, hidden deps, bug history. Always call before editing. |
MCP Resources (3)
Static knowledge available without tool calls:
Resource | Description |
| Full project constitution |
| Risk-ranked file table |
| Per-module documentation |
MCP Prompts (2)
Prompt | Description |
| Returns constitution + latest session context |
| Takes file paths, returns risk/coupling/bug briefing |
CLI Commands
Command | Description |
| Discover project + extract symbols + analyze git history |
| Start MCP server (stdio transport) |
| Re-extract changed files, update affected modules |
| Regenerate inline context in CLAUDE.md and agent config files |
| Show knowledge freshness, stale modules, symbol counts |
| Browse and filter the symbol index |
| Search across symbols, file paths, and docs |
| List modules or deep-dive into a specific module |
| Show files ranked by risk: churn + coupling + bug history |
| Manage git hooks for auto-updating knowledge |
| Check for and install the latest version |
How It Works
Hybrid extraction: tree-sitter native N-API for structure (symbols, imports, calls across 27 languages) + host LLM for semantics (what modules do, why they're built that way). Zero extra API keys.
Inline context injection: After analysis, CodeCortex injects a rich knowledge section directly into CLAUDE.md and other agent config files. This includes architecture overview, risk map with coupled file names, and editing directives — so agents have project context from the first prompt without needing MCP.
Git hooks keep knowledge fresh — codecortex update runs automatically on every commit, re-extracting changed files and updating temporal analysis.
Size-adaptive responses — CodeCortex classifies your project (micro → extra-large) and adjusts response caps accordingly. A 23-file project gets full detail. A 6,400-file project gets intelligent summaries. Every MCP tool response stays under 10K chars.
Supported Languages (27)
Category | Languages |
Web | TypeScript, TSX, JavaScript |
Systems | C, C++, Objective-C, Rust, Zig, Go |
JVM | Java, Kotlin, Scala |
.NET | C# |
Mobile | Swift, Dart |
Scripting | Python, Ruby, PHP, Lua, Bash, Elixir |
Functional | OCaml, Elm, Emacs Lisp |
Other | Solidity, Vue, CodeQL |
Tech Stack
TypeScript ESM, Node.js 20+
tree-sitter(native N-API) + 27 language grammar packages@modelcontextprotocol/sdk- MCP servercommander- CLIsimple-git- git integrationyaml,zod,glob
License
MIT
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.