Skip to main content
Glama

CodeCortex

Persistent codebase knowledge layer for AI agents. Pre-builds architecture, dependencies, coupling, and risk knowledge so agents skip the cold start and go straight to the right files.

npm version npm downloads License: MIT

Website · npm · GitHub

The Problem

Every AI coding session starts with exploration — grepping, reading wrong files, re-discovering architecture. On a 6,000-file codebase, an agent makes 37 tool calls and burns 79K tokens just to understand what's where. And it still can't tell you which files are dangerous to edit or which files secretly depend on each other.

The data backs this up:

The Solution

CodeCortex eliminates the cold start. It pre-builds codebase knowledge — architecture, dependencies, risk areas, hidden coupling — and injects it directly into your agent's context (CLAUDE.md, .cursorrules, etc.) so agents have project knowledge from the first prompt.

Not a middleware. Not a proxy. Just knowledge your agent loads on day one.

Tested on a real 6,400-file codebase (143K symbols, 96 modules):

Without CodeCortex

With CodeCortex

Tool calls

37

15 (2.5x fewer)

Total tokens

79K

43K (~50% fewer)

Answer quality

23/25

23/25 (same)

Hidden dependencies found

No

Yes

What makes it unique

Three capabilities no other tool provides:

  1. Temporal coupling — Files that always change together but have zero imports between them. You can read every line and never discover this. Only git co-change analysis reveals it.

  2. Risk scores — File X has been bug-fixed 7 times, has 6 hidden dependencies, and co-changes with 3 other files. Risk score: 35. You can't learn this from reading code.

  3. Inline context injection — Project knowledge is injected directly into CLAUDE.md, .cursorrules, and other agent config files with architecture, risk map, and editing directives. Agents use it without any setup.

Example from a real codebase:

  • schema.help.ts and schema.labels.ts co-changed in 12/14 commits (86%) with zero imports between them

  • Without this knowledge, an AI editing one file would produce a bug 86% of the time

Quick Start

Requires Node 20 or 22. Node 24 is not yet supported (tree-sitter native bindings need an upstream update).

# Install (--legacy-peer-deps needed for tree-sitter peer dep mismatches)
npm install -g codecortex-ai --legacy-peer-deps

# Initialize knowledge for your project
cd /path/to/your-project
codecortex init

# Regenerate inline context in CLAUDE.md and agent config files
codecortex inject

Connect to Claude Code

CLI (recommended):

claude mcp add codecortex -- codecortex serve

Or add to MCP config manually:

{
  "mcpServers": {
    "codecortex": {
      "command": "codecortex",
      "args": ["serve"],
      "cwd": "/path/to/your-project"
    }
  }
}

Connect to Cursor

Add to .cursor/mcp.json:

{
  "mcpServers": {
    "codecortex": {
      "command": "codecortex",
      "args": ["serve"],
      "cwd": "/path/to/your-project"
    }
  }
}

What Gets Generated

All knowledge lives in .codecortex/ as flat files in your repo, plus inline context is injected into agent config files:

.codecortex/
  cortex.yaml          # project manifest
  constitution.md      # project overview for agents
  overview.md          # module map + entry points
  graph.json           # dependency graph (imports, calls, modules)
  symbols.json         # full symbol index (functions, classes, types...)
  temporal.json        # git coupling, hotspots, bug history
  hotspots.md          # risk-ranked files (static, always available)
  AGENT.md             # tool usage guide for AI agents
  modules/*.md         # per-module structural analysis
  decisions/*.md       # architectural decision records
  sessions/*.md        # session change logs
  patterns.md          # coding patterns and conventions

CLAUDE.md              # ← inline context injected here
.cursorrules           # ← and here (if exists)
.windsurfrules         # ← and here (if exists)

Six Knowledge Layers

Layer

What

File

1. Structural

Modules, deps, symbols, entry points

graph.json + symbols.json

2. Semantic

What each module does, data flow, gotchas

modules/*.md

3. Temporal

Git behavioral fingerprint — coupling, hotspots, bug history

temporal.json

4. Decisions

Why things are built this way

decisions/*.md

5. Patterns

How code is written here

patterns.md

6. Sessions

What changed between sessions

sessions/*.md

MCP Tools (5)

Five focused tools that provide capabilities agents can't get from reading code:

Tool

Description

get_project_overview

Architecture, modules, risk map. Call this first.

get_dependency_graph

Import/export graph filtered by module or file.

lookup_symbol

Precise symbol lookup with kind and file path filters.

get_change_coupling

Files that must change together. Hidden dependencies flagged.

get_edit_briefing

Pre-edit risk: co-change warnings, hidden deps, bug history. Always call before editing.

MCP Resources (3)

Static knowledge available without tool calls:

Resource

Description

codecortex://project/overview

Full project constitution

codecortex://project/hotspots

Risk-ranked file table

codecortex://module/{name}

Per-module documentation

MCP Prompts (2)

Prompt

Description

start_session

Returns constitution + latest session context

before_editing

Takes file paths, returns risk/coupling/bug briefing

CLI Commands

Command

Description

codecortex init

Discover project + extract symbols + analyze git history

codecortex serve

Start MCP server (stdio transport)

codecortex update

Re-extract changed files, update affected modules

codecortex inject

Regenerate inline context in CLAUDE.md and agent config files

codecortex status

Show knowledge freshness, stale modules, symbol counts

codecortex symbols [query]

Browse and filter the symbol index

codecortex search <query>

Search across symbols, file paths, and docs

codecortex modules [name]

List modules or deep-dive into a specific module

codecortex hotspots

Show files ranked by risk: churn + coupling + bug history

codecortex hook install|uninstall|status

Manage git hooks for auto-updating knowledge

codecortex upgrade

Check for and install the latest version

How It Works

Hybrid extraction: tree-sitter native N-API for structure (symbols, imports, calls across 27 languages) + host LLM for semantics (what modules do, why they're built that way). Zero extra API keys.

Inline context injection: After analysis, CodeCortex injects a rich knowledge section directly into CLAUDE.md and other agent config files. This includes architecture overview, risk map with coupled file names, and editing directives — so agents have project context from the first prompt without needing MCP.

Git hooks keep knowledge fresh — codecortex update runs automatically on every commit, re-extracting changed files and updating temporal analysis.

Size-adaptive responses — CodeCortex classifies your project (micro → extra-large) and adjusts response caps accordingly. A 23-file project gets full detail. A 6,400-file project gets intelligent summaries. Every MCP tool response stays under 10K chars.

Supported Languages (27)

Category

Languages

Web

TypeScript, TSX, JavaScript

Systems

C, C++, Objective-C, Rust, Zig, Go

JVM

Java, Kotlin, Scala

.NET

C#

Mobile

Swift, Dart

Scripting

Python, Ruby, PHP, Lua, Bash, Elixir

Functional

OCaml, Elm, Emacs Lisp

Other

Solidity, Vue, CodeQL

Tech Stack

  • TypeScript ESM, Node.js 20+

  • tree-sitter (native N-API) + 27 language grammar packages

  • @modelcontextprotocol/sdk - MCP server

  • commander - CLI

  • simple-git - git integration

  • yaml, zod, glob

License

MIT

-
security - not tested
-
license - not tested
-
quality - not tested

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/rushikeshmore/codecortex'

If you have feedback or need assistance with the MCP directory API, please join our Discord server