N3MO
A structural code intelligence layer that transforms source code into a queryable knowledge graph for search, impact analysis, and AI-powered development.
Parse once. Query forever. Know exactly what breaks before it does.
π Licensed under PolyForm Noncommercial 1.0.0 β Source available for noncommercial use. β’ Need commercial use? Get a commercial license β
What is N3MO β’ Benchmarks β’ Architecture β’ Installation β’ MCP β’ Usage β’ Roadmap
π― What is N3MO?
N3MO is a symbol-centric code intelligence layer. Instead of scanning raw text, it parses your source code's ASTs, maps call graphs, and models dependencies in a queryable relational database β deterministically, with zero LLM calls at index time.
For engineering leaders and teams, N3MO acts as a structural insurance policy for your codebases.
π‘ Why N3MO?
π‘οΈ Eliminate Regression Risks β Utility functions are rarely refactored because developers fear unknown side effects. N3MO maps the transitive blast radius of any symbol to arbitrary depth, showing you exactly what will break before you make the edit.
ποΈ Rapid Developer Onboarding β Instead of senior engineers spending hours explaining codebase flow to new hires, developers run one command to visualize complex call chains and parent-child dependencies interactively.
π€ AI-Agent Ready Infrastructure β Modern LLM agents (Cursor, Claude Desktop) are limited by context windows and text search. N3MO's native MCP server lets agents query the actual code graph, enabling fast, hallucination-free refactoring.
β‘ No Embeddings, No Drift β N3MO is pure static analysis: Tree-sitter AST parsing into PostgreSQL. There's no vector index to keep in sync, no embedding cost per repo, and no semantic-similarity guesswork β every edge in the graph is an exact, verifiable relationship.
π How N3MO Compares
Capability | Grep / Text Search | IDE "Find References" | N3MO Code Graph |
Analysis Basis | Substring matching | AST-based, direct refs only | Relational knowledge graph |
Transitive Traversal | β None | β Manual, one level at a time | β‘ Instant to arbitrary depth |
Blast Radius Mapping | β None | β Flat search-result list | π¨ Interactive visual orbit map |
CI/CD Integration | β None | β Bound to IDE runtime | βοΈ Dockerized CLI + CTE queries |
AI Agent Integration | β Injected file chunks | β οΈ Manual context copy | π€ Native MCP server |
Language Coverage | β Any text file | β οΈ Language-specific plugins | β 27 Tree-sitter grammars |
Indexing Method | N/A | N/A | β Deterministic AST parse β no embeddings, no LLM calls |
π οΈ The Core Problem N3MO Solves
β Without N3MO
β With N3MO
N3MO doesn't find text β it understands structure. It traces the actual call graph, not string matches.
Questions N3MO answers instantly:
Question | How | |
π | What functions and classes exist in this repo? | Full symbol index across 27 languages |
π― | Where is this symbol used β directly and transitively? | Recursive CTE traversal to arbitrary depth |
π₯ | What is the blast radius of changing this function? | Interactive orbit map with depth slider |
πΈοΈ | How do these components actually connect? | Call graph + parent-child hierarchy |
π€ | Can my AI agent understand this codebase structurally? | Native MCP server for Cursor / Claude |
Related MCP server: sourcebook
π Benchmarks
All benchmarks measured on Intel i5-13450HX, 24 GB RAM, NVMe SSD.
Django β Optimization History
Django is the primary benchmark target: 3,021 files, ~43K symbols, ~181K calls.
Django Index Time (minutes)
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
v0.3 Baseline ββββββββββββββββββββββββββββββββββββββββββββββ 23 min 1Γ
SPLIT_PART Fix ββββββββββββββββββββββ 11 min 2Γ
Batch Inserts βββββββββ 5 min 4.6Γ
+ Multiprocessing ββββ 2.5 min 9Γ π
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββOptimization | Index Time | Speedup | What Changed |
v0.3 baseline | 23 min | 1Γ | Per-symbol DB inserts, naive call resolution |
+ SPLIT_PART query fix | 11 min | 2Γ | Eliminated redundant string splitting in call resolution |
+ Batch inserts | 5 min | 4.6Γ | Symbols, imports, and calls batched per file (1 transaction) |
+ Multiprocessing | ~2.5 min | ~9Γ |
|
β All results are real measurements on the Django repository. Multiprocessing gains scale with core count.
TensorFlow β Enterprise-Scale Monorepo
Tested on TensorFlow β a 36,000-file, multi-language (C++/Python) monorepo.
Metric | Result |
Repo size (total files) | ~36,000 |
Files processed & indexed | 14,611 (after filtering tests, configs, and non-source files) |
Total symbols extracted | 79,523 |
Total call edges extracted | 480,851 |
Full index time (cold start) | 14.06 minutes |
Peak memory (Docker container) | 185 MB RAM |
CPU utilization | ~5% |
N3MO scales from a 3K-file pure-Python repo (Django) to a 36K-file multi-language enterprise monorepo (TensorFlow) β roughly a 5Γ larger indexing job at near-linear throughput, without significant resource overhead. Symbol/edge-per-file and incremental (warm) re-index numbers for TensorFlow are being finalized in the full benchmark report.
ScanCode Toolkit β Large Codebase
Tested on ScanCode Toolkit β ~600K lines of Python.
Metric | Result |
Lines of code | ~600,000 |
Full index time | ~3 minutes |
Processing mode | Single-threaded (v0.3) |
Incremental Re-Indexing
N3MO uses SHA-256 file hashing to skip unchanged files on subsequent runs.
Scenario | Time | Notes |
Full index (first run) | Baseline | All files parsed and inserted |
No changes (re-run) | < 1 second | Hash comparison only, zero DB writes |
1 file modified | < 2 seconds | Only the changed file is re-parsed and upserted |
These results are from the built-in benchmark script on a 20-file synthetic repository. Real-world incremental performance is proportional to the number of changed files, not the total repository size.
Query Performance
Impact analysis uses PostgreSQL recursive CTEs with cycle guards. Query times are independent of repository size β they depend only on the size of the result subgraph.
Query Type | Typical Latency |
Direct callers of a symbol | < 10 ms |
Full blast radius (depth β€ 5) | < 50 ms |
Complete graph traversal | < 200 ms |
Running the Benchmark
python benchmarks/benchmark_indexing.pyποΈ Architecture
Knowledge graph model
N3MO builds a symbol-centric knowledge graph stored in PostgreSQL:
graph TD
A["π Source Code"] -->|Tree-sitter| B["π³ AST Parser"]
B --> C["π Symbol Extractor"]
D["π Git Hooks"] -->|post-commit| A
C --> E[("ποΈ PostgreSQL<br/>Projects Β· Symbols Β· Calls<br/>Imports Β· Files")]
E --> F["π₯ Impact Analysis"]
E --> G["π Call Graph"]
E --> H["π Dependency Graph"]
F --> I["π¨ Visualizer"]
G --> I
H --> I
F --> J["π€ MCP Server"]
style A fill:#6c63ff,stroke:#4a3fbf,color:#fff
style B fill:#7c74ff,stroke:#4a3fbf,color:#fff
style C fill:#7c74ff,stroke:#4a3fbf,color:#fff
style D fill:#ffd93d,stroke:#d4b800,color:#1a202c
style E fill:#ff6b6b,stroke:#c53030,color:#fff,stroke-width:3px
style F fill:#45b7d1,stroke:#2c8ea8,color:#1a202c
style G fill:#45b7d1,stroke:#2c8ea8,color:#1a202c
style H fill:#45b7d1,stroke:#2c8ea8,color:#1a202c
style I fill:#9ae6b4,stroke:#2f855a,color:#1a202c
style J fill:#ffd93d,stroke:#d4b800,color:#1a202cSystem flow
sequenceDiagram
participant User as User / CI
participant CLI as N3MO CLI
participant DB as PostgreSQL (Docker)
participant Viz as Graph Visualizer
rect rgb(26, 27, 46)
Note over User, DB: Indexing Flow (Local CLI)
User->>CLI: n3mo index
CLI->>DB: Start PostgreSQL container (if not running)
CLI->>CLI: Walk file tree (SHA-256 hash checks)
CLI->>CLI: Parse AST (Tree-sitter, multiprocessing)
CLI->>DB: Batch insert symbols, calls, imports
CLI->>DB: Resolve imports & call links
DB-->>CLI: Success
CLI-->>User: Complete summary
end
rect rgb(26, 27, 46)
Note over User, Viz: Query & Visualization Flow
User->>CLI: n3mo impact "symbol" --graph
CLI->>DB: Recursive CTE traversal (depth & file filters)
DB-->>CLI: Blast radius subgraph
CLI->>Viz: Generate orbital vis.js HTML
CLI->>User: Launch local web server & open browser
endData model
erDiagram
PROJECT ||--o{ SYMBOL : contains
PROJECT ||--o{ CALL : tracks
PROJECT ||--o{ IMPORT : tracks
PROJECT ||--o{ FILE : indexes
SYMBOL ||--o{ CALL : "source of"
SYMBOL ||--o{ CALL : "resolved to"
SYMBOL ||--o{ SYMBOL : "parent of"
PROJECT {
uuid id PK
text name
text repo_url
timestamp created_at
}
SYMBOL {
uuid id PK
uuid project_id FK
text name
text file_path
text kind "function|class|method"
text signature
int start_line
int end_line
uuid parent_id FK
}
CALL {
uuid id PK
uuid project_id FK
uuid source_symbol_id FK
text call_name
int line_number
uuid resolved_symbol_id FK
}
IMPORT {
uuid id PK
uuid project_id FK
text file_path
text module
text name
text alias
uuid resolved_symbol_id FK
}
FILE {
uuid project_id FK
text file_path PK
text sha256
}β¨ Core Capabilities
Ingestion & Parsing
Multi-language support β 27 Tree-sitter grammars supported (dynamically loaded); actively benchmarked on 10 major languages including Python, JS/TS, Go, Java, and C/C++
Parallel AST ingestion β
ProcessPoolExecutordistributes CPU-bound parsing across all available coresIncremental re-indexing β SHA-256 file hashing skips unchanged files automatically
Idempotent operations β re-indexing updates existing data without duplication
Smart exclusions β case-insensitive directory filters and camelCase-aware filename checks prevent false positives (e.g. allows
contest.pywhile skippingtest_*.py)
Analysis & Querying
Symbol extraction β functions, classes, methods with full file path + line context
Hierarchical modeling β parent-child relationships (Module β Class β Method)
Call graph construction β who calls whom, resolved at ingestion time
Scope-aware resolution β class scope > local file > imports > qualified dot paths > global
Blast radius analysis β recursive CTE traversal to arbitrary depth with cycle guards
Performance
Connection pooling β
ThreadedConnectionPooleliminates per-symbol DB round tripsBatch inserts β symbols, imports, and calls batched per file in single transactions
Optimized queries β
SPLIT_PARTfix delivered a 2Γ speedup on call resolution
Visualization & Integration
Interactive graph β vis.js orbit map with click-to-inspect nodes, sidebar, and depth slider
Dark mode β toggleable canvas dark mode with real-time node/edge updates, persisted in
localStoragePremium styling β sleek interactive dashboard landing page UI and graph visualizer styled with
Bricolage Grotesque,Inter, andJetBrains MonotypographySKILL.md profile β system instructions to configure Claude as an impact-aware coding agent
Native MCP server β first-class integration with Cursor, Claude Desktop, and Windsurf
Git hooks β automatic re-indexing on every commit
CI pipeline β GitHub Actions with linting (
ruff), type checking (mypy), andpytest
π Supported Languages
β¦and more |
Tree-sitter parsing supported for 27 languages. Deep semantic call graph mapping currently optimized for Python, JS/TS, and Java.
π Installation
Prerequisites
Quick start
Install N3MO directly from PyPI:
# Install the package
pip install n3mo
# Start Docker containers & initialize the database
n3mo setupAlternatively, for contributors running in editable mode:
git clone https://github.com/RajX-dev/N3MO.git
cd N3MO
pip install -e .
n3mo setupπ€ Model Context Protocol (MCP)
N3MO includes a native MCP server that exposes repository analysis and graph traversal tools to LLM agents (like Claude, Cursor, or Windsurf).
Automatic Claude Desktop Setup
# Navigate to the workspace you want Claude to analyze, then run:
n3mo mcp installThis registers N3MO and sets up the paths automatically. Restart Claude Desktop and you're ready!
π§ Claude Skill (System Instructions)
To configure Claude to run N3MO impact queries proactively before changing code in the editor, import or copy-paste the custom instructions from the SKILL.md profile.
Cursor Setup
Go to Settings β Models β MCP.
Click + Add New MCP Server.
Set the configuration details:
Name:
n3moType:
commandCommand:
n3mo mcp start(oruvx n3mo mcp startto run directly)Environment Variables:
TARGET_CODE_DIR=/absolute/path/to/your/active/workspace
Click Save, and Cursor will instantly be able to index and query your workspace blast radius.
π§° Available MCP Tools
Tool | Description |
| Ingests and indexes the codebase |
| Locates the definition of a symbol across the workspace (file path, line number) |
| Finds all external symbols that a given symbol calls (forward-dependency graph) |
| Lists all classes and functions defined inside a specific file |
| Traces the transitive impact/call graph of a code symbol |
β GitHub Webhook Integration
For team collaboration and automated pull-request analysis in your CI/CD pipeline, visit n3mo.shop to get started with the GitHub Webhook integration.
π° Pricing & Licensing
N3MO is free under the PolyForm Noncommercial 1.0.0 License for local usage and single-developer MCP integrations.
100% Free & Local β CLI queries, local MCP integrations, and the visualizer with zero limits.
Enterprise Licensing β for large-scale organization deployments or commercial licensing terms, reach out to the author.
π» Usage
Index a repository
cd /path/to/your/project
n3mo indexWhat gets indexed:
β Source files in all 27 supported languages
β Virtual environments (
venv/,.venv/)β Dependencies (
node_modules/,site-packages/)β Build artifacts (
.git/,__pycache__/,dist/)β Test / fixture directories (
tests/,mocks/,specs/)
Visualizer
Dark Mode β Radial Layout

Horizontal Tree View

Example terminal output:
β IMPACT ANALYSIS
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Target: authenticate_user
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Direct Callers (3 symbols)
βΈ login_endpoint api/auth.py:12
βΈ refresh_token api/token.py:23
βΈ validate_session middleware/auth.py:89
β Ripple Effects (5 symbols)
β°ββΈ POST /login routes.py:67
β°ββΈ admin_login admin/views.py:34
β°ββΈ require_auth decorators.py:12
β°ββΈ dashboard_view views/dashboard.py:8
β°ββΈ settings_view views/settings.py:22
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Total impacted: 8 references β depth β€ 3Dependency graph visualization
graph LR
A[main.py] --> B[auth.py::login]
A --> C[db.py::connect]
B --> D[utils.py::hash_password]
B --> E[models.py::User]
C --> F[config.py::DB_URI]
style A fill:#ff6b6b,stroke:#c92a2a,stroke-width:2px,color:#fff
style B fill:#4ecdc4,stroke:#0ca89e,stroke-width:2px,color:#000
style C fill:#45b7d1,stroke:#1098ad,stroke-width:2px,color:#000
style D fill:#96ceb4,stroke:#63b598,stroke-width:2px,color:#000
style E fill:#ffd93d,stroke:#f5c200,stroke-width:2px,color:#000
style F fill:#e0e0e0,stroke:#a0a0a0,stroke-width:2px,color:#000π οΈ Technology Stack
Component | Technology | Purpose |
Parser | Error-tolerant syntax analysis across 27 languages | |
Database | Relational graph storage + recursive CTE queries | |
Runtime | Core logic + multiprocessing | |
Infrastructure | Containerization | |
Visualization | Interactive impact graph | |
AI Integration | Native tool for LLM agents |
πΊοΈ Roadmap
All four development phases have been completed. N3MO is stable and actively maintained.
Phase | Component | Status |
Phase 1 β Foundations | ||
Docker setup | β Complete | |
Database schema | β Complete | |
Tree-sitter integration | β Complete | |
Symbol + call extraction | β Complete | |
Blast radius (recursive CTE) | β Complete | |
Interactive visualizer | β Complete | |
Phase 2 β Performance | ||
Connection pooling | β Complete | |
Batch DB operations (symbols/imports/calls) | β Complete | |
SPLIT_PART query optimization | β Complete | |
| β Complete | |
Interactive depth slider | β Complete | |
Phase 3 β Correctness & Scaling | ||
Incremental re-index (file hashing) | β Complete | |
Multiprocessing (AST parsing) | β Complete | |
Scope-aware call resolution | β Complete | |
CTE cycle guard | β Complete | |
Full type annotations + mypy | β Complete | |
pytest suite + CI | β Complete | |
Multi-language support (27 languages) | β Complete | |
Phase 4 β Distribution | ||
MCP server (Cursor / Claude / Windsurf) | β Complete | |
Real-time git-hook indexing | β Complete |
Docker environment (PostgreSQL)
Database schema β Projects, Symbols, Calls, Imports tables
Tree-sitter parser integration
Symbol extractor with full AST traversal
Idempotent upsert logic
Blast radius via recursive CTE
Interactive vis.js visualizer
psycopg2.pool.ThreadedConnectionPoolβ replace per-call connectionsexecute_values()batch inserts for symbols, imports, and calls β 1 transaction per fileSPLIT_PART query optimization for call resolution
--fileand--depthCLI flags for targeted impact analysisInteractive depth slider in visualizer
Results: Django (3,021 files, ~43K symbols, ~181K calls) β 23min β 2.5min (9Γ faster)
SHA-256 file hashing for incremental re-index
ProcessPoolExecutorfor parallel AST parsingScope-aware call resolution using imports table
CTE cycle guard (visited node tracking)
Full type annotations,
mypyclean checking in CIpytest unit + integration test suite
GitHub Actions CI pipeline
Multi-language support (27 languages)
MCP server β N3MO as a tool for Cursor, Claude Code, Windsurf
Real-time incremental indexing via git hooks
π Design Principles
1. Structure before semantics Map the code skeleton (AST) before adding AI analysis. A correct graph is worth more than a smart but wrong one.
2. Database as source of truth All state lives in PostgreSQL, eliminating in-memory complexity and enabling graph queries that application-level traversal cannot match.
3. Correctness over speed The parser must handle syntax errors gracefully without corrupting the graph. A fast indexer that silently drops symbols is worse than a slow one that gets everything right.
4. Idempotent operations Re-running ingestion produces identical results, enabling safe incremental updates and CI/CD integration.
π€ Contributing
Contributions are welcome! Please read the CONTRIBUTING.md guide to get started with setting up the project, coding standards, and running checks locally.
Development Setup
# Install with dev dependencies
pip install -e ".[dev]"
# Lint
ruff check n3mo/
# Type check
mypy n3mo/
# Tests
pytest tests/π License
Licensed under the PolyForm Noncommercial 1.0.0 License.
β Free for personal projects, academic research, and hobby tools
β Source available β view, modify, and distribute for noncommercial purposes
β οΈ Noncommercial β you may not use it for commercial purposes
β οΈ Restrictions apply on offering it as a service
For commercial deployments or proprietary modifications, contact for licensing options.
See LICENSE for full legal details.
π¨π» Author
Raj Shekhar β Delhi Technological University
π Acknowledgments
Tree-sitter β for robust, incremental, error-tolerant parsing
PostgreSQL β for making recursive graph queries possible without a graph database
Docker β for reproducible, single-command environments
vis.js β for the interactive graph visualization
FastAPI β for the high-performance REST layer
β Star this repo if you find it useful β thanks for visiting!
Building tools for understanding code at scale.
Maintenance
Latest Blog Posts
- Your AI Chatbot Just Exposed Your CEO's Salary to an InternBy Om-Shree-0709 on .Agent IdentityMCP SecurityOAuth Delegation
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/RajX-dev/N3MO'
If you have feedback or need assistance with the MCP directory API, please join our Discord server