Skip to main content
Glama

N3MO Banner PyPI version License: PolyForm Noncommercial 1.0.0 Python Docker Status MCP Registry CI

A structural code intelligence layer that transforms source code into a queryable knowledge graph for search, impact analysis, and AI-powered development.

Parse once. Query forever. Know exactly what breaks before it does.

πŸ“œ Licensed under PolyForm Noncommercial 1.0.0 β€” Source available for noncommercial use. β€’ Need commercial use? Get a commercial license β†’

What is N3MO β€’ Benchmarks β€’ Architecture β€’ Installation β€’ MCP β€’ Usage β€’ Roadmap


🎯 What is N3MO?

N3MO is a symbol-centric code intelligence layer. Instead of scanning raw text, it parses your source code's ASTs, maps call graphs, and models dependencies in a queryable relational database β€” deterministically, with zero LLM calls at index time.

For engineering leaders and teams, N3MO acts as a structural insurance policy for your codebases.

πŸ’‘ Why N3MO?

  • πŸ›‘οΈ Eliminate Regression Risks β€” Utility functions are rarely refactored because developers fear unknown side effects. N3MO maps the transitive blast radius of any symbol to arbitrary depth, showing you exactly what will break before you make the edit.

  • 🏎️ Rapid Developer Onboarding β€” Instead of senior engineers spending hours explaining codebase flow to new hires, developers run one command to visualize complex call chains and parent-child dependencies interactively.

  • πŸ€– AI-Agent Ready Infrastructure β€” Modern LLM agents (Cursor, Claude Desktop) are limited by context windows and text search. N3MO's native MCP server lets agents query the actual code graph, enabling fast, hallucination-free refactoring.

  • ⚑ No Embeddings, No Drift β€” N3MO is pure static analysis: Tree-sitter AST parsing into PostgreSQL. There's no vector index to keep in sync, no embedding cost per repo, and no semantic-similarity guesswork β€” every edge in the graph is an exact, verifiable relationship.

πŸ“Š How N3MO Compares

Capability

Grep / Text Search

IDE "Find References"

N3MO Code Graph

Analysis Basis

Substring matching

AST-based, direct refs only

Relational knowledge graph

Transitive Traversal

❌ None

❌ Manual, one level at a time

⚑ Instant to arbitrary depth

Blast Radius Mapping

❌ None

❌ Flat search-result list

🎨 Interactive visual orbit map

CI/CD Integration

❌ None

❌ Bound to IDE runtime

βš™οΈ Dockerized CLI + CTE queries

AI Agent Integration

❌ Injected file chunks

⚠️ Manual context copy

πŸ€– Native MCP server

Language Coverage

βœ… Any text file

⚠️ Language-specific plugins

βœ… 27 Tree-sitter grammars

Indexing Method

N/A

N/A

βœ… Deterministic AST parse β€” no embeddings, no LLM calls

πŸ› οΈ The Core Problem N3MO Solves

❌ Without N3MO

βœ… With N3MO

N3MO doesn't find text β€” it understands structure. It traces the actual call graph, not string matches.

Questions N3MO answers instantly:

Question

How

πŸ”Ž

What functions and classes exist in this repo?

Full symbol index across 27 languages

🎯

Where is this symbol used β€” directly and transitively?

Recursive CTE traversal to arbitrary depth

πŸ’₯

What is the blast radius of changing this function?

Interactive orbit map with depth slider

πŸ•ΈοΈ

How do these components actually connect?

Call graph + parent-child hierarchy

πŸ€–

Can my AI agent understand this codebase structurally?

Native MCP server for Cursor / Claude


Related MCP server: sourcebook

πŸ“Š Benchmarks

All benchmarks measured on Intel i5-13450HX, 24 GB RAM, NVMe SSD.

Django β€” Optimization History

Django is the primary benchmark target: 3,021 files, ~43K symbols, ~181K calls.

Django Index Time (minutes)
═══════════════════════════════════════════════════════════════

v0.3 Baseline       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ  23 min   1Γ—
SPLIT_PART Fix      β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ                          11 min   2Γ—
Batch Inserts       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ                                        5 min   4.6Γ—
+ Multiprocessing   β–ˆβ–ˆβ–ˆβ–ˆ                                           2.5 min   9Γ— πŸš€

═══════════════════════════════════════════════════════════════

Optimization

Index Time

Speedup

What Changed

v0.3 baseline

23 min

1Γ—

Per-symbol DB inserts, naive call resolution

+ SPLIT_PART query fix

11 min

2Γ—

Eliminated redundant string splitting in call resolution

+ Batch inserts

5 min

4.6Γ—

Symbols, imports, and calls batched per file (1 transaction)

+ Multiprocessing

~2.5 min

~9Γ—

ProcessPoolExecutor distributes AST parsing across cores

βœ… All results are real measurements on the Django repository. Multiprocessing gains scale with core count.

TensorFlow β€” Enterprise-Scale Monorepo

Tested on TensorFlow β€” a 36,000-file, multi-language (C++/Python) monorepo.

Metric

Result

Repo size (total files)

~36,000

Files processed & indexed

14,611 (after filtering tests, configs, and non-source files)

Total symbols extracted

79,523

Total call edges extracted

480,851

Full index time (cold start)

14.06 minutes

Peak memory (Docker container)

185 MB RAM

CPU utilization

~5%

N3MO scales from a 3K-file pure-Python repo (Django) to a 36K-file multi-language enterprise monorepo (TensorFlow) β€” roughly a 5Γ— larger indexing job at near-linear throughput, without significant resource overhead. Symbol/edge-per-file and incremental (warm) re-index numbers for TensorFlow are being finalized in the full benchmark report.

ScanCode Toolkit β€” Large Codebase

Tested on ScanCode Toolkit β€” ~600K lines of Python.

Metric

Result

Lines of code

~600,000

Full index time

~3 minutes

Processing mode

Single-threaded (v0.3)

Incremental Re-Indexing

N3MO uses SHA-256 file hashing to skip unchanged files on subsequent runs.

Scenario

Time

Notes

Full index (first run)

Baseline

All files parsed and inserted

No changes (re-run)

< 1 second

Hash comparison only, zero DB writes

1 file modified

< 2 seconds

Only the changed file is re-parsed and upserted

These results are from the built-in benchmark script on a 20-file synthetic repository. Real-world incremental performance is proportional to the number of changed files, not the total repository size.

Query Performance

Impact analysis uses PostgreSQL recursive CTEs with cycle guards. Query times are independent of repository size β€” they depend only on the size of the result subgraph.

Query Type

Typical Latency

Direct callers of a symbol

< 10 ms

Full blast radius (depth ≀ 5)

< 50 ms

Complete graph traversal

< 200 ms

Running the Benchmark

python benchmarks/benchmark_indexing.py

πŸ—οΈ Architecture

Knowledge graph model

N3MO builds a symbol-centric knowledge graph stored in PostgreSQL:

graph TD
    A["πŸ“„ Source Code"] -->|Tree-sitter| B["🌳 AST Parser"]
    B --> C["πŸ” Symbol Extractor"]
    D["πŸ”„ Git Hooks"] -->|post-commit| A

    C --> E[("πŸ—„οΈ PostgreSQL<br/>Projects Β· Symbols Β· Calls<br/>Imports Β· Files")]

    E --> F["πŸ’₯ Impact Analysis"]
    E --> G["πŸ“ž Call Graph"]
    E --> H["πŸ“Š Dependency Graph"]

    F --> I["🎨 Visualizer"]
    G --> I
    H --> I

    F --> J["πŸ€– MCP Server"]

    style A fill:#6c63ff,stroke:#4a3fbf,color:#fff
    style B fill:#7c74ff,stroke:#4a3fbf,color:#fff
    style C fill:#7c74ff,stroke:#4a3fbf,color:#fff
    style D fill:#ffd93d,stroke:#d4b800,color:#1a202c
    style E fill:#ff6b6b,stroke:#c53030,color:#fff,stroke-width:3px
    style F fill:#45b7d1,stroke:#2c8ea8,color:#1a202c
    style G fill:#45b7d1,stroke:#2c8ea8,color:#1a202c
    style H fill:#45b7d1,stroke:#2c8ea8,color:#1a202c
    style I fill:#9ae6b4,stroke:#2f855a,color:#1a202c
    style J fill:#ffd93d,stroke:#d4b800,color:#1a202c

System flow

sequenceDiagram
    participant User as User / CI
    participant CLI as N3MO CLI
    participant DB as PostgreSQL (Docker)
    participant Viz as Graph Visualizer

    rect rgb(26, 27, 46)
    Note over User, DB: Indexing Flow (Local CLI)
    User->>CLI: n3mo index
    CLI->>DB: Start PostgreSQL container (if not running)
    CLI->>CLI: Walk file tree (SHA-256 hash checks)
    CLI->>CLI: Parse AST (Tree-sitter, multiprocessing)
    CLI->>DB: Batch insert symbols, calls, imports
    CLI->>DB: Resolve imports & call links
    DB-->>CLI: Success
    CLI-->>User: Complete summary
    end

    rect rgb(26, 27, 46)
    Note over User, Viz: Query & Visualization Flow
    User->>CLI: n3mo impact "symbol" --graph
    CLI->>DB: Recursive CTE traversal (depth & file filters)
    DB-->>CLI: Blast radius subgraph
    CLI->>Viz: Generate orbital vis.js HTML
    CLI->>User: Launch local web server & open browser
    end

Data model

erDiagram
    PROJECT ||--o{ SYMBOL : contains
    PROJECT ||--o{ CALL : tracks
    PROJECT ||--o{ IMPORT : tracks
    PROJECT ||--o{ FILE : indexes
    SYMBOL ||--o{ CALL : "source of"
    SYMBOL ||--o{ CALL : "resolved to"
    SYMBOL ||--o{ SYMBOL : "parent of"

    PROJECT {
        uuid id PK
        text name
        text repo_url
        timestamp created_at
    }
    SYMBOL {
        uuid id PK
        uuid project_id FK
        text name
        text file_path
        text kind "function|class|method"
        text signature
        int start_line
        int end_line
        uuid parent_id FK
    }
    CALL {
        uuid id PK
        uuid project_id FK
        uuid source_symbol_id FK
        text call_name
        int line_number
        uuid resolved_symbol_id FK
    }
    IMPORT {
        uuid id PK
        uuid project_id FK
        text file_path
        text module
        text name
        text alias
        uuid resolved_symbol_id FK
    }
    FILE {
        uuid project_id FK
        text file_path PK
        text sha256
    }

✨ Core Capabilities

Ingestion & Parsing

  • Multi-language support β€” 27 Tree-sitter grammars supported (dynamically loaded); actively benchmarked on 10 major languages including Python, JS/TS, Go, Java, and C/C++

  • Parallel AST ingestion β€” ProcessPoolExecutor distributes CPU-bound parsing across all available cores

  • Incremental re-indexing β€” SHA-256 file hashing skips unchanged files automatically

  • Idempotent operations β€” re-indexing updates existing data without duplication

  • Smart exclusions β€” case-insensitive directory filters and camelCase-aware filename checks prevent false positives (e.g. allows contest.py while skipping test_*.py)

Analysis & Querying

  • Symbol extraction β€” functions, classes, methods with full file path + line context

  • Hierarchical modeling β€” parent-child relationships (Module β†’ Class β†’ Method)

  • Call graph construction β€” who calls whom, resolved at ingestion time

  • Scope-aware resolution β€” class scope > local file > imports > qualified dot paths > global

  • Blast radius analysis β€” recursive CTE traversal to arbitrary depth with cycle guards

Performance

  • Connection pooling β€” ThreadedConnectionPool eliminates per-symbol DB round trips

  • Batch inserts β€” symbols, imports, and calls batched per file in single transactions

  • Optimized queries β€” SPLIT_PART fix delivered a 2Γ— speedup on call resolution

Visualization & Integration

  • Interactive graph β€” vis.js orbit map with click-to-inspect nodes, sidebar, and depth slider

  • Dark mode β€” toggleable canvas dark mode with real-time node/edge updates, persisted in localStorage

  • Premium styling β€” sleek interactive dashboard landing page UI and graph visualizer styled with Bricolage Grotesque, Inter, and JetBrains Mono typography

  • SKILL.md profile β€” system instructions to configure Claude as an impact-aware coding agent

  • Native MCP server β€” first-class integration with Cursor, Claude Desktop, and Windsurf

  • Git hooks β€” automatic re-indexing on every commit

  • CI pipeline β€” GitHub Actions with linting (ruff), type checking (mypy), and pytest


🌐 Supported Languages

Python

JavaScript

TypeScript

Go

Rust

Java

C

C++

C#

Kotlin

Swift

Scala

Ruby

PHP

Haskell

Perl

Lua

R

Elixir

Dart

Groovy

PowerShell

MATLAB

Delphi

Bash

Zig

OCaml

…and more

Tree-sitter parsing supported for 27 languages. Deep semantic call graph mapping currently optimized for Python, JS/TS, and Java.


πŸš€ Installation

Prerequisites

Docker Python Git

Quick start

Install N3MO directly from PyPI:

# Install the package
pip install n3mo

# Start Docker containers & initialize the database
n3mo setup

Alternatively, for contributors running in editable mode:

git clone https://github.com/RajX-dev/N3MO.git
cd N3MO
pip install -e .
n3mo setup

πŸ€– Model Context Protocol (MCP)

N3MO includes a native MCP server that exposes repository analysis and graph traversal tools to LLM agents (like Claude, Cursor, or Windsurf).

Automatic Claude Desktop Setup

# Navigate to the workspace you want Claude to analyze, then run:
n3mo mcp install

This registers N3MO and sets up the paths automatically. Restart Claude Desktop and you're ready!

🧠 Claude Skill (System Instructions)

To configure Claude to run N3MO impact queries proactively before changing code in the editor, import or copy-paste the custom instructions from the SKILL.md profile.

Cursor Setup

  1. Go to Settings β†’ Models β†’ MCP.

  2. Click + Add New MCP Server.

  3. Set the configuration details:

    • Name: n3mo

    • Type: command

    • Command: n3mo mcp start (or uvx n3mo mcp start to run directly)

    • Environment Variables: TARGET_CODE_DIR=/absolute/path/to/your/active/workspace

  4. Click Save, and Cursor will instantly be able to index and query your workspace blast radius.

🧰 Available MCP Tools

Tool

Description

n3mo_index

Ingests and indexes the codebase

n3mo_search_symbol

Locates the definition of a symbol across the workspace (file path, line number)

n3mo_get_dependencies

Finds all external symbols that a given symbol calls (forward-dependency graph)

n3mo_get_file_symbols

Lists all classes and functions defined inside a specific file

n3mo_get_blast_radius

Traces the transitive impact/call graph of a code symbol


βš“ GitHub Webhook Integration

For team collaboration and automated pull-request analysis in your CI/CD pipeline, visit n3mo.shop to get started with the GitHub Webhook integration.

πŸ’° Pricing & Licensing

N3MO is free under the PolyForm Noncommercial 1.0.0 License for local usage and single-developer MCP integrations.

  • 100% Free & Local β€” CLI queries, local MCP integrations, and the visualizer with zero limits.

  • Enterprise Licensing β€” for large-scale organization deployments or commercial licensing terms, reach out to the author.


πŸ’» Usage

Index a repository

cd /path/to/your/project
n3mo index

What gets indexed:

  • βœ… Source files in all 27 supported languages

  • ❌ Virtual environments (venv/, .venv/)

  • ❌ Dependencies (node_modules/, site-packages/)

  • ❌ Build artifacts (.git/, __pycache__/, dist/)

  • ❌ Test / fixture directories (tests/, mocks/, specs/)

Visualizer

Dark Mode β€” Radial Layout

Dark Mode Radial Layout

Horizontal Tree View

Horizontal Tree View

Example terminal output:

  β—ˆ IMPACT ANALYSIS
  ──────────────────────────────────────────────────────────────────
  Target:  authenticate_user
  ──────────────────────────────────────────────────────────────────

  β—‰ Direct Callers  (3 symbols)

  β–Έ login_endpoint             api/auth.py:12
  β–Έ refresh_token              api/token.py:23
  β–Έ validate_session           middleware/auth.py:89

  β—Ž Ripple Effects  (5 symbols)

    ╰─▸ POST /login              routes.py:67
    ╰─▸ admin_login              admin/views.py:34
    ╰─▸ require_auth             decorators.py:12
    ╰─▸ dashboard_view           views/dashboard.py:8
    ╰─▸ settings_view            views/settings.py:22

  ──────────────────────────────────────────────────────────────────
  Total impacted: 8 references  β”‚  depth ≀ 3

Dependency graph visualization

graph LR
    A[main.py] --> B[auth.py::login]
    A --> C[db.py::connect]
    B --> D[utils.py::hash_password]
    B --> E[models.py::User]
    C --> F[config.py::DB_URI]

    style A fill:#ff6b6b,stroke:#c92a2a,stroke-width:2px,color:#fff
    style B fill:#4ecdc4,stroke:#0ca89e,stroke-width:2px,color:#000
    style C fill:#45b7d1,stroke:#1098ad,stroke-width:2px,color:#000
    style D fill:#96ceb4,stroke:#63b598,stroke-width:2px,color:#000
    style E fill:#ffd93d,stroke:#f5c200,stroke-width:2px,color:#000
    style F fill:#e0e0e0,stroke:#a0a0a0,stroke-width:2px,color:#000

πŸ› οΈ Technology Stack

Component

Technology

Purpose

Parser

Tree-sitter

Error-tolerant syntax analysis across 27 languages

Database

PostgreSQL

Relational graph storage + recursive CTE queries

Runtime

Python

Core logic + multiprocessing

Infrastructure

Docker

Containerization

Visualization

JavaScript

Interactive impact graph

AI Integration

MCP

Native tool for LLM agents


πŸ—ΊοΈ Roadmap

All four development phases have been completed. N3MO is stable and actively maintained.

Phase

Component

Status

Phase 1 β€” Foundations

Docker setup

βœ… Complete

Database schema

βœ… Complete

Tree-sitter integration

βœ… Complete

Symbol + call extraction

βœ… Complete

Blast radius (recursive CTE)

βœ… Complete

Interactive visualizer

βœ… Complete

Phase 2 β€” Performance

Connection pooling

βœ… Complete

Batch DB operations (symbols/imports/calls)

βœ… Complete

SPLIT_PART query optimization

βœ… Complete

--file / --depth CLI flags

βœ… Complete

Interactive depth slider

βœ… Complete

Phase 3 β€” Correctness & Scaling

Incremental re-index (file hashing)

βœ… Complete

Multiprocessing (AST parsing)

βœ… Complete

Scope-aware call resolution

βœ… Complete

CTE cycle guard

βœ… Complete

Full type annotations + mypy

βœ… Complete

pytest suite + CI

βœ… Complete

Multi-language support (27 languages)

βœ… Complete

Phase 4 β€” Distribution

MCP server (Cursor / Claude / Windsurf)

βœ… Complete

Real-time git-hook indexing

βœ… Complete

  • Docker environment (PostgreSQL)

  • Database schema β€” Projects, Symbols, Calls, Imports tables

  • Tree-sitter parser integration

  • Symbol extractor with full AST traversal

  • Idempotent upsert logic

  • Blast radius via recursive CTE

  • Interactive vis.js visualizer

  • psycopg2.pool.ThreadedConnectionPool β€” replace per-call connections

  • execute_values() batch inserts for symbols, imports, and calls β€” 1 transaction per file

  • SPLIT_PART query optimization for call resolution

  • --file and --depth CLI flags for targeted impact analysis

  • Interactive depth slider in visualizer

Results: Django (3,021 files, ~43K symbols, ~181K calls) β€” 23min β†’ 2.5min (9Γ— faster)

  • SHA-256 file hashing for incremental re-index

  • ProcessPoolExecutor for parallel AST parsing

  • Scope-aware call resolution using imports table

  • CTE cycle guard (visited node tracking)

  • Full type annotations, mypy clean checking in CI

  • pytest unit + integration test suite

  • GitHub Actions CI pipeline

  • Multi-language support (27 languages)

  • MCP server β€” N3MO as a tool for Cursor, Claude Code, Windsurf

  • Real-time incremental indexing via git hooks


πŸ“ Design Principles

1. Structure before semantics Map the code skeleton (AST) before adding AI analysis. A correct graph is worth more than a smart but wrong one.

2. Database as source of truth All state lives in PostgreSQL, eliminating in-memory complexity and enabling graph queries that application-level traversal cannot match.

3. Correctness over speed The parser must handle syntax errors gracefully without corrupting the graph. A fast indexer that silently drops symbols is worse than a slow one that gets everything right.

4. Idempotent operations Re-running ingestion produces identical results, enabling safe incremental updates and CI/CD integration.


🀝 Contributing

Contributions are welcome! Please read the CONTRIBUTING.md guide to get started with setting up the project, coding standards, and running checks locally.

Development Setup

# Install with dev dependencies
pip install -e ".[dev]"

# Lint
ruff check n3mo/

# Type check
mypy n3mo/

# Tests
pytest tests/

πŸ“œ License

Licensed under the PolyForm Noncommercial 1.0.0 License.

  • βœ… Free for personal projects, academic research, and hobby tools

  • βœ… Source available β€” view, modify, and distribute for noncommercial purposes

  • ⚠️ Noncommercial β€” you may not use it for commercial purposes

  • ⚠️ Restrictions apply on offering it as a service

For commercial deployments or proprietary modifications, contact for licensing options.

See LICENSE for full legal details.


πŸ‘¨πŸ’» Author

Raj Shekhar β€” Delhi Technological University

GitHub LinkedIn


πŸ™ Acknowledgments

  • Tree-sitter β€” for robust, incremental, error-tolerant parsing

  • PostgreSQL β€” for making recursive graph queries possible without a graph database

  • Docker β€” for reproducible, single-command environments

  • vis.js β€” for the interactive graph visualization

  • FastAPI β€” for the high-performance REST layer


⭐ Star this repo if you find it useful β€” thanks for visiting!

Building tools for understanding code at scale.

Visitors

Install Server
F
license - not found
A
quality
A
maintenance

Maintenance

–Maintainers
6hResponse time
–Release cycle
1Releases (12mo)
Commit activity

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/RajX-dev/N3MO'

If you have feedback or need assistance with the MCP directory API, please join our Discord server