PG-Git (Semantic Memory MCP)
Provides semantic code search and repository management for Git repositories, storing the entire DAG in PostgreSQL with vector embeddings for AI-driven code discovery.
Integrates with Ollama's nomic-embed-text model for local, private generation of semantic embeddings used in temporal-decay vector search.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@PG-Git (Semantic Memory MCP)find where temporal decay is calculated"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
PG-Git (Semantic Memory Engine)
A persistent, PostgreSQL-backed repository management system and semantic memory engine for AI IDEs. Instead of storing Git objects loosely on the file system, PG-Git stores the entire Directed Acyclic Graph (DAG) natively in PostgreSQL, complete with automatically generated, temporally-decayed semantic vector embeddings.
π§ Why PG-Git?
In the standard AI coding agent ecosystem, searching codebases relies on rigid grep searches or expensive AST parsing. PG-Git fundamentally changes this by bridging Git directly with Vector Databases:
Semantic Code Search: Find code based on what it does, not just its syntax.
Exponential Temporal Decay: PG-Git mathematically decays older vectors. Your agent will prioritize code you wrote yesterday over highly similar dead code written 6 months ago.
Local-First Purity: No cloud APIs. It uses Ollama with
bge-largefor 100% private, on-device vectorization at 1024 dimensions.ACID Compliant: Native transactions ensure complete safety for multi-node accessibility and concurrent AI swarm agents.
β οΈ Not a Replacement for Git
It is critical to understand that PG-Git does not replace Git or services like GitHub/GitLab. It does not handle branch merging, rebasing, or pull requests.
Instead, PG-Git is an agentic augmentation layer. You continue to use standard Git for your human-facing source control and team collaboration. PG-Git sits alongside it in your workflow, automatically ingesting your standard Git history to provide your AI agents with a mathematically optimized, semantically searchable clone of your codebase.
π Reliability Audit & Pointer Resolution
To guarantee production-grade stability and complete continuity across complex monorepos, PG-Git underwent a thorough architectural audit and reliability overhaul:
Deterministic Pointer Resolution: In standard
'pointer'storage mode (used to keep the database lightweight), file content is stored asNULLin PostgreSQL. We introduced a dynamic resolver insideserver/git-engine.jsthat maps relative blob paths to the local filesystem monorepo root. This automatically streams raw file contents from disk on demand, resolving previousnullvalue crashes on the Express/api/blobs/:idroute andpg_git_read_blobMCP tool.Safe Semantic Search Previews: Standard semantic search tool calls throw TypeErrors if they attempt to stringify nullable database columns. PG-Git now safely extracts the pre-compiled
summarycolumn (generated usingqwen2.5-coder:14binline summaries) to serve as the context preview.Starvation-Proof Queue Management: The fleet priority queue (
PriorityQueue.process) insidelib/llm-queue.jsis secured using a stricttry ... finallyblock. This guarantees that concurrent slots are cleanly released (this.active--) under all task outcomes, preventing memory and slot leaks that would otherwise block multi-agent swarm operations.Fidelity-Preserved Embedding Backfills: Upgraded
backfill_embeddings.jsto automatically resolve local disk paths when processing pointer-mode blobs. This enables the embedding backfill process to compute full-fidelity, chunked centroid embeddings from the original code rather than falling back to lossy, shortened summary snippets.
π€ The Agentic Brain (Synergy with Krusch Context MCP)
PG-Git is designed to be used as the codebase engine underneath the Krusch Context MCP β a unified IDE context server that collapses semantic code search, episodic memory, and holographic nuggets into a single MCP process.
While PG-Git can be run standalone as its own MCP server (via server/mcp.js), its full power is realized when consumed by Krusch Context MCP:
Layer | Source | Purpose |
Codebase Memory (The "What" & "How") | PG-Git | Semantically embedded source files across your entire codebase |
Episodic Memory (The "Why") | Krusch Context | Architectural decisions, bugs encountered, project goals |
Holographic Nuggets (The "How to Behave") | Krusch Context | Lightweight steering facts, user preferences, project conventions |
Infinite Continuity: By running Krusch Context MCP (which imports PG-Git's database pool and embedding logic directly), your agent can cross-reference the intent (episodic memory) with the implementation (codebase blobs). It remembers why you chose a specific architecture, and instantly sees how it's currently implemented, creating a deeply contextualized and autonomous coding workflow that persists across infinite sessions.
π See the full unified server documentation: Krusch Context MCP README
β‘ Quick Start
You must have Ollama running with the bge-large embedding model pulled:
ollama pull bge-large1. Clone & Migrate Database
You will need a running PostgreSQL instance with pgvector enabled.
git clone https://github.com/kruschdev/pg-git.git
cd pg-git
npm install
cp .env.example .env
# Edit .env with your PostgreSQL credentials
node db/migrate.js2. Import Your GitHub History
Choose your embedding model carefully. You must set your preferred model (via the Web UI Settings tab or .env) before running your first import or snapshot. If you change models later, vector dimensions will collide and you will be forced to manually wipe the database and re-embed all repositories from scratch.
You can instantly import any local .git repository. PG-Git will natively parse the Git history, generate semantic embeddings for all blobs, and securely deduplicate them into PostgreSQL:
npm run import3a. Use via Krusch Context MCP (Recommended)
The recommended way to use PG-Git is through the Krusch Context MCP unified server, which wraps PG-Git's database pool and embedding logic alongside episodic memory and nuggets. See the Krusch Context MCP Quick Start for setup instructions.
3b. Use Standalone MCP Server
If you prefer to run PG-Git as an isolated MCP server, you can execute it directly via NPM. Add it to your agent/IDE configuration (e.g., mcp_config.json):
{
"mcpServers": {
"pg-git-mcp": {
"command": "npx",
"args": ["-y", "pg-git-mcp"],
"env": {
"PG_CONNECTION_STRING": "postgres://user:pass@localhost:5434/kruschdb",
"OLLAMA_URL": "http://localhost:11434",
"EMBED_MODEL": "bge-large"
}
}
}
}4. Start the Web UI (Optional) PG-Git includes a sleek, dual-pane IDE interface for browsing your semantic repositories.
npm run devπ§ MCP Tools (Standalone Mode)
When running PG-Git as a standalone MCP server, it exposes the following tools:
Tool | Description |
| List all available PG-Git repositories stored in the database |
| Read the directory structure (DAG node) of a specific repository |
| Read the file contents of a specific blob |
| Semantically search all indexed files with temporal decay scoring |
When used throughKrusch Context MCP, these capabilities are exposed under unified tool names (krusch_context_search_code, krusch_context_read_tree, krusch_context_read_blob, krusch_context_list_repos) alongside 14 additional memory and nugget tools.
π Real-World Usage Examples
Standalone Mode
Speak to your IDE agent normally. It will use the standalone MCP tools to interface with the database:
Example 1: Finding specific logic
You: "Where do we handle the temporal decay for the memory MCP?" Agent: [Calls
pg_git_semantic_search] "I found the logic inserver/git-engine.js. It uses theexp(-0.01 * age_in_days)formula insearchBlobs()."
Example 2: Reading a repository tree
You: "What is the folder structure for the pg-git project?" Agent: [Calls
pg_git_read_tree] "Here is the root directory structure..."
Example 3: Filtering by project
You: "Search for authentication logic in the pocket-lawyer project only." Agent: [Calls
pg_git_semantic_searchwithproject: 'pocket-lawyer'] "Found 3 matches in the auth module..."
How Does Temporal Decay Work?
When calling pg_git_semantic_search, PG-Git returns the highest cosine-similarity matches. However, it applies Exponential Temporal Decay based on the blob's last_seen_at timestamp:
score = cosine_similarity Γ exp(-0.01 Γ age_in_days)If you have two very similar pieces of code, the newer one will have a significantly higher score, preventing your agent from hallucinating based on outdated implementations.
π€ The Autonomous Agent Workflow
You can integrate PG-Git into your agentic workflow to ensure your semantic memory is always up to date.
Snapshot (Single Project)
Whenever you step away from a task, tell your agent to run the snapshot script. The agent will autonomously:
Hash the current project folder into Git Blobs and Trees.
Ping Ollama to embed any new or modified files.
Commit the state directly into PostgreSQL.
npm run snapshotSync All Projects (Fleet-Wide)
To re-index all active project codebases across the fleet:
npm run sync-allExternal Documentation Sync
PG-Git can ingest external documentation (e.g., llms.txt manifests) for hallucination-free framework knowledge:
node scripts/sync_external_docs.jsπ Project Structure
pg-git/
βββ server/
β βββ index.js # Express API + Web UI server
β βββ mcp.js # Standalone MCP server (StdioServerTransport)
β βββ git-engine.js # Git DAG operations + semantic search with temporal decay
βββ db/
β βββ schema.sql # PostgreSQL schema (repos, commits, branches, trees, blobs + pgvector)
β βββ pool.js # Shared pg.Pool connection
β βββ migrate.js # Schema migration runner
β βββ create-db.js # Database creation helper
β βββ list-dbs.js # List available databases
βββ lib/
β βββ embedding.js # Shared Ollama embedding client with fleet load balancing
βββ scripts/
β βββ sync_to_pg.js # Snapshot a single project into PostgreSQL
β βββ sync_all_projects.js # Fleet-wide project sync
β βββ sync_external_docs.js # External documentation ingestion (llms.txt)
β βββ import_github.js # Import native .git history
β βββ import_hf_dataset.js # Import HuggingFace datasets
β βββ backfill_embeddings.js # Backfill missing embeddings
β βββ migrate_to_1024.js # Dimension migration helper
β βββ migrate_to_pointer.js # Storage mode migration
β βββ scaffold_nesting_dolls.js # Nesting-doll chunking scaffold
βββ client/ # Vite + React Web UI
βββ config/
β βββ external_docs.json # External documentation manifest
βββ config.js # Unified configuration (env + config.json merge)
βββ assets/ # Banner and social preview images
βββ Dockerfile # Multi-stage production build
βββ docker-compose.yml # Container orchestration
βββ AGENTS.md # Agent context rules for AI IDEs
βββ spec.md # Original project specificationπ οΈ Configuration & Environment Variables
PG-Git uses a layered configuration system: environment variables override config.json, which overrides built-in defaults.
Variable | Description | Default |
| Express server port |
|
| PostgreSQL Host address |
|
| PostgreSQL Port |
|
| Database Name |
|
| Database User |
|
| Database Password | (empty) |
| The endpoint for your local Ollama instance |
|
| The Ollama text-embedding model to use |
|
Database Schema
PG-Git's PostgreSQL schema maps Git objects directly into SQL tables:
repositoriesβ Project registriescommitsβ SHA-1 identified commit objects with tree and parent referencesbranchesβ Named references to commit headstrees/tree_entriesβ Directory structures mapping names to blob/tree object IDsblobsβ File content withembedding vector(1024)andlast_seen_attemporal tracking
Semantic search uses the pgvector HNSW index with vector_cosine_ops for sub-millisecond approximate nearest-neighbor lookups.
πΊοΈ Related Projects
Project | Role |
Unified IDE context server β wraps PG-Git + episodic memory + nuggets into a single MCP process | |
This project published to the NPM registry | |
Original Holographic Nuggets MCP architecture adapted in Krusch Context |
License
ISC License. Created by kruschdev.
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/kruschdev/pg-git-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server