How do I use Super-Memory-TS?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Super-Memory-TS store that I prefer dark mode in my editor" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

Super-Memory-TS

by Veedubin

Overview Schema Related Servers Score Discussions

TypeScript

Local

⚠️ NO LONGER IN ACTIVE DEVELOPMENT
This project (Super-Memory-TS/, TypeScript + Qdrant) is no longer in active development.
Status
Replacement
Memory backend
memini-ai-dev — successor written in Python with PostgreSQL + pgvector. Adds trust scoring, knowledge graph, tiered loading, and thought chains. Source at ~/Projects/MCP-Servers/memini-ai-dev/.
Orchestration plugin (consumer)
@veedubin/boomerang-v3 — successor orchestration plugin that consumes memini-ai-dev via the memini-ai-dev MCP server. Source at ~/Projects/MCP-Servers/boomerang-v3/.
This directory is kept for historical reference only. No new features, bug fixes, or NPM releases (@veedubin/super-memory-ts) will be made here. The Qdrant storage, BGE-Large / MiniLM embedding pipeline, and dist/ builds are still functional if you point an older Boomerang v2 at them.
Migrating? See ~/Projects/MCP-Servers/memini-ai-dev/README.md for the migration guide. The new server exposes memory.add, memory.query, memory.getTrustScore, memory.adjustTrust, and 30+ other tools that map cleanly onto the old save_to_memory / query_memory / search_project interface.

Super-Memory-TS

npm version License: MIT

Local-first semantic memory server with project indexing for AI assistants.

Version: v2.4.0 | Database: Qdrant (HNSW indexing, payload filtering) | Embeddings: BGE-Large (GPU, 1024-dim) / MiniLM-L6-v2 (CPU, 384-dim) | Precision: fp16 (~325MB) | NPM: @veedubin/super-memory-ts

Super-Memory-TS is a TypeScript implementation of a persistent, local-first memory system that provides semantic search over memories and project code using embeddings and vector search. It runs as an MCP (Model Context Protocol) server, enabling AI assistants like Boomerang to store, retrieve, and search through accumulated knowledge.

Related MCP server: KB-MCP Server

🤝 Companion Package

Super-Memory-TS powers the memory system for @veedubin/boomerang-v2 — a multi-agent orchestration plugin for OpenCode with 14 specialized agents, 8-step protocol, and built-in semantic memory.

Overview

What is Super-Memory v2?

Super-Memory v2 is a complete rewrite of the original Python-based memory system in TypeScript. It provides:

Semantic Memory Search: Store and retrieve memories using natural language queries
Project Indexing: Automatically index project files for code-aware search
Local-First: All data stays on your machine—no cloud dependencies
MCP Server: Standard MCP protocol for integration with AI assistants
FP16 Support: Reduce memory usage by 50% with floating-point precision options
HNSW Vector Search: Sub-10ms query latency using state-of-the-art approximate nearest neighbor search

Key Features

Feature	Description
fp16/fp32 Precision	Reduce memory footprint from ~650MB to ~325MB per model instance
BGE-Large Embeddings	1024-dimensional embeddings from BAAI/bge-large-en-v1.5
MiniLM Fallback	CPU-friendly 384-dimensional embeddings from sentence-transformers/all-MiniLM-L6-v2
HNSW Index	IVF_HNSW_SQ index for optimal recall/speed tradeoff
Incremental Indexing	xxhash-wasm snapshot-based change detection (10x faster than SHA-256)
Semantic Chunking	Intelligent code splitting at function/class boundaries
Reference Counting	Singleton model manager prevents VRAM duplication
Project Isolation	Payload-based filtering by `projectId` for multi-project memory separation
Tiered Search	`tiered` (default), `vector_only`, `text_only` search strategies
Snapshot Indexing	`.opencode/super-memory-ts/snapshot.json` for persistent file tracking

Features

Project Isolation

Super-Memory-TS supports multi-project memory isolation using projectId tagging in Qdrant payloads.

How It Works

Set Project ID via BOOMERANG_PROJECT_ID environment variable:
```
export BOOMERANG_PROJECT_ID=my-project
```
Automatic Tagging: All memories added are tagged with projectId in the Qdrant payload
Automatic Filtering: Queries are automatically filtered by projectId - you only see memories from the current project
Backward Compatible: Memories without a projectId tag are still searchable (untagged memories are visible to all projects)

Use Cases

Single project: Set BOOMERANG_PROJECT_ID once, all operations are isolated
Multiple projects: Change BOOMERANG_PROJECT_ID to switch between projects
Shared memories: Omit BOOMERANG_PROJECT_ID to create cross-project shared memories

Example

// In project "backend-api"
await memorySystem.addMemory({ text: "Use JWT for auth" });
// → Stored with projectId: "backend-api"

// Query from "backend-api" → Returns the JWT memory
// Query from "frontend-app" → Does NOT see the JWT memory

Semantic Memory Search

Store memories with automatic embedding generation and retrieve them using natural language queries:

// Add a memory
await memorySystem.addMemory({
  text: "User prefers dark mode in VS Code",
  sourceType: 'session',
  metadata: { context: "settings discussion" }
});

// Query memories
const results = await memorySystem.queryMemories("What theme settings does the user have?");

Automatic Project Indexing

Index project files on startup with background watching for changes:

Supports TypeScript, JavaScript, Python, Markdown, JSON, and more
Semantic chunking preserves code structure (functions, classes)
Incremental updates via SHA-256 hash comparison
File watching with debouncing (500ms)

Custom Path Indexing

You can index any directory using the path parameter in the index_project tool:

{
  "path": "/path/to/other/project",
  "force": true
}

Use cases:

Index a subdirectory of your project
Index a completely different codebase
Re-index specific folders after major changes

Example: Index a shared library:

{
  "path": "/home/user/Projects/shared-utils",
  "force": false
}

Tiered Memory Architecture

Super-Memory-TS provides two search strategies optimized for different use cases:

Mode	Strategy	Description	Best For
Fast Reply	`TIERED` (default)	Quick MiniLM search with BGE fallback	Interactive queries, real-time responses
Archivist	`PARALLEL`	Dual-tier search with RRF fusion	Comprehensive recall, research tasks

Fast Reply (TIERED)

Hybrid approach optimized for speed:

Primary: MiniLM-L6-v2 (384-dim) search - fast CPU-based search
Fallback: BGE-Large (1024-dim) for refinement when MiniLM results are ambiguous
Use when: Speed matters, general purpose queries

Query → MiniLM Search → [if low confidence] → BGE Refinement → Results

Archivist (PARALLEL)

Dual-tier search optimized for recall:

Parallel: Both MiniLM and BGE searches run simultaneously
Fusion: Reciprocal Rank Fusion (RRF) combines results
Use when: Thorough search is needed, research, debugging

Query → MiniLM Search ─┐
                       ├→ RRF Fusion → Results
       → BGE Search   ─┘

Configuration

# Default: TIERED (Fast Reply)
BOOMERANG_SEARCH_STRATEGY=tiered

# Archivist mode for maximum recall
BOOMERANG_SEARCH_STRATEGY=parallel

Or via MCP tool:

{
  "query": "authentication middleware",
  "strategy": "parallel"
}

HNSW Vector Search

High-performance approximate nearest neighbor search:

// Configure HNSW index
const HNSW_CONFIG = {
  m: 16,                    // Max connections per layer
  efConstruction: 128,      // Build-time search depth
  efSearch: 64,            // Query-time search depth
  distanceType: 'cosine',  // Cosine similarity
};

CPU/GPU Support

Automatic device detection with fallback:

GPU: BGE-Large with fp16 for maximum quality
CPU: MiniLM-L6-v2 fallback if GPU unavailable
Environment variable control: BOOMERANG_USE_GPU, BOOMERANG_DEVICE

Precision Options

Precision	Memory (BGE-Large)	Accuracy	Use Case
`fp32`	~650MB	Highest	Default
`fp16`	~325MB	Near-lossy	Production
`q8`	~162MB	Good	Memory constrained
`q4`	~81MB	Acceptable	Edge devices

Architecture

Dual Use Cases

Super-Memory-TS supports two integration modes:

Mode	Description	Use Case
MCP Server (External)	Runs as standalone MCP server accessible via HTTP	External AI tools, cross-framework sharing
Built-in (Boomerang)	Core modules imported directly into Boomerang	Boomerang plugin operation, zero-overhead

MCP Server Mode (External Users)

Traditional MCP server deployment for external AI assistants:

Standalone Node.js process
MCP protocol over stdio/HTTP
Full tool interface (query_memories, add_memory, search_project, index_project)
Suitable for Claude Desktop, Cursor, other MCP-compatible tools

Built-in Mode (Boomerang Integration)

Direct module integration with Boomerang:

Core modules imported as TypeScript/JS imports
No MCP protocol overhead
Automatic startup and file watching
Direct memory operations for Boomerang agents

Component Overview

┌─────────────────────────────────────────────────────────────────┐
│                        SuperMemoryServer                         │
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────────────────┐│
│  │  MCP Tools  │  │   Memory    │  │   ProjectIndexer        ││
│  │             │  │   System    │  │                         ││
│  │ query_      │  │             │  │  ┌─────────────────────┐ ││
│  │ memories    │  │  ┌────────┐  │  │   FileWatcher      │ ││
│  │             │  │  │Qdrant │  │  │   (chokidar)       │ ││
│  │ add_memory  │──│  │  +     │  │  │ └─────────────────────┘ ││
│  │             │  │  │ HNSW   │  │  │  ┌─────────────────────┐ ││
│  │ search_     │  │  └────────┘  │  │  │   FileChunker       │ ││
│  │ project     │  │             │  │  │   (semantic/sliding) │ ││
│  │             │  │             │  │  └─────────────────────┘ ││
│  │ index_      │  │             │  │                         ││
│  │ project     │  │             │  │                         ││
│  └─────────────┘  └─────────────┘  └─────────────────────────┘│
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│                     ModelManager (Singleton)                     │
│  ┌─────────────────────────────────────────────────────────────┐ │
│  │                  @xenova/transformers                        │ │
│  │  ┌─────────────────────┐    ┌─────────────────────────────┐ │ │
│  │  │   BGE-Large         │    │    MiniLM-L6-v2             │ │ │
│  │  │   (1024-dim, fp16)  │    │    (384-dim, fp32)         │ │ │
│  │  │   ~325MB            │    │    ~80MB                    │ │ │
│  │  └─────────────────────┘    └─────────────────────────────┘ │ │
│  └─────────────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────────┘

Automatic Indexing

When integrated with Boomerang (built-in mode):

Startup: ProjectIndexer automatically scans and indexes project files
Watching: FileWatcher monitors for changes with 500ms debounce
Incremental: SHA-256 hash comparison skips unchanged files
Background: All indexing runs in background without blocking agent operations

Data Flow

Memory Storage Flow

add_memory tool
      │
      ▼
┌──────────────┐     ┌─────────────────┐     ┌─────────────┐
│ Input Text   │────▶│ Generate        │────▶│ Qdrant      │
│              │     │ Embedding        │     │ + HNSW Index│
└──────────────┘     │ (ModelManager)  │     └─────────────┘
                     └─────────────────┘

Query Flow

query_memories tool
      │
      ▼
┌──────────────┐     ┌─────────────────┐     ┌─────────────┐
│ Query Text  │────▶│ Generate Query  │────▶│ HNSW Search │
│              │     │ Embedding       │     │ (Qdrant)    │
└──────────────┘     │ (ModelManager)  │     └─────────────┘
                     └─────────────────┘            │
                                                   ▼
                                          ┌─────────────────┐
                                          │ Return Top-K   │
                                          │ Results         │
                                          └─────────────────┘

Project Indexing Flow

FileWatcher (chokidar)
      │
      ├── add/change ──▶ processFile()
      │                      │
      │                      ▼
      │               ┌─────────────────┐
      │               │ Semantic        │
      │               │ Chunking         │
      │               └─────────────────┘
      │                      │
      │                      ▼
      │               ┌─────────────────┐
      │               │ Generate        │
      │               │ Embeddings       │
      │               └─────────────────┘
      │                      │
      │                      ▼
      │               ┌─────────────────┐
      └── unlink ────▶│ Remove from     │
                      │ Database        │
                      └─────────────────┘

Model Layer (`src/model/`)

ModelManager - Singleton pattern with reference counting:

// Get instance (creates if necessary)
const manager = ModelManager.getInstance();

// Acquire model (loads if not already)
await manager.acquire();

// Generate embeddings
const extractor = manager.getExtractor();
const embedding = await extractor(text, { pooling: 'mean', normalize: true });

// Release when done
manager.release();

Models:

BGE-Large (BAAI/bge-large-en-v1.5): 1024-dim, fp16 capable
MiniLM-L6-v2 (sentence-transformers/all-MiniLM-L6-v2): 384-dim, CPU fallback

Memory Storage (`src/memory/`)

Qdrant with HNSW indexing:

// Schema
interface MemoryEntry {
  id: string;              // UUID
  text: string;            // Content
  vector: Float32Array;    // 1024-dim embedding
  sourceType: MemorySourceType;
  sourcePath?: string;
  timestamp: Date;
  contentHash: string;     // SHA-256 for deduplication
  metadataJson?: string;
}

Search Strategies:

TIERED: Hybrid vector + keyword search (default)
VECTOR_ONLY: Pure semantic similarity
TEXT_ONLY: Keyword matching via Fuse.js

Project Indexing (`src/project-index/`)

FileChunker - Hybrid chunking:

Semantic: Splits at function/class boundaries for code files
Sliding Window: Falls back for non-code or ambiguous content

ProjectWatcher - File monitoring:

Uses chokidar for cross-platform file watching
500ms debounce to batch rapid changes
SHA-256 hash comparison for incremental updates

Requirements

Requirement	Version	Notes
Node.js	≥20.0.0	Required for ESM modules
Qdrant	Latest	Vector database (see below)

Qdrant Setup

Super-Memory-TS requires a running Qdrant instance. The easiest way is Docker:

docker run -p 6333:6333 -v $(pwd)/qdrant_storage:/qdrant/storage qdrant/qdrant

Or set a custom Qdrant URL:

export QDRANT_URL=http://your-qdrant-host:6333

Installation

Prerequisites

Requirement	Version	Notes
Node.js	≥20.0.0	Required for ESM modules
npm or bun	Latest	For package management

Optional:

CUDA-capable GPU (for GPU acceleration)

Install Dependencies

# Using npm
npm install

# Using bun (recommended for faster install)
bun install

Build from Source

npm run build

This produces output in dist/ directory.

npx Usage

The package works with npx:

{
  "super-memory-ts": {
    "type": "local",
    "command": ["npx", "-y", "@veedubin/super-memory-ts"],
    "environment": {
      "QDRANT_URL": "http://localhost:6333"
    }
  }
}

Note: Model loading is deferred to the first embedding request for faster startup. To preload the model at startup, set SUPER_MEMORY_EAGER_LOAD=1.

Quick Start

1. Environment Variables (Optional)

Create a .env file or set environment variables:

# Model configuration
BOOMERANG_PRECISION=fp16        # fp32, fp16, q8, q4
BOOMERANG_DEVICE=auto           # auto, gpu, cpu
BOOMERANG_USE_GPU=true          # true, false

# Database
export QDRANT_URL=http://localhost:6333  # Qdrant server URL

# Logging
BOOMERANG_LOG_LEVEL=info        # debug, info, warn, error

# Indexing
BOOMERANG_CHUNK_SIZE=512
BOOMERANG_CHUNK_OVERLAP=50
BOOMERANG_MAX_FILE_SIZE=10485760  # 10MB

2. Start the Server

# Development mode (with watch)
npm run dev

# Production mode
npm run build
npm start

Build Commands

Command	Purpose
`npm run build`	Compile TypeScript to `dist/`
`npm run typecheck`	Type-check without emitting (`tsc --noEmit`)
`npm run prepublishOnly`	Runs `npm run build` before publish
`npm run dev`	Watch mode with `tsx`
`npm run test`	Run tests with `vitest`
`npm run lint`	ESLint on `src/`

3. Basic Usage Example

import { SuperMemoryServer } from './src/index.ts';

async function main() {
  const server = new SuperMemoryServer();
  await server.start();
  console.log('Super-Memory MCP Server running...');
}

main();

4. Configure Boomerang Plugin

In your Boomerang configuration:

{
  "superMemory": {
    "server": "super-memory-ts",
    "enabled": true
  }
}

Configuration

Configuration File

Create super-memory.json in your project root:

{
  "model": {
    "precision": "fp16",
    "device": "auto",
    "useGpu": false,
    "embeddingDim": 1024,
    "batchSize": 32
  },
  "database": {
    "qdrantUrl": "http://localhost:6333",
    "tableName": "memories"
  },
  "indexer": {
    "chunkSize": 512,
    "chunkOverlap": 50,
    "maxFileSize": 10485760,
    "excludePatterns": [
      "node_modules",
      ".git",
      "dist",
      "*.log"
    ]
  },
  "logging": {
    "level": "info"
  }
}

Configuration Priority

Settings are merged in the following order (highest to lowest):

Environment variables
JSON config file (super-memory.json)
Default values

Environment Variables Reference

Variable	Default	Description
`BOOMERANG_PRECISION`	`fp32`	Model precision: `fp32`, `fp16`, `q8`, `q4`
`BOOMERANG_DEVICE`	`auto`	Compute device: `auto`, `gpu`, `cpu`
`BOOMERANG_USE_GPU`	`false`	Enable GPU usage
`QDRANT_URL`	`http://localhost:6333`	Qdrant server URL
`BOOMERANG_LOG_LEVEL`	`info`	Log level: `debug`, `info`, `warn`, `error`
`BOOMERANG_CHUNK_SIZE`	`512`	Token chunk size for indexing
`BOOMERANG_CHUNK_OVERLAP`	`50`	Overlap between chunks
`BOOMERANG_MAX_FILE_SIZE`	`10485760`	Max file size (bytes) to index
`BOOMERANG_PROJECT_ID`	-	Project identifier for memory isolation
`BOOMERANG_SEARCH_STRATEGY`	`tiered`	Default search strategy: `tiered`, `parallel`
`QUERY_COLLECTIONS`	`tableName`	Comma-separated list of collections to search (for multi-collection RRF)

Multi-Collection Search with RRF

When you need to search across multiple Qdrant collections (e.g., during model migration from MiniLM 384-dim to BGE-Large 1024-dim), use QUERY_COLLECTIONS:

# Search both old (384-dim) and new (1024-dim) collections
export QUERY_COLLECTIONS="memories,memories_bge_fp16"

# Or in JSON config
{
  "database": {
    "qdrantUrl": "http://localhost:6333",
    "tableName": "memories",
    "queryCollections": ["memories", "memories_bge_fp16"]
  }
}

How it works:

Write target: tableName (memories) remains the target for all writes
Search target: queryCollections defines which collections to search
RRF merge: Results from all collections are merged using Reciprocal Rank Fusion (k=60)
Dimension validation: Collections with mismatched dimensions are skipped with a warning
Fallback: If any collection fails, search continues with remaining collections

API Reference

MCP Tools

The server provides five MCP tools:

`query_memories`

Semantic search over stored memories.

Arguments:

Parameter	Type	Required	Default	Description
`query`	string	Yes	-	Search query text
`limit`	number	No	10	Max results to return
`strategy`	string	No	`tiered`	Search strategy: `tiered`, `vector_only`, `text_only`

Example:

{
  "query": "What was discussed about authentication?",
  "limit": 5,
  "strategy": "tiered"
}

Response:

{
  "count": 2,
  "memories": [
    {
      "id": "550e8400-e29b-41d4-a716-446655440000",
      "content": "User mentioned OAuth2 integration needed",
      "sourceType": "session",
      "sourcePath": null,
      "timestamp": "2026-04-23T10:30:00Z"
    }
  ]
}

`add_memory`

Store a new memory entry.

Arguments:

Parameter	Type	Required	Default	Description
`content`	string	Yes	-	Memory content to store
`sourceType`	string	No	`manual`	Source type: `manual`, `file`, `conversation`, `web`
`sourcePath`	string	No	-	Source URL or file path
`metadata`	object	No	-	Additional metadata

Example:

{
  "content": "Remember that the user prefers TypeScript over JavaScript",
  "sourceType": "conversation",
  "sourcePath": "/session/123",
  "metadata": {
    "userId": "user-456",
    "importance": "high"
  }
}

Response:

{
  "success": true,
  "id": "550e8400-e29b-41d4-a716-446655440001",
  "message": "Memory added successfully"
}

Note: Duplicate content (same SHA-256 hash) is rejected with duplicate: true.

`search_project`

Search indexed project files.

Arguments:

Parameter	Type	Required	Default	Description
`query`	string	Yes	-	Search query
`topK`	number	No	20	Max results
`fileTypes`	string[]	No	-	Filter by extensions (e.g., `["ts", "js"]`)
`paths`	string[]	No	-	Filter by directory paths

Example:

{
  "query": "authentication middleware",
  "topK": 10,
  "fileTypes": ["ts", "tsx"],
  "paths": ["src/api", "src/middleware"]
}

Response:

{
  "count": 3,
  "chunks": [
    {
      "filePath": "src/middleware/auth.ts",
      "content": "export async function authMiddleware(req, res, next) { ... }",
      "lineStart": 15,
      "lineEnd": 42,
      "score": 0.89
    }
  ]
}

`index_project`

Trigger project indexing.

Arguments:

Parameter	Type	Required	Default	Description
`path`	string	No	cwd	Directory to index
`force`	boolean	No	`false`	Force re-index all files

Example:

{
  "path": "/home/user/project",
  "force": true
}

Response:

{
  "success": true,
  "message": "Indexing completed",
  "stats": {
    "totalFiles": 150,
    "indexedFiles": 148,
    "failedFiles": 2,
    "totalChunks": 1247,
    "lastIndexing": "2026-04-23T10:35:00Z"
  }
}

`get_file_contents`

Reconstruct file contents from indexed chunks.

Arguments:

Parameter	Type	Required	Default	Description
`filePath`	string	Yes	-	Path to the file to reconstruct
`triggerIndex`	boolean	No	`false`	If true and file not found, trigger indexing

Example:

{
  "filePath": "src/server.ts",
  "triggerIndex": false
}

Response:

{
  "success": true,
  "filePath": "src/server.ts",
  "content": "import { McpServer } from '@modelcontextprotocol/sdk/server/mcp.js';\n...",
  "chunks": [
    { "chunkIndex": 0, "lineStart": 1, "lineEnd": 25 },
    { "chunkIndex": 1, "lineStart": 26, "lineEnd": 50 }
  ],
  "lineCount": 850,
  "indexedAt": "2026-04-28T10:35:00Z",
  "truncated": false
}

Notes:

Content is reconstructed by concatenating indexed chunks in order
Max content size: 100KB (returns truncated: true if exceeded)
Pauses/resumes indexer during read operations

Configuration File Format

The super-memory.json configuration file:

{
  "model": {
    "precision": "fp16",
    "device": "auto",
    "useGpu": false,
    "embeddingDim": 1024,
    "batchSize": 32
  },
  "database": {
    "qdrantUrl": "http://localhost:6333",
    "tableName": "memories"
  },
  "indexer": {
    "chunkSize": 512,
    "chunkOverlap": 50,
    "maxFileSize": 10485760,
    "excludePatterns": [
      "**/node_modules/**",
      "**/.git/**",
      "**/dist/**",
      "**/*.log",
      "**/.cache/**"
    ]
  },
  "logging": {
    "level": "info"
  }
}

Memory Source Types

Type	Description
`session`	Session-scoped memory (default)
`file`	Imported from a file
`web`	Scraped from web content
`boomerang`	Generated by Boomerang plugin
`project`	Indexed from project files

Performance

Memory Usage

Configuration	Model Memory	Total Memory	Notes
1 instance, BGE-Large fp16	~325MB	~500MB	Default single-user
3 instances, BGE-Large fp16	~975MB	~1.2GB	Shared via singleton
3 instances, BGE-Large fp32	~1.95GB	~2.2GB	High accuracy
CPU fallback, MiniLM fp32	~80MB	~200MB	CPU-only systems

Query Latency Targets

Operation	Target	Notes
Semantic query (HNSW)	<10ms p50	With warm cache
Embedding generation	<100ms	BGE-Large single text
Batch embedding	<50ms/text	Batch of 8
Project search	<50ms	With indexed project

Benchmarks

Test Environment:

CPU: AMD Ryzen 9 5950X
RAM: 64GB DDR4
GPU: NVIDIA RTX 3090 (24GB)

Single Query Latency (p50):

Strategy: TIERED
├── Embedding generation: 45ms
├── HNSW search (top 10): 3ms
└── Total: ~48ms

Throughput:

Operation	Throughput
Memory add (with embedding)	~20/sec
Memory query	~100/sec
Project file indexing	~100 files/min

Security

Accepted Vulnerabilities (with monitoring schedule):

Package	Vulnerability	Status	Notes
`uuid`	CVE-2024-21539	Accepted	Monitoring for patch updates
`@modelcontextprotocol/sdk`	Moderate	Accepted	Used for MCP protocol; monitoring
`protobufjs`	Moderate	Accepted	Used for serialization; monitoring

Monitoring: Run npm audit monthly to check for new vulnerabilities. Update packages when security patches become available without breaking changes.

Architecture Decisions

Why TypeScript?

Type Safety: Catch errors at compile time
ESM Modules: Native support in Node.js 20+
MCP SDK: Official TypeScript SDK available
Bundle Size: Lighter than Python runtime

Search Strategy Selection

Scenario	Recommended Strategy
Interactive chat, speed critical	`tiered` (Fast Reply)
Research, debugging, thorough recall	`parallel` (Archivist)
Exact keyword matching	`text_only`
Pure semantic similarity	`vector_only`

Singleton Model Manager

Prevents VRAM duplication when multiple components need embeddings:

// Instead of creating new models
const extractor = await pipeline('feature-extraction', 'bge-large'); // Bad

// Use singleton
const manager = ModelManager.getInstance();
await manager.acquire();  // Loads once, shares across users

Qdrant over Alternatives

Database	Pros	Cons
Qdrant	REST API, payload filtering, HNSW, open source	Requires separate process
LanceDB	Embedded, Arrow format	TypeScript support was immature
Chroma	Simple, local	Less mature
Pinecone	Managed, scalable	Requires API key

Hybrid Chunking

// Semantic boundaries detected:
function handleAuth() { ... }    // Split here
class UserService { ... }        // And here
const config = { ... };          // Continue until max size

// Fallback: sliding window for ambiguous content

Development

Project Structure

Super-Memory-TS/
├── src/
│   ├── index.ts              # Entry point
│   ├── server.ts             # MCP server implementation
│   ├── config.ts             # Configuration management
│   ├── model/
│   │   ├── index.ts          # ModelManager singleton
│   │   ├── embeddings.ts     # Embedding generation
│   │   └── types.ts          # Model type definitions
│   ├── memory/
│   │   ├── index.ts          # MemorySystem facade
│   │   ├── database.ts       # Qdrant operations
│   │   ├── schema.ts         # Memory schema
│   │   └── search.ts         # Search strategies
│   ├── project-index/
│   │   ├── indexer.ts        # ProjectIndexer class
│   │   ├── chunker.ts        # FileChunker (semantic/sliding)
│   │   ├── watcher.ts        # FileWatcher (chokidar)
│   │   └── types.ts          # Indexer types
│   └── utils/
│       ├── logger.ts         # Logging utility
│       ├── hash.ts           # SHA-256 hashing
│       └── errors.ts         # Custom error types
├── tests/
│   ├── server.test.ts        # Server unit tests
│   ├── index.test.ts         # Integration tests
│   ├── project-index.test.ts # Indexer tests
│   └── test-project/         # Sample project for testing
├── package.json
├── tsconfig.json
└── README.md

Building from Source

# Install dependencies
npm install

# Type check
npx tsc --noEmit

# Build
npm run build

# Output in dist/

Running Tests

# Run all tests
npx vitest run

# Run with coverage
npx vitest run --coverage

# Run specific test file
npx vitest run tests/server.test.ts

Contributing

Fork the repository
Create a feature branch: git checkout -b feature/my-feature
Make changes with tests
Run linting: npm run lint
Commit with conventional messages
Push and create PR

Error Handling

All errors extend MemoryError:

// Throwing errors
throw new MemoryError('Query failed', 'QUERY_FAILED');

// Error codes
VALIDATION_ERROR  // Invalid input
QUERY_FAILED      // Search failed
ADD_FAILED        // Memory add failed
INDEX_NOT_INITIALIZED  // Indexer not ready
INTERNAL_ERROR    // Unexpected errors

License

MIT License - see project repository for details.

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Veedubin/Super-Memory-TS'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

Status	Replacement
Memory backend	`memini-ai-dev` — successor written in Python with PostgreSQL + pgvector. Adds trust scoring, knowledge graph, tiered loading, and thought chains. Source at `~/Projects/MCP-Servers/memini-ai-dev/`.
Orchestration plugin (consumer)	`@veedubin/boomerang-v3` — successor orchestration plugin that consumes `memini-ai-dev` via the `memini-ai-dev` MCP server. Source at `~/Projects/MCP-Servers/boomerang-v3/`.

⚠️ NO LONGER IN ACTIVE DEVELOPMENT

Super-Memory-TS

🤝 Companion Package

Table of Contents

Overview

What is Super-Memory v2?

Key Features

Features

Project Isolation

How It Works

Use Cases

Example

Semantic Memory Search

Automatic Project Indexing

Custom Path Indexing

Tiered Memory Architecture

Fast Reply (TIERED)

Archivist (PARALLEL)

Configuration

HNSW Vector Search

CPU/GPU Support

Precision Options

Architecture

Dual Use Cases

MCP Server Mode (External Users)

Built-in Mode (Boomerang Integration)

Component Overview

Automatic Indexing

Data Flow

Memory Storage Flow

Query Flow

Project Indexing Flow

Model Layer (src/model/)

Memory Storage (src/memory/)

Project Indexing (src/project-index/)

Requirements

Qdrant Setup

Installation

Prerequisites

Install Dependencies

Build from Source

npx Usage

Quick Start

1. Environment Variables (Optional)

2. Start the Server

Build Commands

3. Basic Usage Example

4. Configure Boomerang Plugin

Configuration

Configuration File

Configuration Priority

Environment Variables Reference

Multi-Collection Search with RRF

API Reference

MCP Tools

query_memories

add_memory

search_project

index_project

get_file_contents

Configuration File Format

Memory Source Types

Performance

Memory Usage

Query Latency Targets

Benchmarks

Security

Architecture Decisions

Why TypeScript?

Search Strategy Selection

Singleton Model Manager

Qdrant over Alternatives

Hybrid Chunking

Development

Project Structure

Building from Source

Running Tests

Contributing

Error Handling

License

Model Layer (`src/model/`)

Memory Storage (`src/memory/`)

Project Indexing (`src/project-index/`)

`query_memories`

`add_memory`

`search_project`

`index_project`

`get_file_contents`