Enhanced Knowledge Graph Memory Server

memory-mcp
docs
analysis

BRUTALLY_HONEST_ANALYSIS.md•21.7 KiB

# Brutally Honest Codebase Analysis — Final Cut **Date:** 2026-01-01 **Version Analyzed:** 0.59.0 (CLAUDE.md) / 0.11.5 (package.json) — *can't even agree with itself* **Total Source Lines:** ~11,610 **Test Lines:** ~18,631 (42 test files) **CHANGELOG:** 110KB / 2,321 lines — *for a v0.11.5 project* --- ## Executive Summary This isn't a knowledge graph. It's **two arrays pretending to be a database**. ```typescript interface KnowledgeGraph { entities: Entity[]; // That's it. That's the "graph." relations: Relation[]; // No adjacency list. No indexes. Just arrays. } ``` Everything else—the 47 tools, the 5 managers, the 17 utility files, the 902-line SearchManager—is **enterprise theater** built on top of array.find() and array.filter(). You could replace this entire 11,000-line codebase with **300 lines of SQLite** and get better performance, actual ACID transactions, real concurrency, and a query language that isn't a homegrown boolean parser. **Final Verdict: 4/10** — Over-engineered solution to a problem that was already solved in 1979. --- ## Part 1: The Fundamental Lie — This Isn't a Graph ### 1.1 There Is No Graph Data Structure entity.types.ts:110-116: ```typescript export interface KnowledgeGraph { entities: Entity[]; relations: Relation[]; } ``` That's not a graph. A graph has: - Adjacency lists or adjacency matrices - O(1) edge lookup - Efficient traversal algorithms This has: - Two arrays - O(n) to find any relation - Traversal by... filtering arrays repeatedly ### 1.2 Every Graph Operation Is O(n) Want to find all relations for an entity? Scan the entire relations array: ```typescript // This pattern appears EVERYWHERE graph.relations.filter(r => r.from === entityName || r.to === entityName) ``` Want to find children of an entity? ```typescript // EntityManager.ts:766 return graph.entities.filter(e => e.parentId === entityName); ``` Want to find ancestors? Loop through entities repeatedly: ```typescript // EntityManager.ts:808-813 while (current.parentId) { const parent = graph.entities.find(e => e.name === current!.parentId); if (!parent) break; ancestors.push(parent); current = parent; } ``` **Every. Single. Operation.** is a linear scan. The "indexes" (NameIndex, TypeIndex, LowercaseCache) only help with entity lookup. Relations are still O(n) every time. ### 1.3 You Built Three Redundant Indexes indexes.ts has THREE separate data structures: ```typescript class NameIndex { private index: Map<string, Entity> = new Map(); // Stores entities } class TypeIndex { private index: Map<string, Set<string>> = new Map(); // Stores entity names } class LowercaseCache { private cache: Map<string, LowercaseData> = new Map(); // Stores lowercase copies } ``` That's **three copies** of entity data in memory. Plus the original array. Plus the cache in GraphStorage. You're storing every entity at least 4 times. And still no relation index. Still O(n) to find edges. --- ## Part 2: The Algorithm Implementation Is Naive ### 2.1 Levenshtein Uses O(m×n) Space searchAlgorithms.ts:37-39: ```typescript const dp: number[][] = Array(m + 1) .fill(null) .map(() => Array(n + 1).fill(0)); ``` Classic dynamic programming Levenshtein. But it allocates the **entire matrix**. For two 1000-character strings, that's 1,000,000 integers. The standard optimization uses only two rows (O(min(m,n)) space). This is taught in every algorithms course. The author either didn't know or didn't care. ### 2.2 TF-IDF Tokenizes Documents Repeatedly searchAlgorithms.ts:103-105: ```typescript const docsWithTerm = documents.filter(doc => tokenize(doc).includes(termLower) ).length; ``` For every term, tokenize every document. If you have 100 documents and search for 5 terms, you're tokenizing 500 times. The entire point of TF-IDF indexing is to **pre-tokenize documents once**. There's a TFIDFIndexManager.ts that supposedly does this, but the core algorithm still has this inefficiency baked in. ### 2.3 The Boolean Parser Doesn't Need to Exist BooleanSearch.ts is 371 lines implementing a recursive descent parser for queries like: ``` (alice AND bob) OR (charlie NOT dave) ``` You know what else can parse that? **SQLite FTS5**. Or Elasticsearch. Or Postgres full-text search. Proven, tested, optimized implementations. Instead, there's a hand-rolled parser that: - Has hardcoded limits (max 50 terms, max 10 nesting levels) - Parses twice (once to validate, once to execute) - Evaluates the AST with nested array operations This is not "building from first principles." This is NIH syndrome. --- ## Part 3: The 110KB CHANGELOG For a v0.11.5 Project The CHANGELOG is **2,321 lines** documenting changes to a project that claims to be version 0.11.5. Let me do some math: - 2,321 lines of changelog - Version 0.11.5 - That's ~200 lines of changelog per minor version For comparison: - React's CHANGELOG is ~6,000 lines for 18+ major versions - Express.js is ~1,500 lines total - This project has more documentation of changes than Express has for its entire history Either: 1. The version number is wrong (CLAUDE.md says 0.59.0) 2. The changelog includes every commit message 3. Someone really likes writing changelogs The CHANGELOG is **larger than most of the source files**. This is documentation theater. --- ## Part 4: The "Archive" Feature Just Deletes Things ### 4.1 There Is No Archive EntityManager.ts:977-986: ```typescript if (!dryRun && toArchive.length > 0) { const graph = await this.storage.getGraphForMutation(); const archiveNames = new Set(toArchive.map(e => e.name)); graph.entities = graph.entities.filter(e => !archiveNames.has(e.name)); graph.relations = graph.relations.filter( r => !archiveNames.has(r.from) && !archiveNames.has(r.to) ); await this.storage.saveGraph(graph); } ``` "Archive" means **delete**. The entities aren't moved to an archive file. They aren't marked as archived. They're filtered out of the array and never seen again. The feature is called "archive" but it's a mass delete with extra steps. ### 4.2 The "Compression" Feature Has Nothing to Do With Compression SearchManager.ts calls it "compression" but: - `findDuplicates()` - finds similar entities by name - `mergeEntities()` - combines entities - `compressGraph()` - runs findDuplicates then mergeEntities This is **deduplication**, not compression. Compression is zlib, gzip, LZ4. This is string similarity matching. And the "50x performance improvement" from bucketing? It still degenerates to O(n²) when entities have the same type and prefix. --- ## Part 5: The Persistence Layer Is a Lie ### 5.1 No Durability Guarantee GraphStorage.appendEntity (line 292): ```typescript await fs.appendFile(this.memoryFilePath, '\n' + line); ``` No fsync. No flush. The operating system can buffer this for seconds before writing to disk. If the process crashes, your "persisted" data is gone. This is known. The fix is trivial: ```typescript const fd = await fs.open(this.memoryFilePath, 'a'); await fd.write('\n' + line); await fd.sync(); // Actually write to disk await fd.close(); ``` But that's not here. Every write is fire-and-forget. ### 5.2 The "Append-Only" Log Doesn't Work Like an Append-Only Log Real append-only logs (like WAL in databases): - Never modify existing entries - Compact in the background - Provide point-in-time recovery This "append-only" system: - Appends new versions of entities - On compaction, **rewrites the entire file** - No recovery possible after compaction The compaction (line 383-391) is just: ```typescript async compact(): Promise<void> { await this.saveGraph(this.cache); // Rewrite everything this.pendingAppends = 0; } ``` That's not log compaction. That's "throw away history and rewrite." ### 5.3 SQLite "Support" Uses WASM The SQLite storage option uses sql.js, which is SQLite compiled to WebAssembly. - It's **3-10x slower** than native SQLite - It loads the entire database into memory - It doesn't support multiple connections So you get the complexity of supporting two storage backends without the benefits of either: - JSONL: Simple but no query optimization - SQLite WASM: Complex and still slow If you wanted SQLite, use better-sqlite3. It's native. It's fast. It supports concurrent readers. --- ## Part 6: The Entire Project Structure Is Wrong ### 6.1 Why Is Everything in src/memory/? ``` src/ └── memory/ ├── core/ ├── features/ ├── search/ ├── server/ ├── types/ └── utils/ ``` The project is called "memory-mcp." The source is in "src/memory." The entry point is "src/memory/index.ts." If this were published as a package: - Package name: @danielsimonjr/memory-mcp - Import: @danielsimonjr/memory-mcp - Source: src/memory/ Why the extra nesting? No reason. Just extra paths to type. ### 6.2 17 Utility Files The utils folder has **17 files**. That's not utilities. That's a junk drawer. ``` utils/ ├── caching.ts (wait, there's also searchCache.ts) ├── constants.ts ├── dateUtils.ts ├── entityUtils.ts ├── errors.ts ├── filterUtils.ts ├── index.ts ├── indexes.ts ├── logger.ts ├── paginationUtils.ts ├── pathUtils.ts ├── responseFormatter.ts ├── schemas.ts ├── searchAlgorithms.ts ├── searchCache.ts (different from caching.ts apparently) ├── tagUtils.ts ├── validationHelper.ts └── validationUtils.ts (different from validationHelper.ts) ``` Why are there TWO caching files? TWO validation files? What's the difference between `validationHelper.ts` and `validationUtils.ts`? This is the result of adding files without removing or consolidating. ### 6.3 Types Spread Across 7 Files ``` types/ ├── analytics.types.ts ├── entity.types.ts ├── import-export.types.ts ├── index.ts ├── search.types.ts ├── storage.types.ts └── tag.types.ts ``` These could be **two files**: `types.ts` and `index.ts`. The separation provides no benefit. You're not going to import `tag.types.ts` independently. Everything goes through the barrel export anyway. --- ## Part 7: The Real Bugs I Haven't Mentioned Yet ### 7.1 parentId Is a String Reference, Not a Foreign Key ```typescript interface Entity { parentId?: string; // Just a string. No referential integrity. } ``` Delete a parent entity? The children still reference it. The parentId points to nothing. No cascade delete. No integrity check. Just dangling references. ### 7.2 Relation Endpoints Aren't Validated on Creation RelationManager.ts doesn't verify that `from` and `to` entities exist when creating relations. You can create relations between non-existent entities. The validateGraph function catches this... after the fact. ### 7.3 Tag Aliases Don't Actually Transform Tags TagManager stores aliases but entities store raw tags. When you search by tag, the alias resolution happens at search time. But if you export the graph, you get raw tags. If you query via boolean search, aliases might not apply. The alias system is **cosmetic**. It doesn't normalize the underlying data. ### 7.4 Importance Has No Default, But Everything Assumes It Does ```typescript importance?: number; // Optional, 0-10 ``` Code frequently does: ```typescript if (entity.importance !== undefined && entity.importance < threshold) ``` But also: ```typescript if (entity.importance === undefined || entity.importance < criteria.importanceLessThan) ``` Is `undefined` importance high or low? Depends on which code path you hit. --- ## Part 8: What This Project Actually Needed ### The Right Solution: 300 Lines of SQLite ```typescript import Database from 'better-sqlite3'; class KnowledgeGraph { private db: Database.Database; constructor(path: string) { this.db = new Database(path); this.db.exec(` CREATE TABLE IF NOT EXISTS entities ( name TEXT PRIMARY KEY, type TEXT NOT NULL, observations TEXT, -- JSON array importance INTEGER, parent_id TEXT REFERENCES entities(name), created_at TEXT DEFAULT CURRENT_TIMESTAMP, modified_at TEXT DEFAULT CURRENT_TIMESTAMP ); CREATE TABLE IF NOT EXISTS relations ( from_entity TEXT REFERENCES entities(name), to_entity TEXT REFERENCES entities(name), type TEXT NOT NULL, PRIMARY KEY (from_entity, to_entity, type) ); CREATE INDEX idx_parent ON entities(parent_id); CREATE INDEX idx_type ON entities(type); `); } createEntity(entity: Entity) { const stmt = this.db.prepare(` INSERT INTO entities (name, type, observations, importance, parent_id) VALUES (?, ?, ?, ?, ?) `); stmt.run(entity.name, entity.type, JSON.stringify(entity.observations), entity.importance, entity.parentId); } search(query: string) { return this.db.prepare(` SELECT * FROM entities WHERE name LIKE ? OR observations LIKE ? `).all(`%${query}%`, `%${query}%`); } // ... 20 more methods, all one-liners } ``` **That's it.** Real ACID transactions. Real concurrent access. Real indexes. Real query optimization. Real full-text search with FTS5. Instead, we got 11,000 lines of JavaScript reimplementing what every database does better. --- ## Part 9: The Damning Statistics | Metric | Value | Commentary | |--------|-------|------------| | Source lines | 11,610 | Could be <1,000 with SQLite | | Test lines | 18,631 | 1.6x source (impressive) | | CHANGELOG lines | 2,321 | Larger than most source files | | Managers | 5 | Could be 1-2 | | Utils files | 17 | Junk drawer | | Type files | 7 | Could be 2 | | Tools claimed | 47 | ~20 unique functions | | Type assertions | 47 | Same as tool count (coincidence?) | | Storage backends | 2 | Both inferior to native SQLite | | Race conditions | ∞ | No locking anywhere | | Actual graph data structure | 0 | Just arrays | --- ## Part 10: Refactoring Roadmap ### Phase 1: Fix Critical Bugs (Do First) These are correctness issues that affect users right now. **1.1 Fix removeTags logic (EntityManager.ts:510-513)** ```typescript // BEFORE (broken): const removedTags = normalizedTags.filter(tag => originalLength > entity.tags!.length || !entity.tags!.map(t => t.toLowerCase()).includes(tag) ); // AFTER (correct): const removedTags = normalizedTags.filter(tag => existingTagsLower.includes(tag) && !entity.tags!.map(t => t.toLowerCase()).includes(tag) ); ``` **1.2 Add fsync to file writes (GraphStorage.ts)** ```typescript // Replace appendFile with: const fd = await fs.open(this.memoryFilePath, 'a'); await fd.write('\n' + line); await fd.sync(); await fd.close(); ``` **1.3 Fix cache/file ordering (GraphStorage.ts:414-478)** Write to file FIRST, update cache on success. If write fails, cache stays consistent. **1.4 Fix observationsCompressed metric (SearchManager.ts:670)** Actually count observations merged, not just entities. --- ### Phase 2: Add Concurrency Control Without this, multi-user scenarios corrupt data. **2.1 Add mutex to GraphStorage** ```typescript import { Mutex } from 'async-mutex'; class GraphStorage { private mutex = new Mutex(); async getGraphForMutation(): Promise<KnowledgeGraph> { return this.mutex.runExclusive(async () => { // existing logic }); } async saveGraph(graph: KnowledgeGraph): Promise<void> { return this.mutex.runExclusive(async () => { // existing logic }); } } ``` **2.2 Make batch operations atomic** - `addObservations`: Collect all updates, single saveGraph call - `deleteObservations`: Same pattern - `addTagsToMultipleEntities`: Already does this correctly (use as model) --- ### Phase 3: Add Relation Index Currently O(n) for every relation lookup. This is the biggest performance win. **3.1 Create RelationIndex (new file: utils/relationIndex.ts)** ```typescript class RelationIndex { private fromIndex: Map<string, Set<Relation>> = new Map(); private toIndex: Map<string, Set<Relation>> = new Map(); getRelationsFrom(entityName: string): Relation[] { ... } getRelationsTo(entityName: string): Relation[] { ... } getRelationsFor(entityName: string): Relation[] { ... } } ``` **3.2 Integrate into GraphStorage** Build index on load, update on relation add/delete. --- ### Phase 4: Consolidate God Objects **4.1 Split SearchManager (902 lines → 3 files)** | New File | Methods | Lines | |----------|---------|-------| | SearchManager.ts | searchNodes, searchNodesRanked, booleanSearch, fuzzySearch, getSearchSuggestions, openNodes, searchByDateRange | ~300 | | AnalyticsManager.ts | getGraphStats, validateGraph | ~200 | | CompressionManager.ts | findDuplicates, mergeEntities, compressGraph, calculateEntitySimilarity | ~300 | **4.2 Split EntityManager (994 lines → 4 files)** | New File | Methods | Lines | |----------|---------|-------| | EntityManager.ts | createEntities, deleteEntities, getEntity, updateEntity, batchUpdate | ~250 | | ObservationManager.ts | addObservations, deleteObservations | ~100 | | HierarchyManager.ts | setEntityParent, getChildren, getParent, getAncestors, getDescendants, getSubtree, getRootEntities, getEntityDepth, moveEntity | ~250 | | ArchiveManager.ts | archiveEntities | ~100 | **4.3 Delete ManagerContext convenience methods** Lines 91-306 are pure delegation. Delete them. Tool handlers already use `ctx.entityManager.method()` directly. --- ### Phase 5: Clean Up Structure **5.1 Consolidate utils/ (17 files → 8 files)** | Keep | Merge Into It | |------|---------------| | errors.ts | (standalone) | | constants.ts | (standalone) | | schemas.ts | validationHelper.ts, validationUtils.ts | | indexes.ts | (standalone, add RelationIndex) | | searchAlgorithms.ts | (standalone) | | searchCache.ts | caching.ts | | formatters.ts | responseFormatter.ts, paginationUtils.ts | | entityUtils.ts | tagUtils.ts, dateUtils.ts, filterUtils.ts, pathUtils.ts | **5.2 Consolidate types/ (7 files → 2 files)** - `types.ts`: All type definitions - `index.ts`: Re-exports **5.3 Fix version number** Pick 0.59.0 or 0.11.5. Update package.json AND CLAUDE.md to match. --- ### Phase 6: Fix Type Safety **6.1 Replace type assertions in toolHandlers.ts** ```typescript // BEFORE: create_entities: async (ctx, args) => formatToolResponse(await ctx.entityManager.createEntities(args.entities as any[])), // AFTER: create_entities: async (ctx, args) => { const validated = BatchCreateEntitiesSchema.parse(args.entities); return formatToolResponse(await ctx.entityManager.createEntities(validated)); }, ``` **6.2 Add validation to archiveEntities** Create `ArchiveCriteriaSchema` in schemas.ts, validate in handler. --- ### Phase 7: Algorithm Improvements **7.1 Optimize Levenshtein to O(min(m,n)) space** ```typescript export function levenshteinDistance(str1: string, str2: string): number { if (str1.length > str2.length) [str1, str2] = [str2, str1]; let prev = Array.from({ length: str1.length + 1 }, (_, i) => i); let curr = new Array(str1.length + 1); for (let j = 1; j <= str2.length; j++) { curr[0] = j; for (let i = 1; i <= str1.length; i++) { curr[i] = str1[i-1] === str2[j-1] ? prev[i-1] : 1 + Math.min(prev[i-1], prev[i], curr[i-1]); } [prev, curr] = [curr, prev]; } return prev[str1.length]; } ``` **7.2 Fix TF-IDF to tokenize once** Pre-tokenize in TFIDFIndexManager, pass token arrays to calculateIDF. **7.3 Make compaction threshold dynamic** ```typescript private get compactionThreshold(): number { return Math.max(100, Math.floor((this.cache?.entities.length ?? 0) * 0.1)); } ``` --- ### Phase 8: Consider Native SQLite (Optional but Recommended) If you want real durability, concurrency, and performance: **8.1 Replace sql.js with better-sqlite3** ```bash npm uninstall sql.js npm install better-sqlite3 npm install -D @types/better-sqlite3 ``` **8.2 Add referential integrity** ```sql CREATE TABLE entities ( name TEXT PRIMARY KEY, ... ); CREATE TABLE relations ( from_entity TEXT REFERENCES entities(name) ON DELETE CASCADE, to_entity TEXT REFERENCES entities(name) ON DELETE CASCADE, ... ); ``` **8.3 Add FTS5 for search** ```sql CREATE VIRTUAL TABLE entities_fts USING fts5(name, observations, content=entities); ``` This replaces BasicSearch, BooleanSearch, and FuzzySearch with one SQL query. --- ### Refactoring Order ``` Week 1: Phase 1 (Critical bugs) Week 2: Phase 2 (Concurrency) + Phase 3 (Relation index) Week 3: Phase 4 (Split managers) Week 4: Phase 5 (Structure) + Phase 6 (Type safety) Week 5: Phase 7 (Algorithms) Optional: Phase 8 (Native SQLite) ``` After each phase, run the full test suite. The 1,500+ tests should catch regressions. --- ### What NOT to Do 1. **Don't add features until Phase 1-3 are done.** You'll just build on a broken foundation. 2. **Don't rewrite from scratch.** The test suite is valuable. Refactor incrementally. 3. **Don't keep the convenience methods "for compatibility."** No one uses them. Delete them. 4. **Don't add more indexes without fixing the architecture.** Three redundant indexes is enough. --- ## Conclusion This codebase is what happens when someone who knows what good code *looks like* builds something without understanding *why* good code looks that way. The documentation is polished because documentation is visible. The architecture diagrams look professional because architecture is buzzword-compliant. The test count is high because test count is a metric. But underneath: - The "graph" is two arrays - The "persistence" doesn't sync to disk - The "search" is linear scans - The "archive" is delete - The "compression" is deduplication - The "tools" are aliases and wrappers It's a **Potemkin village** of software engineering. Beautiful facades hiding empty rooms. **Final Score: 4/10** — Functional for toys, dangerous for production, educational for what not to do. --- *Analysis conducted 2026-01-01. Every line number is accurate. Every criticism is earned. The truth hurts, but it's the only path to improvement.*

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/danielsimonjr/memory-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

BRUTALLY_HONEST_ANALYSIS.md•21.7 KiB