Markdown RAG Documentation

Overview Schema Related Servers Score Discussions

memory-graph-enhancement-analysis.md•22.1 KiB

# Memory Graph Enhancement Analysis **Document Type**: Architectural Analysis **Status**: Draft **Author**: AI Planner **Date**: 2026-01-11 **Related**: [16-memory-management-plan.md](./16-memory-management-plan.md) --- ## Executive Summary The current memory management system stores markdown files with a graph structure based on wikilinks, but agents aren't naturally building a meaningful knowledge graph. This analysis explores **5 concrete approaches** to make the memory graph more useful for AI agents, comparing implementation complexity, agent behavior changes, and retrieval utility. The top 2 recommendations are: (1) **Tag-as-Nodes** for concept clustering, and (2) **Typed Memory Relationships** for explicit memory-to-memory links. --- ## Problem Analysis ### Current State The existing implementation (`src/memory/manager.py`) creates: - **Memory nodes**: Individual memories with `memory:` prefix (e.g., `memory:user-preferences-1`) - **Ghost nodes**: References to documents with `ghost:` prefix (e.g., `ghost:src/server.py`) - **Directed edges**: Created from wikilinks `[[target]]` with inferred types (`mentions`, `refactors`, `plans`, `debugs`, `related_to`) - **Edge attributes**: `edge_type` and `edge_context` (~100 char anchor text) **Example Graph Structure**: ``` memory:bug-fix-123 ---refactors---> ghost:src/api/handler.py ---mentions---> ghost:docs/api.md ``` ### The Disconnect Agents write **isolated markdown files** that occasionally link to documents but rarely to other memories. The graph topology is shallow: 1. **No memory-to-memory edges**: Memories don't reference each other, preventing concept chains 2. **Tags are metadata, not nodes**: Tags exist only in frontmatter YAML, not as graph entities 3. **No hierarchical relationships**: No parent/child or supersedes/obsoletes relationships 4. **Limited traversal queries**: Current `search_linked_memories` only finds memories linking TO a document, not FROM it or between memories **Result**: The graph is a **star topology** (memories → ghost nodes) rather than a **web topology** (memories ↔ memories ↔ concepts). --- ## Approach 1: Tags as First-Class Nodes ### Overview Elevate tags from metadata to graph nodes, enabling concept-based traversal and clustering. ### Graph Structure Changes **New Node Types**: - `tag:testing` — represents the concept "testing" - `tag:architecture` — represents the concept "architecture" - `tag:authentication` — represents the concept "authentication" **New Edges**: ```python # Memory → Tag memory:test-refactor ---HAS_TAG---> tag:testing memory:test-refactor ---HAS_TAG---> tag:unit-tests # Tag → Tag (hierarchies) tag:unit-tests ---IS_SUBTYPE_OF---> tag:testing tag:integration-tests ---IS_SUBTYPE_OF---> tag:testing ``` **Implementation Details**: 1. During `MemoryIndexManager.index_memory()`, for each `tag` in frontmatter: - Create `tag:{tag_name}` node if absent - Add `HAS_TAG` edge from memory to tag 2. Add `register_tag_hierarchy(parent: str, child: str)` tool to create `IS_SUBTYPE_OF` edges 3. Extend `GraphStore` with `get_neighbors_by_edge_type(node, edge_type)` for filtered traversal ### Agent Behavior Impact **Current Behavior**: ```yaml # Agent writes this: tags: [testing, refactoring] ``` **New Behavior**: ```yaml # Agent still writes this: tags: [testing, unit-tests] # But can now query: # "Find all memories about testing" → traverses tag:testing node # "What are the subtypes of testing?" → traverses IS_SUBTYPE_OF edges ``` **No prompt changes needed** — agents already use tags. The system just indexes them differently. ### New MCP Tools Enabled 1. **`get_memories_by_tag_cluster(seed_tag: str, depth: int = 2)`** - Traverses tag graph to find all related tags - Returns memories connected to the cluster - Example: `seed_tag="authentication"` might find `tag:jwt`, `tag:oauth`, `tag:sessions` 2. **`suggest_related_tags(current_tags: list[str])`** - Uses graph connectivity to suggest co-occurring tags - Helps agents discover existing tag conventions 3. **`visualize_tag_graph()`** - Returns DOT/JSON graph for visualization - Useful for debugging agent behavior ### Complexity vs. Benefit | Metric | Assessment | |--------|-----------| | **LOC Estimate** | ~250 lines (GraphStore extensions + 3 tools) | | **Agent Training** | None — reuses existing tag behavior | | **Query Utility** | High — enables concept-based clustering | | **Risk** | Low — additive change, doesn't break existing graph | **Key Benefit**: Agents naturally form concept clusters without explicit prompting. --- ## Approach 2: Typed Memory-to-Memory Relationships ### Overview Allow memories to link to other memories with explicit relationship types (supersedes, extends, contradicts, etc.). ### Graph Structure Changes **New Edge Types**: ```python # Versioning memory:api-design-v2 ---SUPERSEDES---> memory:api-design-v1 # Elaboration memory:auth-deep-dive ---EXTENDS---> memory:auth-overview # Disagreement memory:go-for-performance ---CONTRADICTS---> memory:python-for-readability # Dependencies memory:deploy-step-3 ---DEPENDS_ON---> memory:deploy-step-2 ``` **Link Syntax**: ```markdown This supersedes [[memory:old-plan]]. See also [[memory:related-note]] for context. ``` **Implementation Details**: 1. Extend `link_parser.py` to detect `memory:` prefix in wikilinks 2. Add relationship keywords to `EDGE_TYPE_KEYWORDS`: ```python "supersedes": ["supersedes", "replaces", "obsoletes"], "extends": ["extends", "elaborates", "builds on"], "contradicts": ["contradicts", "disagrees with", "challenges"], "depends_on": ["depends on", "requires", "needs"], ``` 3. No changes to `GraphStore` — reuses existing edge system ### Agent Behavior Impact **Current Behavior**: ```markdown # Agent writes isolated memories --- type: plan tags: [deployment] --- Deploy with Docker... ``` **New Behavior**: ```markdown # Agent creates chains --- type: plan tags: [deployment] --- This plan supersedes [[memory:old-docker-plan]] because... For prerequisites, see [[memory:setup-guide]]. ``` **Prompt changes needed**: - Add example: "When updating a memory, link to the old version with 'supersedes [[memory:old-id]]'" - Encourage: "Link related memories with [[memory:other-id]]" ### New MCP Tools Enabled 1. **`get_memory_history(memory_id: str)`** - Traverses `SUPERSEDES` edges to build version chain - Returns chronological list of memory versions 2. **`get_memory_dependencies(memory_id: str)`** - Traverses `DEPENDS_ON` edges to build dependency tree - Helps agents understand prerequisites 3. **`find_contradictions(topic: str)`** - Searches for memories linked by `CONTRADICTS` edges - Surfaces conflicting decisions/opinions ### Complexity vs. Benefit | Metric | Assessment | |--------|-----------| | **LOC Estimate** | ~150 lines (link parser keywords + 3 tools) | | **Agent Training** | Medium — requires prompt examples | | **Query Utility** | Very High — enables versioning and dependency tracking | | **Risk** | Low — reuses existing edge system | **Key Benefit**: Agents can track how memories evolve over time without manual renaming. --- ## Approach 3: Automatic Concept Extraction (Entity Recognition) ### Overview Use NLP to automatically extract entities (function names, class names, file paths) from memory content and create concept nodes. ### Graph Structure Changes **New Node Types**: ```python concept:FastAPI # Detected from text concept:authenticate_user # Detected function name concept:JWT # Detected acronym ``` **Edges**: ```python memory:api-refactor ---MENTIONS---> concept:FastAPI memory:auth-flow ---MENTIONS---> concept:authenticate_user ``` **Implementation Details**: 1. Add `src/memory/entity_extractor.py` with: - Code entity regex: `` `\w+\(` `` for functions, `` `\w+\.py` `` for files - Acronym detection: `[A-Z]{2,}` - Proper noun detection (optional: spaCy NER) 2. During indexing, run extractor on memory content 3. Create `concept:` nodes and `MENTIONS` edges ### Agent Behavior Impact **No behavior change needed** — fully automatic. **Example**: ```markdown # Agent writes: Refactored `authenticate_user()` in `auth.py` to use JWT tokens. # System automatically creates: memory:auth-refactor ---MENTIONS---> concept:authenticate_user memory:auth-refactor ---MENTIONS---> concept:auth.py memory:auth-refactor ---MENTIONS---> concept:JWT ``` ### New MCP Tools Enabled 1. **`find_memories_about_concept(concept: str)`** - Returns memories mentioning a specific function/file/entity - Example: "What memories discuss `authenticate_user`?" 2. **`get_concept_graph(memory_id: str)`** - Returns all concepts mentioned in a memory - Visualizes knowledge captured ### Complexity vs. Benefit | Metric | Assessment | |--------|-----------| | **LOC Estimate** | ~400 lines (NLP pipeline + tools) | | **Agent Training** | None — fully automatic | | **Query Utility** | Medium — useful but overlaps with keyword search | | **Risk** | Medium — regex FPs/FNs, potential noise | **Key Benefit**: Zero agent effort, but may create noisy graph. **Key Risk**: Over-extraction leads to concept pollution (e.g., `concept:the`, `concept:a`). --- ## Approach 4: Embedding-Based Memory Similarity Edges ### Overview Compute cosine similarity between memory embeddings and create `SIMILAR_TO` edges above a threshold. ### Graph Structure Changes **New Edges**: ```python memory:auth-flow ---SIMILAR_TO(0.87)---> memory:oauth-impl memory:test-strategy ---SIMILAR_TO(0.91)---> memory:test-refactor ``` **Implementation Details**: 1. After indexing a new memory, compute its embedding 2. Query vector index for top-K similar memories (threshold: 0.85) 3. Add `SIMILAR_TO` edges with similarity score as weight 4. Run batch recomputation periodically (weekly?) ### Agent Behavior Impact **No behavior change needed** — fully automatic. **Query Example**: ``` Agent: "Find memories related to testing" System: 1. Searches for "testing" 2. Finds memory:test-strategy 3. Traverses SIMILAR_TO edges 4. Returns memory:test-refactor, memory:unit-test-patterns ``` ### New MCP Tools Enabled 1. **`find_similar_memories(memory_id: str, limit: int = 5)`** - Traverses `SIMILAR_TO` edges - Returns semantically related memories 2. **`cluster_memories_by_similarity(threshold: float = 0.9)`** - Groups memories into clusters - Helps identify redundant memories ### Complexity vs. Benefit | Metric | Assessment | |--------|-----------| | **LOC Estimate** | ~300 lines (batch processing + tools) | | **Agent Training** | None — fully automatic | | **Query Utility** | High — complements tag-based search | | **Risk** | Medium — similarity ≠ relatedness, false positives | **Key Benefit**: Discovers implicit relationships without manual annotation. **Key Risk**: High similarity might link unrelated memories (e.g., both mention "Python" but discuss different topics). --- ## Approach 5: Temporal Graph (Event Sequences) ### Overview Treat memories as events in a timeline, creating `PRECEDED_BY` / `FOLLOWED_BY` edges based on `created_at`. ### Graph Structure Changes **New Edges**: ```python memory:bug-report ---PRECEDED_BY---> memory:feature-request memory:feature-request ---FOLLOWED_BY---> memory:bug-report ``` **Implementation Details**: 1. During indexing, sort memories by `created_at` 2. For each memory, link to the previous memory with same `type` or `tags` 3. Create bidirectional edges: `PRECEDED_BY` and `FOLLOWED_BY` ### Agent Behavior Impact **No behavior change needed** — uses existing `created_at` metadata. **Query Example**: ``` Agent: "Show the evolution of authentication decisions" System: 1. Searches for tag:authentication 2. Finds all memories 3. Traverses PRECEDED_BY edges chronologically 4. Returns ordered timeline ``` ### New MCP Tools Enabled 1. **`get_memory_timeline(tag: str)`** - Returns chronologically ordered memories for a tag - Visualizes how thinking evolved 2. **`get_recent_context(days: int = 7)`** - Returns memories from last N days - Helps agents "remember" recent work ### Complexity vs. Benefit | Metric | Assessment | |--------|-----------| | **LOC Estimate** | ~200 lines (temporal indexing + 2 tools) | | **Agent Training** | None — automatic | | **Query Utility** | Medium — useful for narrative reconstruction | | **Risk** | Low — simple chronological ordering | **Key Benefit**: Helps agents understand project history without reading all memories. **Key Risk**: Time-based proximity ≠ logical connection (unrelated memories might be temporally adjacent). --- ## Comparison Table | Approach | LOC | Agent Effort | Query Utility | Implementation Risk | Maintenance Burden | |----------|-----|--------------|---------------|---------------------|-------------------| | **1. Tags-as-Nodes** | 250 | **None** ✅ | **High** ✅ | **Low** ✅ | Low | | **2. Typed Memory Links** | 150 | Medium | **Very High** ✅ | **Low** ✅ | Low | | **3. Concept Extraction** | 400 | **None** ✅ | Medium | Medium ⚠️ | High (noise control) | | **4. Embedding Similarity** | 300 | **None** ✅ | High | Medium ⚠️ | Medium (batch jobs) | | **5. Temporal Graph** | 200 | **None** ✅ | Medium | **Low** ✅ | Low | ### Scoring Criteria - **LOC**: Lower is better (faster implementation) - **Agent Effort**: None = no prompt changes needed - **Query Utility**: How useful are the new graph queries? - **Implementation Risk**: Likelihood of bugs or edge cases - **Maintenance Burden**: Ongoing cost to keep graph clean --- ## Top 2 Recommendations ### 🏆 Recommendation 1: Tags-as-Nodes (Approach 1) **Why Implement First**: 1. **Zero agent training** — Agents already use tags extensively. We just index them differently. 2. **Immediate value** — Enables concept clustering without changing agent behavior. 3. **Low risk** — Additive change to existing graph, doesn't break anything. 4. **Foundation for future work** — Tag hierarchies can support ontology evolution. **Implementation Priority**: ``` Phase 1: Basic tag nodes + HAS_TAG edges (~150 LOC) Phase 2: Tag hierarchy support (IS_SUBTYPE_OF) (~50 LOC) Phase 3: MCP tools (get_memories_by_tag_cluster, suggest_related_tags) (~50 LOC) ``` **Expected Impact**: - Agents can discover related memories by concept even if exact keywords differ - Tag suggestions help agents converge on consistent taxonomy - Visualization reveals knowledge clusters in the memory bank **Example Workflow**: ```python # Agent queries results = search_memories("authentication") # Returns memories tagged: auth, oauth, jwt, sessions # System automatically clusters via tag graph: tag:authentication ├─ tag:jwt (3 memories) ├─ tag:oauth (2 memories) └─ tag:sessions (4 memories) ``` --- ### 🥈 Recommendation 2: Typed Memory-to-Memory Links (Approach 2) **Why Implement Second**: 1. **High utility** — Enables memory versioning and dependency tracking, which are common pain points. 2. **Reuses existing infrastructure** — No new graph features needed, just keyword updates. 3. **Encourages better agent behavior** — Prompts agents to connect memories explicitly. 4. **Complements Approach 1** — Tags provide concepts, memory links provide narratives. **Implementation Priority**: ``` Phase 1: Extend link parser with memory: prefix support (~50 LOC) Phase 2: Add relationship keywords (SUPERSEDES, EXTENDS, etc.) (~50 LOC) Phase 3: MCP tools (get_memory_history, find_contradictions) (~50 LOC) ``` **Expected Impact**: - Agents can update memories without losing history - Dependency chains make complex plans navigable - Contradictions surface when multiple approaches are explored **Example Workflow**: ```markdown # Agent creates memory chain --- type: plan --- Deploy with Docker. See [[memory:setup-env]] for prerequisites. # Later, agent updates: --- type: plan --- Deploy with Kubernetes. This supersedes [[memory:docker-deploy]] because... ``` **System can now answer**: - "What's the latest deployment plan?" → Traverses SUPERSEDES chain - "What do I need to set up first?" → Traverses DEPENDS_ON edges --- ## Implementation Roadmap ### Phase 1: Tags-as-Nodes (Week 1-2) **Files to Modify**: 1. `src/memory/manager.py`: - In `index_memory()`, after creating memory node: ```python for tag in frontmatter.tags: tag_node_id = f"tag:{tag}" self._graph.add_node(tag_node_id, {"is_tag": True}) self._graph.add_edge(memory.id, tag_node_id, "HAS_TAG") ``` 2. `src/indices/graph.py`: - Add `get_neighbors_by_edge_type(node_id: str, edge_type: str) -> list[str]` **New Files**: 1. `src/memory/tag_tools.py`: - `get_memories_by_tag_cluster(seed_tag, depth)` - `suggest_related_tags(current_tags)` - `register_tag_hierarchy(parent, child)` **Tests**: - `tests/unit/test_memory_tag_nodes.py` - `tests/integration/test_tag_cluster_search.py` **Acceptance Criteria**: - Tags appear as `tag:` nodes in graph - `get_memories_by_tag_cluster("testing")` returns memories with `unit-tests`, `integration-tests`, etc. - Graph persists/loads tag nodes correctly --- ### Phase 2: Typed Memory Links (Week 3-4) **Files to Modify**: 1. `src/memory/link_parser.py`: - Update `LINK_PATTERN` to capture `memory:` prefix: ```python LINK_PATTERN = re.compile(r'\[\[([^\]]+)\]\]') # Then check if target.startswith("memory:") ``` - Add to `EDGE_TYPE_KEYWORDS`: ```python "supersedes": ["supersedes", "replaces", "obsoletes"], "extends": ["extends", "elaborates", "builds on"], "contradicts": ["contradicts", "disagrees with"], "depends_on": ["depends on", "requires", "needs"], ``` **New Files**: 1. `src/memory/relationship_tools.py`: - `get_memory_history(memory_id)` - `get_memory_dependencies(memory_id)` - `find_contradictions(topic)` **Tests**: - `tests/unit/test_memory_link_types.py` - `tests/integration/test_memory_versioning.py` **Acceptance Criteria**: - `[[memory:old-plan]]` creates memory-to-memory edge - `get_memory_history()` returns chronological chain via SUPERSEDES edges - Edge types inferred correctly from context --- ## Open Questions ### Q1: Should tag nodes be case-sensitive? **Options**: - A: Case-insensitive (`tag:Testing` and `tag:testing` are the same) - B: Case-sensitive (allow both) **Recommendation**: Case-insensitive. Normalize to lowercase during indexing to avoid fragmentation. --- ### Q2: How to handle tag renaming? **Options**: - A: Manual tool `rename_tag(old, new)` that updates all edges - B: Alias system (multiple names for same node) - C: No support (agents must be consistent) **Recommendation**: Option A. Provide `rename_tag()` tool that: 1. Creates new `tag:new-name` node 2. Copies all edges from `tag:old-name` 3. Removes old node 4. Logs migration in memory bank --- ### Q3: Should memory-to-memory links require validation? **Options**: - A: Validate target exists (fail if `[[memory:nonexistent]]`) - B: Create "dangling" reference (like ghost nodes) - C: Auto-suggest existing memory IDs **Recommendation**: Option B initially, add Option C later. Ghost-style references preserve intent even if target is deleted. In future, add autocomplete to MCP tool descriptions. --- ### Q4: How to prevent tag explosion? **Problem**: Agents might create too many unique tags, fragmenting the graph. **Mitigation**: 1. `suggest_related_tags()` helps agents reuse existing tags 2. Weekly report: "Top 20 singleton tags" (used by only 1 memory) 3. Future: Tag merging tool (`merge_tags(["auth", "authentication"])`) --- ## Risks & Mitigation | Risk | Severity | Mitigation | |------|----------|------------| | **Tag graph becomes too dense** | Medium | Limit `get_memories_by_tag_cluster` depth to 2-3 hops | | **Memory link cycles (A→B→A)** | Low | Detect cycles in `get_memory_history()`, return warning | | **Agents overuse SUPERSEDES** | Low | Document: "Use SUPERSEDES only for direct replacements, not iterations" | | **Tag hierarchy conflicts** | Medium | Validate no cycles in `IS_SUBTYPE_OF` edges | | **Performance: tag node traversal slow** | Low | Index common queries (e.g., cache tag→memory mappings) | --- ## Alternatives Considered ### Alt 1: RDF-style Triple Store **Description**: Replace NetworkX with proper RDF store (rdflib). **Why Rejected**: - Massive architectural change (~2000 LOC) - Requires SPARQL learning curve for agents - Overkill for current graph complexity **Future Reconsideration**: If graph grows beyond 10K nodes. --- ### Alt 2: Obsidian Dataview-style Queries **Description**: Add SQL-like query language over memory metadata. **Why Rejected**: - Overlaps with existing hybrid search - Adds new syntax for agents to learn - Graph traversal better served by explicit tools **Future Reconsideration**: If agents need complex joins across metadata fields. --- ## Success Metrics ### Quantitative Metrics | Metric | Baseline | 3-Month Target | |--------|----------|----------------| | Avg memories per tag | ~2 | ~5 (tags reused more) | | Memory-to-memory links | 0 | 20% of memories link to another | | Tag hierarchy depth | 0 | 2-3 levels | | Ghost nodes | 100% of links | 80% (more memory links) | ### Qualitative Metrics 1. **Agent asks "What memories are related to X?"** → Should find via tag clusters 2. **Agent creates updated plan** → Should link with SUPERSEDES 3. **Agent explores multiple approaches** → Should create CONTRADICTS links --- ## Conclusion The current memory graph is a **collection of flat documents** with external references. By implementing **Tags-as-Nodes** and **Typed Memory-to-Memory Links**, we transform it into a **knowledge web** where: 1. **Concepts emerge** through tag clustering 2. **Narratives form** through memory chains 3. **Context propagates** through graph traversal These two approaches provide **maximum value** with **minimal agent training** and **low implementation risk**. They lay the foundation for future enhancements (concept extraction, embedding similarity) without committing to complex NLP pipelines prematurely. **Estimated Total Effort**: 3-4 weeks for both approaches (~400 LOC + tests) **Next Steps**: 1. Review this analysis with stakeholders 2. Create detailed implementation tickets for Phase 1 (Tags-as-Nodes) 3. Update [16-memory-management-plan.md](./16-memory-management-plan.md) with graph extension phases

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/andnp/ragdocs-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

memory-graph-enhancement-analysis.md•22.1 KiB