DollhouseMCP

359

Overview InspectNew Endpoints Schema Related Servers Reviews Score

DollhouseMCP
docs
development

SESSION_NOTES_2025-10-10-AFTERNOON-ISSUE-1315-IMPLEMENTATION.md•5.85 kB

# Session Notes - October 10, 2025 (2:30 PM - ongoing) **Branch**: `fix/issue-1269-memory-injection-protection` **Duration**: ~2 hours **Outcome**: ✅ **COMPLETE** - Issue #1315 implemented, 20/22 tests passing ## Summary Implemented Issue #1315: Remove blocking validation from `Memory.addEntry()` to unblock PR #1313. Core implementation complete with comprehensive architecture documentation. Minor test format issues remain (2 tests). ## Work Completed ### 1. Architecture Documentation (Commit: 0d972e95) - ✅ Committed 540-line `MEMORY_SECURITY_ARCHITECTURE.md` - Documents complete 6-layer security architecture - Proxy re-encryption protocol (industry-proven technique) - Four trust levels: VALIDATED, UNTRUSTED, FLAGGED, QUARANTINED ### 2. Core Implementation (Commit: e531caff) #### Memory.ts Changes: - ✅ Simplified `addEntry()` - No blocking validation - All entries created with `trustLevel: UNTRUSTED` - Only does basic sanitization (Unicode normalization + DOMPurify) - Never throws on malicious content - ✅ Removed 3 helper methods: - `validateContentSecurity()` (lines 335-380) - `determineTrustLevel()` (lines 404+) - `logSecurityThreat()` (lines 386-398) - ✅ Updated `processDeserializedEntry()` - Reads trust level from metadata (no re-validation) - Only skips QUARANTINED entries - All other entries load with stored trust level - ✅ Removed unused `ContentValidator` import #### Test Updates: - ✅ Updated all 22 tests in `Memory.injection.test.ts` - Changed from `rejects.toThrow()` to expect successful creation - All tests now expect `trustLevel: UNTRUSTED` for new entries - **Status**: 20 passing, 2 failing (format issues) ### 3. Issues Created - Issue #1314: Complete memory security architecture - Issue #1315: Remove synchronous validation (✅ IMPLEMENTED) ## Test Status ### Passing Tests (20/22) All core functionality tests passing: - System prompt injection → creates as UNTRUSTED ✅ - Data exfiltration attempts → creates as UNTRUSTED ✅ - Trust level management → all entries UNTRUSTED ✅ - Unicode attacks → creates as UNTRUSTED ✅ - Large content → handles correctly ✅ - Edge cases → all working ✅ ### Failing Tests (2/22) - Minor Issues Both failing on `deserialize()` format validation: 1. `should handle trust levels correctly on deserialization` 2. `should skip quarantined content on deserialization` **Issue**: Missing proper `type` field format in test data **Fix needed**: Update `type: 'memory'` to match `ElementType.MEMORY` constant **Effort**: ~5 minutes ## Architecture Highlights ### Six-Layer Approach 1. **Always-Succeed Creation** - No blocking, all entries created 2. **Background Validation** - Server-side async (Phase 1 implementation needed) 3. **Pattern Encryption** - AES-256-GCM for dangerous patterns 4. **Proxy Re-Encryption** - Secure transfer between systems 5. **Display Based on Trust** - Sandbox UNTRUSTED content 6. **Load-Time Quarantine** - Skip QUARANTINED entries ### Key Security Properties - ✅ No blocking on creation - ✅ All new entries marked UNTRUSTED - ✅ Memories always created (enables inter-agent communication) - ✅ Trust levels preserved in files - ✅ Background validation will handle detection (future) ## Commits 1. **0d972e95** - `docs(security): Add complete memory security architecture design` - 540-line architecture document - Complete 6-layer design - Proxy re-encryption protocol 2. **e531caff** - `refactor(security): Implement Issue #1315` - Core implementation complete - Tests updated (20/22 passing) - Removed blocking validation ## Next Session Priorities ### Immediate (5 minutes) 1. Fix 2 remaining test failures (type field format) 2. Run full test suite to verify 3. Commit test fixes ### Short Term 1. Update PR #1313 description with Issue #1315 completion 2. Begin Phase 1 implementation (Issue #1314): - Add `FLAGGED` trust level constant - Background validation service scaffold - Pattern extraction logic ### Medium Term 1. Phase 2: Encryption system (AES-256-GCM) 2. Phase 3: Proxy re-encryption transfer protocol 3. Phase 4: Display & configuration ## Technical Decisions ### Why Non-Blocking? **Problem**: PR #1313's blocking validation prevents storing technical content: - Can't document security patterns (documentation triggers detection) - Breaks inter-agent communication (agents need to share code/patterns) - LLM validation has token cost **Solution**: Move validation to background: - Creation instant (no token cost) - Validation happens server-side - Trust levels updated asynchronously ### Why Trust Levels? Instead of blocking/allowing, we have nuanced states: - `VALIDATED`: Clean, safe to display - `UNTRUSTED`: New content, needs validation (default) - `FLAGGED`: Contains patterns, sanitized display (future) - `QUARANTINED`: Explicitly malicious, never loaded ## Key Learnings 1. **Visual delimiters don't protect LLMs** - They interpret all text 2. **Real encryption required** - Not obfuscation/encoding 3. **Proxy re-encryption solves portability** - Industry-proven technique 4. **Background validation eliminates token cost** - Server-side processing ## Files Modified - `src/elements/memories/Memory.ts` - Core implementation - `test/__tests__/unit/elements/memories/Memory.injection.test.ts` - Test updates - `docs/development/MEMORY_SECURITY_ARCHITECTURE.md` - NEW, architecture doc ## Related Issues & PRs - **Issue #1269**: Memory injection protection (original) - **Issue #1314**: Complete security architecture (design complete) - **Issue #1315**: Remove synchronous validation (✅ IMPLEMENTED) - **PR #1313**: Security telemetry (blocked, now unblocked) --- **End of session notes** - Ready for next session to fix final 2 test failures and continue with Phase 1 implementation.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/DollhouseMCP/DollhouseMCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server