Session Buddy

CLAUDE.md•24 KiB

# CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

## Project Overview

Session Buddy is a Claude Session Management MCP (Model Context Protocol) server providing comprehensive session management for Claude Code across any project. It operates as a standalone MCP server with an isolated environment to avoid dependency conflicts.

## Development Commands

### Installation & Setup

```bash
# Install all dependencies (development + production)
uv sync --group dev

# Install minimal dependencies (production)
uv sync

# Run server
python -m session_buddy.server

# Run with debug logging
PYTHONPATH=. python -m session_buddy.server --debug

# Verify installation
python -c "from session_buddy.server import mcp; print('✅ MCP server ready')"
python -c "from session_buddy.reflection_tools import ReflectionDatabase; print('✅ Memory system ready')"
```

### Quick Start

```bash
# Complete development setup
uv sync --group dev && pytest -m "not slow" && crackerjack lint
```

### Code Quality

```bash
crackerjack lint        # Lint and format code
crackerjack typecheck   # Run type checking
crackerjack security    # Security scanning
crackerjack complexity  # Code complexity analysis
crackerjack analyze     # Full quality analysis
```

### Testing

```bash
pytest                      # Run all tests
pytest -m "not slow"        # Quick smoke tests (dev recommended)
pytest tests/unit/          # Unit tests only
pytest tests/integration/   # Integration tests only
pytest -m performance       # Performance tests
pytest -m security          # Security tests
pytest -n auto              # Parallel execution (faster)
pytest --cov=session_buddy --cov-report=term-missing  # Coverage
pytest --cov=session_buddy --cov-fail-under=85        # Enforce 85%+ coverage
```

### Workflows

```bash
# Pre-commit workflow
uv sync --group dev && crackerjack lint && pytest -m "not slow" && crackerjack typecheck

# Full quality gate (before PR)
pytest --cov=session_buddy --cov-fail-under=85 && crackerjack security && crackerjack complexity
```

## Architecture Overview

### Recent Changes (January 2025)

- **Removed sitecustomize.py** - Eliminated 115 lines of startup-time patches
- **Updated FastAPI to >=0.124.2** - Removed upper bound constraint
- **Documentation Reorganization** - Archived 80 historical docs to `docs/archive/`

### Oneiric Adapter Migration (COMPLETE)

Both database layers migrated to native DuckDB adapters (Oneiric):

**Phase 2-3**: Created `ReflectionDatabaseAdapter` and `KnowledgeGraphDatabaseAdapter` with hybrid sync/async pattern

**Phase 5**: Replaced external framework with direct DuckDB operations

```python
async def create_entity(self, name: str, ...) -> dict:
    """Async signature for API consistency, sync operation internally."""
    conn = self._get_conn()  # Sync DuckDB connection
    conn.execute("INSERT INTO kg_entities ...")  # Fast (<1ms)
    return {"id": entity_id, ...}
```

**Benefits**: Native DuckDB, improved connection pooling, better testability, zero breaking changes

**Details**: `docs/migrations/ONEIRIC_MIGRATION_PLAN.md`

### Phase 2 Architecture Refactoring (February 2026)

**Key Achievements**:

- Zero circular dependencies between core and MCP layers
- Core layer has ZERO MCP imports (verified programmatically)
- 908 lines of code reduction through deprecated code removal
- 18/18 architecture validation checks passed

**Details**: `docs/PHASE2_SUMMARY.md`

#### Layer Separation

```
MCP Layer (server.py)
- FastMCP integration, Tool registration
- Concrete implementations (MCPQualityScorer, MCPCodeFormatter)
        implements
Core Layer Interfaces
- QualityScorer (ABC), CodeFormatter (ABC)
        injected via DI
Core Layer Components
- SessionLifecycleManager, HooksManager
        uses
Infrastructure Layer
- Reflection database, Git operations, File system utilities
```

**Dependency Rules**:

1. **MCP Layer**: Implements Core interfaces, CAN import Core + Infrastructure
1. **Core Layer**: Contains interfaces/business logic, CAN import Infrastructure + ABC, CANNOT import MCP
1. **Infrastructure Layer**: Storage/utilities, CAN import stdlib + external deps, CANNOT import MCP/Core

#### Dependency Injection

```python
# Configure DI container (server startup)
from session_buddy.di import configure, get_sync_typed, reset

configure()  # Registers all singletons

# Get typed instance
manager = get_sync_typed(SessionLifecycleManager)

# Reset (testing)
reset()
```

**Registration Order** (critical): 1. SessionPaths → 2. SessionLogger → 3. SessionPermissionsManager → 4. QualityScorer (MCPQualityScorer/DefaultQualityScorer) → 5. CodeFormatter (MCPCodeFormatter/DefaultCodeFormatter) → 6. SessionLifecycleManager → 7. HooksManager

**Usage Patterns**:

```python
# Pattern 1: Constructor injection (preferred)
class SessionLifecycleManager:
    def __init__(self, quality_scorer: QualityScorer | None = None):
        self.quality_scorer = quality_scorer or get_sync_typed(QualityScorer)


# Pattern 2: Direct DI lookup (utilities)
def some_function():
    manager = get_sync_typed(SessionLifecycleManager)


# Pattern 3: Testing with mocks
manager = SessionLifecycleManager(quality_scorer=Mock(spec=QualityScorer))
```

#### Interface-Based Design

**Core layer defines interface**:

```python
class QualityScorer(ABC):
    @abstractmethod
    async def calculate_quality_score(self, project_dir: Path | None = None) -> dict[str, Any]:
        pass
```

**MCP layer provides concrete implementation**:

```python
class MCPQualityScorer(QualityScorer):
    async def calculate_quality_score(self, project_dir: Path | None = None) -> dict[str, Any]:
        from session_buddy.mcp.server import calculate_quality_score

        return await calculate_quality_score(project_dir=project_dir)
```

**Core layer provides fallback**:

```python
class DefaultQualityScorer(QualityScorer):
    async def calculate_quality_score(self, project_dir: Path | None = None) -> dict[str, Any]:
        return {"overall": 75, "metrics": {}, "recommendations": []}
```

#### Reflection System Modularization

**Before**: `reflection_tools.py` (1,345 lines) - Everything in one file

**After**: Modular structure with 97% reduction in main file

```
session_buddy/reflection/
|-- __init__.py (50 lines) - Public API
|-- database.py (380 lines) - Database class
|-- embeddings.py (200 lines) - Vector generation
|-- schema.py (160 lines) - Database structure
|-- search.py (330 lines) - Semantic/text search
|-- storage.py (280 lines) - CRUD operations
reflection_tools.py (37 lines) - Compatibility wrapper
```

**Benefits**: Clear separation, better testability, 100% backward compatibility

#### Architectural Benefits

- **Testability**: Core layer tested without MCP dependencies, DI supports test doubles
- **Maintainability**: Layer separation prevents circular dependencies, modular structure isolates changes
- **Extensibility**: Multiple implementations via DI, graceful degradation with fallbacks
- **Performance**: DI adds \<1ms overhead, singleton pattern prevents duplicates

### Core Components

**server.py** (~3,500+ lines): FastMCP integration, tool registration, session lifecycle, permissions, project analysis, Git integration, structured logging

**reflection_tools.py**: DuckDB database with FLOAT[384] vector embeddings, local ONNX model (all-MiniLM-L6-v2), async architecture with executor threads, text search fallback

**crackerjack_integration.py**: Real-time parsing of Crackerjack output, quality metrics aggregation, test result analysis, command history learning

**tools/**: Organized MCP tool implementations (session, memory, search, crackerjack, LLM, team)

**core/**: Session state and lifecycle coordination

**di/**: Dependency injection configuration (`configure()`, `get_sync_typed()`, benefits: testability, reduced coupling)

**utils/**: Git operations, logging, quality scoring (V1 + V2 filesystem-based)

### Advanced Components

**multi_project_coordinator.py**: Cross-project coordination with `ProjectGroup`/`ProjectDependency` dataclasses, relationship types (related/continuation/reference), dependency-aware result ranking

**token_optimizer.py**: tiktoken-based token counting, auto-split responses >4000 tokens with `ChunkResult` pagination

**search_enhanced.py**: Faceted search (project, time, author, content type), aggregations, FTS5 full-text indexing

**interruption_manager.py**: File system monitoring, automatic context snapshots, session restoration

**serverless_mode.py**: Oneiric storage adapters (File, S3, Azure, GCS, Memory), session serialization, multi-instance support

**app_monitor.py**: IDE activity tracking, development behavior insights, workflow efficiency metrics

**natural_scheduler.py**: Natural language time parsing, background reminder service

**worktree_manager.py**: Git worktree operations, session coordination, branch management

### Key Design Patterns

#### 1. Async-First Architecture

```python
# ✅ Correct: Use executor for blocking operations
async def generate_embedding(text: str) -> np.ndarray:
    loop = asyncio.get_running_loop()  # Preferred over get_event_loop() inside a coroutine
    return await loop.run_in_executor(None, _sync_embedding_generation, text)


# ❌ Avoid: Blocking the event loop
async def bad_embedding(text: str) -> np.ndarray:
    return onnx_session.run(None, {"input": text})  # Blocks!
```

#### 2. Graceful Degradation

- Optional dependencies fall back gracefully (ONNX → text search)
- Memory constraints trigger automatic chunking/compression
- Error recovery continues despite component failures

#### 3. Local-First Privacy

- No external APIs (embeddings generated locally via ONNX)
- Local DuckDB storage in `~/.claude/`
- Zero network dependencies for core features
- Complete user data control

#### 4. Selective Auto-Store (High Signal-to-Noise)

**Triggers**: Manual checkpoints (always), session end (always), quality delta ≥10, exceptional ≥90

**Skips**: Routine auto-checkpoints with minimal changes

**Configuration**:

```python
enable_auto_store_reflections: bool = True
auto_store_quality_delta_threshold: int = 10
auto_store_exceptional_quality_threshold: int = 90
auto_store_manual_checkpoints: bool = True
auto_store_session_end: bool = True
```

**Tags**: `manual_checkpoint`, `session_end`, `quality_improvement`, `quality_degradation`, `high-quality`, `good-quality`, `needs-improvement`, `user-initiated`, `quality-change`, `session-summary`

#### 5. Type-Safe Data Modeling

```python
@dataclass
class ProjectDependency:
    source_project: str
    target_project: str
    dependency_type: Literal["related", "continuation", "reference"]
    description: str | None = None
```

- Dataclass architecture, modern Python 3.13+ type hints, Pydantic runtime validation

#### 6. Performance-Optimized Vector Search

```sql
SELECT content,
       array_cosine_similarity(embedding, $1) as similarity
FROM conversations
WHERE similarity > 0.7
ORDER BY similarity DESC, timestamp DESC
LIMIT 20;
```

- FLOAT[384] vectors, cosine similarity, hybrid semantic + temporal ranking

### Session Management Workflow

**Git Repositories (Automatic)**:

1. Start Claude Code - Session auto-initializes
1. Work normally - Automatic quality tracking
1. Run `/checkpoint` - Manual checkpoints with auto-compaction
1. Exit any way - Session auto-cleanup

**Non-Git Projects (Manual)**:

1. Start with `/start` (if you want session management)
1. Checkpoint with `/checkpoint` as needed
1. End with `/end` before quitting

**Automatic Initialization** (Git repos): Sets up `~/.claude` directory, syncs UV dependencies, analyzes project context, calculates maturity score, sets up permissions, crash resilient

**Enhanced Quality Monitoring** (`checkpoint`): Multi-factor quality score, automatic context compaction, Git commits with metadata, workflow recommendations

**Automatic Session Cleanup** (Git repos): Any disconnect/quit/crash, generates handoff docs, final quality assessment, cleanup artifacts, zero manual intervention

### Memory System Architecture

**DuckDB Schema**:

```sql
CREATE TABLE conversations (
    id TEXT PRIMARY KEY,
    content TEXT,
    embedding FLOAT[384],  -- all-MiniLM-L6-v2
    project TEXT,
    timestamp TIMESTAMP
);

CREATE TABLE reflections (
    id TEXT PRIMARY KEY,
    content TEXT,
    embedding FLOAT[384],
    tags TEXT[]
);
```

**Vector Search**: Local ONNX model (all-MiniLM-L6-v2, 384-dim), cosine similarity, text search fallback, async executor threads

**Multi-Project**: `ProjectGroup`/`ProjectDependency` tables, cross-project search with dependency-aware ranking, typed relationships (continuation/reference/related)

## Configuration & Integration

### MCP Configuration

Uses global `~/.claude/.mcp.json` (recommended). Project-level `.mcp.json` removed as redundant.

```json
{
  "mcpServers": {
    "session-buddy": {
      "command": "python",
      "args": ["-m", "session_buddy.server"],
      "cwd": "/path/to/session-buddy",
      "env": {"PYTHONPATH": "/path/to/session-buddy"}
    }
  }
}
```

### Directory Structure

- **~/.claude/logs/**: Session management logging
- **~/.claude/data/**: Reflection database storage

### Environment Variables

- `PWD`: Current working directory detection

### Oneiric Storage Adapters

**Backends**: `file` (default, dev), `s3` (AWS/MinIO), `azure`, `gcs`, `memory` (testing only)

**Configuration** (`settings/session-buddy.yaml`):

```yaml
storage:
  default_backend: "file"
  file:
    local_path: "${SESSION_STORAGE_PATH:~/.claude/data/sessions}"
    auto_mkdir: true
  s3:
    bucket_name: "${S3_BUCKET:session-buddy}"
    endpoint_url: "${S3_ENDPOINT:}"
    region: "${S3_REGION:us-east-1}"
```

**Benefits**: Multi-cloud support, environment variable support, native DuckDB, better connection pooling, 91% code reduction, 100% backward compatibility

**Migration**: `docs/migrations/ONEIRIC_MIGRATION_PLAN.md`

## Development Notes

### Dependencies

- **Core**: `fastmcp>=2`, `duckdb>=0.9`, `pydantic>=2.0`, `tiktoken>=0.5`, `crackerjack`
- **Embeddings**: `onnxruntime>=1.15`, `transformers>=4.21` (graceful fallback to text search)
- **Dev**: `pytest>=7`, `pytest-asyncio>=0.21`, `hypothesis>=6.70`, `coverage>=7`
- Isolated virtual environment prevents conflicts

### Testing Architecture

**Structure**:

- **Unit** (`tests/unit/`): Core functionality, session permissions, reflection DB operations, async/await patterns, mock fixtures
- **Integration** (`tests/integration/`): End-to-end MCP workflows, tool registration/execution, concurrent database operations
- **Functional** (`tests/functional/`): Cross-component integration, user workflows, performance/reliability

**Features**: Async/await support, temporary database fixtures, data factories, performance metrics, mock MCP server

## Available MCP Tools

**Total: 70+ tools** across 10 categories.
See [README.md](README.md#available-mcp-tools) for complete list.

### Core Session Management (8 tools)

`start`, `checkpoint`, `end`, `status`, `permissions`, `auto_compact`, `quality_monitor`, `session_welcome`

### Memory & Search (14 tools)

**Search**: `search_reflections`/`reflect_on_past`, `quick_search`, `search_summary`, `get_more_results`, `search_by_file`, `search_by_concept`, `search_code`, `search_errors`, `search_temporal`

**Storage**: `store_reflection`, `reflection_stats`, `reset_reflection_database`

### Advanced Categories

- **Crackerjack** (11): Command execution, quality metrics, patterns, health monitoring
- **LLM Management** (5): `list_llm_providers`, `test_llm_providers`, `generate_with_llm`, `chat_with_llm`, `configure_llm_provider`
- **Serverless** (8): External storage integration (Redis, S3, local)
- **Team** (4): `create_team`, `search_team_knowledge`, `get_team_statistics`, `vote_on_reflection`
- **Multi-Project** (4): `create_project_group`, `add_project_dependency`, `search_across_projects`, `get_project_insights`
- **Plus**: App Monitoring (5), Interruption Management (7), Natural Scheduling (5), Git Worktree (3), Advanced Search (3)

## Token Optimization

**Architecture**: `TokenOptimizer` with tiktoken, auto-split responses >4000 tokens into paginated chunks

```python
@dataclass
class ChunkResult:
    chunks: list[str]
    total_chunks: int
    current_chunk: int
    cache_key: str
    metadata: dict[str, Any]
```

**Usage**:

```python
result = await some_large_operation()
if result.get("chunked"):
    # Use get_cached_chunk tool for additional chunks
    ...
```

## Integration with Crackerjack

- **Quality Commands**: `crackerjack lint`, `crackerjack typecheck`, etc.
- **MCP Integration**: Crackerjack configured in `.mcp.json`
- **Progress Tracking**: `crackerjack_integration.py` provides real-time analysis
- **Test Integration**: Crackerjack handles execution, this project handles results

## Development Guidelines

### Adding New MCP Tools

1. Define function with `@mcp.tool()` in appropriate `tools/` module
1. Add `@mcp.prompt()` for slash command support
1. Import and register in `server.py`
1. Update `status()` tool
1. Add tests in appropriate category

### Extending Memory System

1. Add table schemas in `reflection_tools.py:_ensure_tables()`
1. Implement storage/retrieval in `ReflectionDatabase`
1. Add MCP tools in `tools/memory_tools.py`
1. Update `reflection_stats()` for new metrics
1. Add performance tests

### Testing New Features

1. Add unit tests in `tests/unit/`
1. Add integration tests in `tests/integration/`
1. Add functional tests in `tests/functional/`
1. Use `tests/fixtures/` for test data factories
1. Ensure coverage: `pytest --cov=session_buddy`

## Configuration Files

### pyproject.toml

- **Python 3.13+** required
- **Ruff**: Formatting/linting, complexity ≤15
- **Pytest**: Async/await, coverage, benchmarking
- **Optional**: `[embeddings]` for semantic search, `[dev]` for dev tools

### MCP Server Configuration

Global `~/.claude/.mcp.json`: session-buddy, crackerjack, GitHub, GitLab, memory

### Testing Configuration (conftest.py)

Async/await support, temporary database fixtures, mock MCP server, performance baselines

## Modern Development Patterns

### Database Connection Management

```python
# ✅ Correct: Context manager with pooling
async def store_conversation(content: str) -> str:
    async with ReflectionDatabase() as db:
        return await db.store_conversation(content)


# ✅ Correct: Batch operations
async def bulk_store(conversations: list[str]) -> list[str]:
    async with ReflectionDatabase() as db:
        return await db.bulk_store_conversations(conversations)
```

### Error Handling

```python
async def search_with_fallback(query: str) -> list[SearchResult]:
    try:
        return await semantic_search(query)
    except (ImportError, RuntimeError) as e:
        logger.info(f"Semantic search unavailable: {e}. Using text search.")
        return await text_search(query)
    except Exception as e:
        logger.error(f"Search failed: {e}")
        return []
```

### MCP Tool Pattern

```python
@mcp.tool()
async def example_tool(param1: str, param2: int | None = None) -> dict[str, Any]:
    """Tool description for Claude Code."""
    try:
        if not param1.strip():
            return {"success": False, "error": "param1 cannot be empty"}

        result = await perform_async_operation(param1, param2)
        return {
            "success": True,
            "data": result,
            "metadata": {
                "timestamp": datetime.now().isoformat(),
                "execution_time_ms": 42,
            },
        }
    except Exception as e:
        logger.error(f"Tool execution failed: {e}")
        return {"success": False, "error": str(e)}
```

## Troubleshooting

### MCP Server Not Loading

```bash
python -c "import session_buddy; print('✅ Package imports')"
python -c "from session_buddy.server import mcp; print('✅ MCP server')"
python -c "import duckdb, numpy, tiktoken; print('✅ Core deps')"
```

### Memory/Embedding Issues

```bash
python -c "
from session_buddy.reflection_tools import ReflectionDatabase
import asyncio

async def test():
    try:
        async with ReflectionDatabase() as db:
            result = await db.test_embedding_system()
            print(f'✅ Embedding system: {result}')
    except Exception as e:
        print(f'⚠️ Fallback mode: {e}')

asyncio.run(test())
"

uv sync  # Reinstall if needed
```

### Database Connection

```bash
python -c "import duckdb; print(f'✅ DuckDB {duckdb.__version__}')"
python -c "import duckdb; conn = duckdb.connect(':memory:'); print('✅ Connection')"
ls -la ~/.claude/data/ || mkdir -p ~/.claude/data/
```

### Performance Issues

```bash
pytest -m performance --verbose
python -c "import psutil, os; print(f'Memory: {psutil.Process(os.getpid()).memory_info().rss / 1024 / 1024:.1f} MB')"
PYTHONPATH=. python -m session_buddy.server --debug
```

### Environment Setup

```bash
# UV
uv --version || curl -LsSf https://astral.sh/uv/install.sh | sh
rm -rf .venv && uv sync --group dev
uv pip check

# Python version
python --version  # Should be 3.13+
python -c "import sys; assert sys.version_info >= (3, 13)"
```

## Coding Standards

### Core Philosophy (RULES.md)

- **EVERY LINE OF CODE IS A LIABILITY**: The best code is no code
- **DRY**: If you write it twice, you're doing it wrong
- **YAGNI**: Build only what's needed NOW
- **KISS**: Complexity is the enemy of maintainability

### Type Safety

- Comprehensive type hints with Python 3.13+ syntax
- `list[str]` instead of `typing.List[str]`
- `str | None` instead of `Optional[str]`

### Development Practices

1. Always use async/await for database/file operations
1. Test with both embedding and fallback modes
1. Include comprehensive error handling with graceful degradation
1. Use type hints and dataclasses
1. Follow testing pattern: unit → integration → functional
1. Run pre-commit workflow before commits
1. Monitor token usage and response chunking
1. Test cross-project coordination with multiple repos

### Key Architecture Patterns

1. **FastMCP Integration**: `@mcp.tool()` decorators, structured responses
1. **Async-First**: Executor threads prevent blocking
1. **Local Privacy**: No external APIs, local embeddings
1. **Graceful Fallback**: Continues despite component failures
1. **Modular Structure**: Tools organized by functionality
1. **Session Lifecycle**: Init → Work → Checkpoint → End

This project uses [crackerjack](https://github.com/lesleslie/crackerjack) for Python project management and quality assurance.

**Specialized Agents**:

- **crackerjack-architect**: Use PROACTIVELY for all feature development and architectural decisions
- **python-pro**: Modern Python with type hints, async/await, clean architecture
- **pytest-hypothesis-specialist**: Advanced testing patterns and optimization
- **backend-architect**: System design and API architecture
- **security-auditor**: Security analysis and secure coding

**Usage**:

```bash
# Use Task tool with subagent_type
Task tool with subagent_type="crackerjack-architect"         # Feature planning
Task tool with subagent_type="python-pro"                    # Code implementation
Task tool with subagent_type="pytest-hypothesis-specialist"  # Tests
Task tool with subagent_type="security-auditor"              # Security analysis
```

**Crackerjack Philosophy** (same as core philosophy):

- Every line of code is a liability
- DRY, YAGNI, KISS
- Cognitive complexity ≤15 per function
- Coverage ratchet: never decrease, always improve toward 100%
- Type annotations required
- Security patterns: no hardcoded paths, proper temp file handling
- Python 3.13+ patterns: `|` unions, pathlib over os.path

**Workflow**:

```bash
python -m crackerjack                # Main menu
python -m crackerjack -t             # Run tests
python -m crackerjack --ai-agent -t  # Auto-fix with AI
python -m crackerjack -a patch       # Apply patches
```

**Best Practices**:

1. Plan with crackerjack-architect for proper architecture
1. Implement with python-pro for modern patterns
1. Test with pytest-hypothesis-specialist
1. Run `python -m crackerjack -t` before committing
1. Security review with security-auditor

**Key**: Use crackerjack-architect proactively to avoid retrofitting. Never reduce test coverage (ratchet system). Follow crackerjack patterns - tools enforce quality automatically.
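
**Coverage ratchet sketch**: The "never decrease" rule above can be illustrated with a small helper. This is a minimal sketch, not crackerjack's actual implementation: the function name `ratchet_coverage` and the `CoverageRegression` exception are hypothetical, and how the baseline is stored between runs is left out.

```python
class CoverageRegression(Exception):
    """Hypothetical error raised when coverage falls below the recorded baseline."""


def ratchet_coverage(current: float, baseline: float) -> float:
    """Return the new baseline, or raise if coverage regressed.

    Both values are percentages in the range 0-100. The baseline only
    ever moves upward, which is the ratchet behavior described above.
    """
    for value in (current, baseline):
        if not 0.0 <= value <= 100.0:
            raise ValueError(f"Coverage must be between 0 and 100, got {value}")

    if current < baseline:
        raise CoverageRegression(
            f"Coverage dropped from {baseline:.1f}% to {current:.1f}%"
        )

    # Equal or higher coverage becomes the new floor for the next run.
    return max(current, baseline)
```

A caller would persist the returned value and pass it back as `baseline` on the next run, so an improvement immediately raises the bar and any later drop fails the check.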
