Codebase MCP Server

codebase-mcp
docs
architecture

background-indexing-tasks-merged.md•94.7 KiB

# Background Indexing Implementation Tasks - Definitive Edition **Feature Branch**: `014-add-background-indexing` **Source Documents**: - `docs/architecture/background-indexing-tasks.md` (Collaborative - technical depth) - `specs/014-add-background-indexing/tasks.md` (Spec-Driven - user story coverage) - `specs/014-add-background-indexing/spec.md` (Requirements traceability) **Constitutional Principles**: I (Simplicity), II (Local-First), III (Protocol Compliance), IV (Performance), V (Production Quality), VI (Specification-First), VII (Test-Driven Development), VIII (Type Safety), XI (FastMCP Foundation) --- ## Executive Summary This task list implements PostgreSQL-native background indexing for large repositories (10K+ files, 5-10 minute indexing time) that exceed typical MCP request timeouts. The architecture uses PostgreSQL as the single source of truth for job state, eliminating in-memory complexity while ensuring production-ready reliability with state persistence across server restarts. **Key Architectural Decision**: Job state lives in the database, not in memory. This trades slight query overhead for massive gains in simplicity, reliability, and debuggability. **Implementation Approach**: TDD throughout, with test tasks preceding implementation. Each phase is independently testable and delivers incremental value. **User Story Integration**: Tasks are organized by user story to enable independent implementation and testing: - **[US1]**: Core background indexing (MVP) - Index large repositories without interruption - **[US2]**: Enhanced monitoring - Monitor indexing progress with detailed status - **[US3]**: Cancellation - Cancel unwanted indexing tasks - **[US4]**: Resumption - Resume indexing after interruptions --- ## Implementation Timeline **Total Estimated Time**: 12.5-14.5 hours (updated to include US2 enhancements) | Phase | Description | Time | Dependencies | User Stories | |-------|-------------|------|--------------|--------------| | Phase 1 | Database Schema & Migration | 1.5 hours | None (Critical Path Start) | Infrastructure | | Phase 2 | Core Models & Validation (BLOCKS User Stories) | 1.5 hours | Phase 1 | Infrastructure | | Phase 3 | Database Utilities | 1.5 hours | Phase 2 | Infrastructure | | Phase 4 | Background Worker | 2 hours | Phase 3 | Infrastructure | | Phase 5 | MCP Tools - Core (US1) | 1.5 hours | Phase 4 | US1 | | Phase 6 | Enhanced Monitoring (US2) | 2 hours | Phase 5 | US2 | | Phase 7 | Indexer Enhancement | 1 hour | Phase 4-5 (Parallel) | US1 + US2 | | Phase 8 | Integration Testing | 2 hours | Phase 6 | All | | Phase 9 | Documentation & Polish | 1.5 hours | Phase 8 | All | **Critical Path**: Phase 1 → Phase 2 → Phase 3 → Phase 4 → Phase 5 → Phase 6 → Phase 8 → Phase 9 (11 hours) **Parallel Opportunities**: Phase 7 can run parallel with Phase 5-6 (saves 1 hour) --- ## Phase Breakdown ### Phase 1: Database Schema & Migration (Critical Path) **Purpose**: Create `indexing_jobs` table for job state persistence **Estimated Time**: 1.5 hours **Dependencies**: None (blocks all other phases) ### Phase 2: Core Models & Validation (BLOCKS User Stories) ⚠️ **Purpose**: Pydantic models with security validation (path traversal prevention) **Estimated Time**: 1.5 hours **Dependencies**: Phase 1 complete **⚠️ CRITICAL**: No user story work can begin until this phase is complete ### Phase 3: Database Utilities **Purpose**: Transaction-safe job state update functions **Estimated Time**: 1.5 hours **Dependencies**: Phase 2 complete ### Phase 4: Background Worker **Purpose**: Asyncio task that executes indexing and updates database **Estimated Time**: 2 hours **Dependencies**: Phase 3 complete ### Phase 5: MCP Tools - Core (User Story 1) **Purpose**: FastMCP tools for start/status operations (MVP) **Estimated Time**: 1.5 hours **Dependencies**: Phase 4 complete **Delivers**: [US1] Core background indexing functionality ### Phase 6: Enhanced Monitoring (User Story 2) 🆕 **Purpose**: Enhanced monitoring with list, ETA, and phase-specific progress **Estimated Time**: 2 hours **Dependencies**: Phase 5 complete **Delivers**: [US2] Detailed progress monitoring ### Phase 7: Indexer Enhancement (Parallel with Phase 5-6) **Purpose**: Add progress callbacks to existing indexer service **Estimated Time**: 1 hour **Dependencies**: Phase 4 started (can run parallel) **Serves**: [US1] + [US2] (dual-story support) ### Phase 8: Integration Testing **Purpose**: End-to-end workflow validation, cancellation, persistence **Estimated Time**: 2 hours **Dependencies**: Phase 6 complete ### Phase 9: Documentation & Polish **Purpose**: User documentation, .env examples, CLAUDE.md updates **Estimated Time**: 1.5 hours **Dependencies**: Phase 8 complete --- ## Detailed Tasks ### Phase 1: Database Schema & Migration **Checkpoint**: Migration created, applied, and validated ✅ --- #### T001: Create Alembic migration for indexing_jobs table [Implementation] **Dependencies**: None (Critical Path Start) **Estimated Time**: 1 hour **Parallelization**: Cannot parallelize (critical path) **User Story**: Infrastructure (no label - foundational) **Description**: Create Alembic migration to add the `indexing_jobs` table to project databases with proper schema, indexes, and constraints matching the architecture specification. **Deliverables**: - [ ] Create migration file: `migrations/versions/008_add_indexing_jobs.py` - [ ] Define table schema matching architecture doc lines 73-113 - [ ] Add all 4 performance indexes with WHERE clauses - [ ] Add CHECK constraints for status enum and progress range - [ ] Implement upgrade() function - [ ] Implement downgrade() function - [ ] Add table and column comments for documentation **Acceptance Criteria**: - [ ] Migration file passes `alembic check` - [ ] upgrade() creates table with correct 18 columns - [ ] downgrade() cleanly removes table and indexes - [ ] All indexes created with partial WHERE conditions - [ ] CHECK constraints enforce valid status values and 0-100 progress range - [ ] Comments explain purpose of key columns (repo_path, status, metadata) **Constitutional Principles**: - Principle V: Production Quality (proper indexes for <10ms queries, constraints) - Principle VIII: Type Safety (explicit PostgreSQL column types) - Principle II: Local-First (standard PostgreSQL schema, no cloud dependencies) **Technical Notes**: - Use PostgreSQL-specific types: UUID (gen_random_uuid()), TIMESTAMPTZ, JSONB - Use partial indexes for performance: `WHERE status IN ('pending', 'running')` - Reference architecture doc lines 73-129 for exact schema - Table will be created in ALL project databases (cb_proj_* pattern) **SQL Schema Reference**: ```sql CREATE TABLE indexing_jobs ( id UUID PRIMARY KEY DEFAULT gen_random_uuid(), repository_id UUID, repo_path TEXT NOT NULL, repo_name TEXT NOT NULL, project_id VARCHAR(255) NOT NULL, force_reindex BOOLEAN NOT NULL DEFAULT FALSE, status VARCHAR(20) NOT NULL CHECK (status IN ('pending', 'running', 'completed', 'failed', 'cancelled')), progress_percentage INTEGER NOT NULL DEFAULT 0 CHECK (progress_percentage >= 0 AND progress_percentage <= 100), progress_message TEXT, files_scanned INTEGER NOT NULL DEFAULT 0, files_indexed INTEGER NOT NULL DEFAULT 0, chunks_created INTEGER NOT NULL DEFAULT 0, error_message TEXT, error_type VARCHAR(255), error_traceback TEXT, created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(), started_at TIMESTAMPTZ, completed_at TIMESTAMPTZ, cancelled_at TIMESTAMPTZ, metadata JSONB DEFAULT '{}'::jsonb, worker_task_id TEXT, connection_id TEXT ); -- Indexes for US2 list queries (from Spec-Driven) CREATE INDEX idx_indexing_jobs_status ON indexing_jobs(status) WHERE status IN ('pending', 'running'); CREATE INDEX idx_indexing_jobs_project_id ON indexing_jobs(project_id); CREATE INDEX idx_indexing_jobs_created_at ON indexing_jobs(created_at DESC); CREATE INDEX idx_indexing_jobs_repo_path ON indexing_jobs(repo_path); ``` **Risks**: - Migration must apply to both registry and project databases - Need to handle existing projects (apply migration to all cb_proj_* databases) - Downgrade must not leave orphaned indexes --- #### T002: Apply migration to test project database [Migration] **Dependencies**: T001 **Estimated Time**: 15 minutes **Parallelization**: Cannot parallelize (depends on T001) **User Story**: Infrastructure **Description**: Apply the indexing_jobs migration to the test project database and validate schema correctness. **Deliverables**: - [ ] Run `alembic upgrade head` on test database - [ ] Verify table exists with `\d indexing_jobs` in psql - [ ] Verify all 4 indexes created correctly - [ ] Verify CHECK constraints active - [ ] Document migration validation steps **Acceptance Criteria**: - [ ] `SELECT * FROM indexing_jobs LIMIT 1;` succeeds (returns no rows) - [ ] All indexes appear in `\di` output - [ ] CHECK constraint violation fails: `INSERT INTO indexing_jobs (status) VALUES ('invalid');` - [ ] Progress range constraint works: `INSERT INTO indexing_jobs (progress_percentage) VALUES (101);` fails - [ ] Table comments visible: `\d+ indexing_jobs` **Constitutional Principles**: - Principle V: Production Quality (validation before proceeding) - Principle VI: Specification-First (migration validates against spec) **Technical Notes**: - Use test database: `cb_proj_test_<hash>` - Test both upgrade and downgrade paths - Validate foreign key behavior (repository_id can be NULL initially) **Risks**: - Test database may not exist yet (may need setup script) - Migration might fail on constraint naming conflicts --- #### T003: Validate migration rollback (downgrade) [Testing] **Dependencies**: T002 **Estimated Time**: 15 minutes **Parallelization**: Cannot parallelize (depends on T002) **User Story**: Infrastructure **Description**: Test the downgrade path to ensure clean rollback without leaving artifacts. **Deliverables**: - [ ] Run `alembic downgrade -1` on test database - [ ] Verify table removed: `\d indexing_jobs` fails - [ ] Verify all indexes removed - [ ] Re-upgrade to validate idempotency - [ ] Document rollback validation **Acceptance Criteria**: - [ ] After downgrade, `SELECT * FROM indexing_jobs` fails with "relation does not exist" - [ ] After downgrade, no `idx_indexing_jobs_*` indexes exist - [ ] Re-running upgrade succeeds without errors - [ ] Downgrade → upgrade → downgrade cycle works cleanly **Constitutional Principles**: - Principle V: Production Quality (safe rollback capability) - Principle VII: Test-Driven Development (validate failure paths) **Technical Notes**: - Test idempotency: upgrade twice, downgrade twice - Check for orphaned constraints or indexes - Validate with `\d` and `\di` psql commands **Risks**: - Downgrade might fail if indexes not dropped in correct order - Partial downgrade could leave database in inconsistent state --- ### Phase 2: Core Models & Validation (BLOCKS User Stories) ⚠️ **Checkpoint**: Pydantic models created with security validation ✅ **⚠️ CRITICAL**: No user story work can begin until this phase is complete. This phase provides the foundational type safety and validation that all subsequent features depend on. --- #### T004-test: Write unit tests for IndexingJobCreate validation [Testing] **Dependencies**: T001 (schema defined) **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with other test tasks **User Story**: Infrastructure **Description**: Write TDD tests for the IndexingJobCreate Pydantic model focusing on path traversal prevention and validation rules. **Deliverables**: - [ ] Create test file: `tests/unit/test_indexing_job_models.py` - [ ] Test valid absolute path acceptance - [ ] Test relative path rejection - [ ] Test path traversal rejection (../ sequences) - [ ] Test symlink path handling - [ ] Test field validation (repo_name, project_id required) - [ ] Test default values (force_reindex=False) **Acceptance Criteria**: - [ ] All tests fail initially (models don't exist yet) - [ ] Test valid path: `/Users/alice/projects/myapp` → accepted - [ ] Test relative path: `./relative/path` → ValueError - [ ] Test traversal: `/var/data/../../etc/passwd` → ValueError - [ ] Test traversal: `/tmp/../../../etc/passwd` → ValueError - [ ] Test empty repo_path: ValueError with clear message - [ ] Test empty project_id: ValueError with clear message **Constitutional Principles**: - Principle VII: Test-Driven Development (tests before implementation) - Principle V: Production Quality (comprehensive security validation) - Principle VIII: Type Safety (Pydantic validation testing) **Technical Notes**: - Use pytest with pytest-asyncio - Test both valid and invalid cases - Validate error messages are actionable - Reference architecture doc lines 174-192 for validation logic **Test Template**: ```python def test_valid_absolute_path(): """Test absolute paths are accepted.""" job_input = IndexingJobCreate( repo_path="/tmp/test-repo", repo_name="test-repo", project_id="test-project", ) assert job_input.repo_path == "/tmp/test-repo" def test_relative_path_rejection(): """Test relative paths are rejected.""" with pytest.raises(ValueError, match="must be absolute"): IndexingJobCreate( repo_path="./relative/path", repo_name="test", project_id="test", ) def test_path_traversal_rejection(): """Test path traversal attacks are rejected.""" with pytest.raises(ValueError, match="Path traversal detected"): IndexingJobCreate( repo_path="/var/data/../../etc/passwd", repo_name="test", project_id="test", ) ``` **Risks**: - Path resolution behavior varies by OS (test on Linux, macOS, Windows) - Symlinks might bypass validation if not handled --- #### T004: Create IndexingJob Pydantic models [Implementation] **Dependencies**: T004-test (tests written) **Estimated Time**: 45 minutes **Parallelization**: [P] Can run parallel with T005 **User Story**: Infrastructure **Description**: Implement Pydantic models for indexing job data: IndexingJobStatus enum, IndexingJobCreate (input), and IndexingJobProgress (output). **Deliverables**: - [ ] Create file: `src/models/indexing_job.py` - [ ] Define IndexingJobStatus enum (5 states) - [ ] Implement IndexingJobCreate with path validation - [ ] Implement IndexingJobProgress (immutable output) - [ ] Add comprehensive docstrings - [ ] Export models in `src/models/__init__.py` **Acceptance Criteria**: - [ ] All tests from T004-test pass - [ ] mypy --strict passes with no errors - [ ] IndexingJobStatus has 5 values: pending, running, completed, failed, cancelled - [ ] IndexingJobCreate validates absolute paths only - [ ] IndexingJobCreate detects path traversal (../ sequences) - [ ] IndexingJobProgress is frozen (immutable) - [ ] All fields have type hints and Field constraints **Constitutional Principles**: - Principle VIII: Type Safety (Pydantic models, mypy --strict) - Principle V: Production Quality (security validation, clear errors) - Principle VII: TDD (tests pass after implementation) **Technical Notes**: - Reference architecture doc lines 136-193 - Use pydantic.Field for constraints (min_length, ge, le) - Use pydantic.validator for custom path validation - IndexingJobProgress.Config: frozen=True for immutability - Path validation: os.path.isabs() + Path.resolve() comparison **Model Structure**: ```python from enum import Enum from datetime import datetime from pydantic import BaseModel, Field, validator from uuid import UUID class IndexingJobStatus(str, Enum): PENDING = "pending" RUNNING = "running" COMPLETED = "completed" FAILED = "failed" CANCELLED = "cancelled" class IndexingJobCreate(BaseModel): repo_path: str = Field(min_length=1) repo_name: str = Field(min_length=1) project_id: str = Field(min_length=1) force_reindex: bool = False @validator('repo_path') def validate_repo_path(cls, v: str) -> str: # Absolute path check # Path traversal detection # Return validated path pass class IndexingJobProgress(BaseModel): job_id: UUID status: IndexingJobStatus progress_percentage: int = Field(ge=0, le=100) progress_message: str files_scanned: int = Field(ge=0) files_indexed: int = Field(ge=0) chunks_created: int = Field(ge=0) error_message: str | None = None created_at: datetime started_at: datetime | None = None completed_at: datetime | None = None class Config: frozen = True ``` **Risks**: - Path validation logic might have edge cases (symlinks, Windows paths) - Enum string values must match database CHECK constraint exactly --- #### T005-test: Write unit tests for path validation logic [Testing] **Dependencies**: T001 **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with T004-test **User Story**: Infrastructure **Description**: Write comprehensive tests for path traversal prevention logic covering edge cases. **Deliverables**: - [ ] Add tests to `tests/unit/test_indexing_job_models.py` - [ ] Test multiple traversal patterns: `../`, `/../`, `/../../` - [ ] Test encoded traversal: `%2e%2e%2f` - [ ] Test symlink resolution - [ ] Test path normalization edge cases - [ ] Test Windows path handling (if applicable) **Acceptance Criteria**: - [ ] All traversal patterns rejected - [ ] Encoded traversal patterns rejected - [ ] Symlinks validated against resolved target - [ ] Clear error messages for each rejection - [ ] Tests cover both Unix and Windows path formats **Constitutional Principles**: - Principle VII: TDD (comprehensive test coverage) - Principle V: Production Quality (security edge cases) **Technical Notes**: - Test on multiple platforms if possible - Use pathlib.Path for cross-platform compatibility - Validate error messages are actionable **Edge Case Tests**: ```python @pytest.mark.parametrize("malicious_path", [ "/var/data/../../etc/passwd", "/tmp/../../../etc/passwd", "/home/user/../../../etc/shadow", "./../../sensitive", "relative/../../../etc/hosts", ]) def test_path_traversal_patterns(malicious_path): """Test various path traversal patterns are rejected.""" with pytest.raises(ValueError, match="Path traversal detected"): IndexingJobCreate( repo_path=malicious_path, repo_name="test", project_id="test", ) ``` **Risks**: - Validation might not cover all attack vectors - Cross-platform path handling differences --- ### Phase 3: Database Utilities **Checkpoint**: Transaction-safe database update functions implemented ✅ --- #### T006-test: Write unit tests for database update utilities [Testing] **Dependencies**: T002 (schema exists) **Estimated Time**: 45 minutes **Parallelization**: Cannot parallelize (requires database) **User Story**: Infrastructure **Description**: Write tests for the three database utility functions: _update_job_status, _update_job_progress, _check_cancellation. **Deliverables**: - [ ] Create test file: `tests/unit/test_indexing_db_utils.py` - [ ] Test _update_job_status with all fields - [ ] Test _update_job_progress with counters - [ ] Test _check_cancellation for cancelled jobs - [ ] Test transaction commit behavior - [ ] Test error handling and rollback **Acceptance Criteria**: - [ ] Tests initially fail (functions don't exist) - [ ] Test status update writes to database - [ ] Test progress update increments counters - [ ] Test cancellation detection returns True/False correctly - [ ] Test transaction isolation (concurrent updates) - [ ] Test error handling on invalid job_id **Constitutional Principles**: - Principle VII: TDD (tests before implementation) - Principle V: Production Quality (transaction safety) - Principle IV: Performance (efficient queries) **Technical Notes**: - Use pytest fixtures for test database setup - Test with real PostgreSQL connection (not mocks) - Validate transaction commits are visible to other connections - Reference architecture doc lines 500-664 **Test Structure**: ```python @pytest.mark.asyncio async def test_update_job_status(test_db, test_job_id): """Test updating job status persists to database.""" await _update_job_status( job_id=test_job_id, project_id="test", status=IndexingJobStatus.RUNNING, progress_percentage=10, progress_message="Starting...", ) # Verify update visible in database async with get_session(project_id="test") as db: result = await db.execute( text("SELECT status FROM indexing_jobs WHERE id = :job_id"), {"job_id": test_job_id} ) row = result.fetchone() assert row[0] == "running" ``` **Risks**: - Race conditions if not using proper transactions - Connection pool exhaustion during tests --- #### T006: Implement _update_job_status utility [Implementation] **Dependencies**: T006-test, T004 (models exist) **Estimated Time**: 45 minutes **Parallelization**: Cannot parallelize (depends on T006-test) **User Story**: Infrastructure **Description**: Implement the _update_job_status function that transactionally updates job state in PostgreSQL. **Deliverables**: - [ ] Create file: `src/mcp/tools/background_indexing.py` - [ ] Implement _update_job_status function - [ ] Dynamic SQL generation for optional fields - [ ] Transaction commit after update - [ ] Comprehensive error handling - [ ] Logging for all updates **Acceptance Criteria**: - [ ] All tests from T006-test pass - [ ] Function updates all provided fields atomically - [ ] Transaction commits successfully - [ ] Optional fields (started_at, error_message) handled correctly - [ ] Errors logged with context - [ ] mypy --strict passes **Constitutional Principles**: - Principle V: Production Quality (transaction safety, error handling) - Principle VIII: Type Safety (full type hints) - Principle IV: Performance (single UPDATE query) **Technical Notes**: - Use dynamic SQL: build UPDATE SET clause from non-None parameters - Reference architecture doc lines 503-608 - Use get_session(project_id=...) for connection - Commit transaction explicitly: await db.commit() **Function Signature**: ```python async def _update_job_status( job_id: UUID, project_id: str, status: IndexingJobStatus, progress_percentage: int, progress_message: str, started_at: datetime | None = None, completed_at: datetime | None = None, cancelled_at: datetime | None = None, error_message: str | None = None, error_type: str | None = None, error_traceback: str | None = None, files_scanned: int | None = None, files_indexed: int | None = None, chunks_created: int | None = None, repository_id: UUID | None = None, ) -> None: """Update job status in PostgreSQL (transactional).""" async with get_session(project_id=project_id) as db: # Build dynamic UPDATE query # Execute and commit pass ``` **Risks**: - SQL injection if not using parameterized queries (use text() with bindings) - Transaction might deadlock under high concurrency --- #### T007: Implement _update_job_progress and _check_cancellation [Implementation] **Dependencies**: T006 (uses _update_job_status) **Estimated Time**: 30 minutes **Parallelization**: Cannot parallelize (depends on T006) **User Story**: Infrastructure **Description**: Implement the lightweight progress update and cancellation check utilities. **Deliverables**: - [ ] Add _update_job_progress to background_indexing.py - [ ] Add _check_cancellation to background_indexing.py - [ ] Progress function delegates to _update_job_status - [ ] Cancellation function queries status column only - [ ] Add logging for cancellation detection **Acceptance Criteria**: - [ ] _update_job_progress updates percentage and message - [ ] _update_job_progress accepts optional counters (files_scanned, etc.) - [ ] _check_cancellation returns True if status='cancelled' - [ ] _check_cancellation returns False for all other statuses - [ ] _check_cancellation handles missing job_id gracefully - [ ] All functions pass mypy --strict **Constitutional Principles**: - Principle IV: Performance (lightweight queries) - Principle VIII: Type Safety (complete type hints) - Principle V: Production Quality (error handling) **Technical Notes**: - Reference architecture doc lines 610-664 - _update_job_progress is a convenience wrapper - _check_cancellation does simple SELECT query - No transaction needed for read-only cancellation check **Function Signatures**: ```python async def _update_job_progress( job_id: UUID, project_id: str, percentage: int, message: str, **kwargs, # files_scanned, files_indexed, chunks_created ) -> None: """Update job progress (lightweight version).""" await _update_job_status( job_id=job_id, project_id=project_id, status=IndexingJobStatus.RUNNING, progress_percentage=percentage, progress_message=message, **kwargs ) async def _check_cancellation(job_id: UUID, project_id: str) -> bool: """Check if job has been cancelled via database query.""" async with get_session(project_id=project_id) as db: result = await db.execute( text("SELECT status FROM indexing_jobs WHERE id = :job_id"), {"job_id": job_id} ) row = result.fetchone() if row is None: return False return row[0] == IndexingJobStatus.CANCELLED.value ``` **Risks**: - Frequent cancellation checks might impact performance (acceptable tradeoff) - Missing job_id during cancellation check needs graceful handling --- ### Phase 4: Background Worker **Checkpoint**: Asyncio background worker executes indexing and updates database ✅ --- #### T008-test: Write integration test for background worker [Testing] **Dependencies**: T007 (utilities exist) **Estimated Time**: 1 hour **Parallelization**: Cannot parallelize (requires database) **User Story**: Infrastructure **Description**: Write integration test for the complete background worker lifecycle: start → run → complete/fail. **Deliverables**: - [ ] Create test file: `tests/integration/test_background_worker.py` - [ ] Test successful indexing workflow - [ ] Test worker failure handling - [ ] Test cancellation during execution - [ ] Test progress updates written to database - [ ] Test cleanup on completion **Acceptance Criteria**: - [ ] Test initially fails (worker doesn't exist) - [ ] Test creates job, starts worker, waits for completion - [ ] Test verifies status transitions: pending → running → completed - [ ] Test verifies progress updates increment - [ ] Test verifies final counters (files_indexed, chunks_created) - [ ] Test verifies error_message populated on failure - [ ] Test verifies cleanup (no hanging connections) **Constitutional Principles**: - Principle VII: TDD (integration test before worker) - Principle V: Production Quality (lifecycle testing) - Principle IV: Performance (validate 60s target) **Technical Notes**: - Use small test repository (4-5 files) for fast execution - Test with asyncio.wait_for() for timeout safety - Verify worker task doesn't leak resources - Reference architecture doc lines 330-496 **Test Structure**: ```python @pytest.mark.asyncio async def test_background_worker_success(test_repo_path, test_db): """Test successful background indexing workflow.""" # Create job record async with get_session(project_id="test") as db: result = await db.execute( text("INSERT INTO indexing_jobs (...) VALUES (...) RETURNING id") ) job_id = result.fetchone()[0] await db.commit() # Start worker worker_task = asyncio.create_task( _background_indexing_worker( job_id=job_id, repo_path=str(test_repo_path), repo_name="test-repo", project_id="test", force_reindex=False, ) ) # Wait for completion (with timeout) await asyncio.wait_for(worker_task, timeout=30) # Verify final status async with get_session(project_id="test") as db: result = await db.execute( text("SELECT status, files_indexed FROM indexing_jobs WHERE id = :job_id"), {"job_id": job_id} ) row = result.fetchone() assert row[0] == "completed" assert row[1] > 0 ``` **Risks**: - Test might timeout if repository too large - Worker might not clean up properly on failure --- #### T008: Implement _background_indexing_worker function [Implementation] **Dependencies**: T008-test, T007 (utilities exist) **Estimated Time**: 1 hour **Parallelization**: Cannot parallelize (critical path) **User Story**: Infrastructure **Description**: Implement the background worker function that executes the indexing workflow and updates PostgreSQL with progress. **Deliverables**: - [ ] Add _background_indexing_worker to background_indexing.py - [ ] Implement status transition: pending → running - [ ] Define progress callback that updates database - [ ] Call index_repository with progress callback - [ ] Handle successful completion - [ ] Handle asyncio.CancelledError (user cancellation) - [ ] Handle generic exceptions (failures) - [ ] Ensure cleanup in finally block **Acceptance Criteria**: - [ ] All tests from T008-test pass - [ ] Worker transitions status: pending → running → completed/failed/cancelled - [ ] Progress callback invoked at milestones - [ ] Progress callback detects cancellation requests - [ ] Errors captured with full traceback - [ ] Worker completes even if index_repository fails - [ ] No resource leaks (connections closed) **Constitutional Principles**: - Principle V: Production Quality (error handling, cleanup) - Principle VIII: Type Safety (mypy --strict) - Principle IV: Performance (non-blocking execution) **Technical Notes**: - Reference architecture doc lines 330-496 - Use try/except/finally for cleanup - Progress callback: async def progress_callback(message: str, percentage: int, **kwargs) - Catch asyncio.CancelledError separately from Exception - Log all state transitions **Worker Structure**: ```python async def _background_indexing_worker( job_id: UUID, repo_path: str, repo_name: str, project_id: str, force_reindex: bool, ) -> None: """Background worker that executes indexing and updates PostgreSQL.""" try: # Update to running await _update_job_status( job_id=job_id, project_id=project_id, status=IndexingJobStatus.RUNNING, progress_percentage=0, progress_message="Starting indexing...", started_at=datetime.now(timezone.utc), ) # Define progress callback async def progress_callback(message: str, percentage: int, **kwargs) -> None: # Check cancellation if await _check_cancellation(job_id, project_id): raise asyncio.CancelledError("Job cancelled by user") # Update progress await _update_job_progress(job_id, project_id, percentage, message, **kwargs) # Run indexing async with get_session(project_id=project_id) as db: result = await index_repository( repo_path=Path(repo_path), name=repo_name, db=db, project_id=project_id, force_reindex=force_reindex, progress_callback=progress_callback, ) # Update to completed await _update_job_status( job_id=job_id, project_id=project_id, status=IndexingJobStatus.COMPLETED, progress_percentage=100, progress_message=f"Completed: {result.files_indexed} files", completed_at=datetime.now(timezone.utc), files_indexed=result.files_indexed, chunks_created=result.chunks_created, repository_id=result.repository_id, ) except asyncio.CancelledError: # User cancellation await _update_job_status( job_id=job_id, project_id=project_id, status=IndexingJobStatus.CANCELLED, progress_percentage=0, progress_message="Job cancelled by user", cancelled_at=datetime.now(timezone.utc), ) except Exception as e: # Failure import traceback await _update_job_status( job_id=job_id, project_id=project_id, status=IndexingJobStatus.FAILED, progress_percentage=0, progress_message=f"Error: {str(e)}", completed_at=datetime.now(timezone.utc), error_message=str(e), error_type=type(e).__name__, error_traceback=traceback.format_exc(), ) finally: # Cleanup logger.debug(f"Background worker cleanup for job {job_id}") ``` **Risks**: - Worker might hang if index_repository blocks - Cancellation detection might not be frequent enough (acceptable: checked per progress update) --- ### Phase 5: MCP Tools - Core (User Story 1) **Checkpoint**: FastMCP tools for start/status operations implemented ✅ **Delivers**: [US1] Core background indexing functionality (MVP) --- #### T009-test: Write contract tests for MCP tools [Testing] **Dependencies**: T002 (schema exists) **Estimated Time**: 45 minutes **Parallelization**: [P] Can run parallel with other tests **User Story**: [US1] Core background indexing **Description**: Write MCP contract tests for the two core background indexing tools: start_indexing_background, get_indexing_status. **Deliverables**: - [ ] Create test file: `tests/contract/test_background_indexing_tools.py` - [ ] Test start_indexing_background returns job_id - [ ] Test get_indexing_status returns progress - [ ] Validate MCP response schemas - [ ] Test error cases (invalid job_id, missing repo_path) **Acceptance Criteria**: - [ ] Tests initially fail (tools don't exist) - [ ] Test validates start returns dict with job_id key - [ ] Test validates status returns progress_percentage (0-100) - [ ] Test validates error responses for invalid inputs - [ ] Test validates MCP schema compliance (Pydantic models) **Constitutional Principles**: - Principle VII: TDD (contract tests before tools) - Principle III: Protocol Compliance (MCP response validation) - Principle VIII: Type Safety (Pydantic response models) **Technical Notes**: - Use @pytest.mark.contract for test categorization - Validate response schemas match architecture doc - Test both success and error paths - Reference architecture doc lines 197-838 **Test Structure**: ```python @pytest.mark.contract @pytest.mark.asyncio async def test_start_indexing_background_returns_job_id(test_repo_path): """Test start_indexing_background returns job_id immediately.""" result = await start_indexing_background( repo_path=str(test_repo_path), project_id="test", ) assert "job_id" in result assert "status" in result assert result["status"] == "pending" assert "project_id" in result assert "database_name" in result @pytest.mark.contract @pytest.mark.asyncio async def test_get_indexing_status_returns_progress(): """Test get_indexing_status returns progress information.""" # Create test job job_id = "test-job-id" result = await get_indexing_status(job_id=job_id, project_id="test") assert "job_id" in result assert "status" in result assert "progress_percentage" in result assert 0 <= result["progress_percentage"] <= 100 ``` **Risks**: - Contract tests might be flaky if database state not cleaned up - MCP schema changes might break tests --- #### T009: Implement start_indexing_background MCP tool [Implementation] **Dependencies**: T009-test, T008 (worker exists) **Estimated Time**: 45 minutes **Parallelization**: Cannot parallelize (depends on T009-test) **User Story**: [US1] Core background indexing **Description**: Implement the start_indexing_background FastMCP tool that creates a job record and launches the background worker. **Deliverables**: - [ ] Add @mcp.tool() decorator to start_indexing_background - [ ] Resolve project_id via 4-tier chain - [ ] Validate input with IndexingJobCreate - [ ] Insert job record with status=pending - [ ] Launch worker via asyncio.create_task() - [ ] Return job_id and metadata immediately - [ ] Add comprehensive docstring **Acceptance Criteria**: - [ ] Tool registered in FastMCP server - [ ] Path validation rejects relative paths - [ ] Path validation rejects traversal attempts - [ ] Job record created in database with status=pending - [ ] Worker task started asynchronously - [ ] Function returns immediately (non-blocking) - [ ] Response includes job_id, status, project_id, database_name **Constitutional Principles**: - Principle XI: FastMCP Foundation (@mcp.tool() decorator) - Principle V: Production Quality (input validation, error handling) - Principle VIII: Type Safety (Pydantic models) - Principle IV: Performance (non-blocking) **Technical Notes**: - Reference architecture doc lines 197-326 - Use IndexingJobCreate for validation (catches path traversal) - Use resolve_project_id for 4-tier resolution - asyncio.create_task() returns immediately, task runs in background - Log job creation with context **Function Structure**: ```python @mcp.tool() async def start_indexing_background( repo_path: str, project_id: str | None = None, force_reindex: bool = False, ctx: Context | None = None, ) -> dict[str, Any]: """Start repository indexing in the background (non-blocking). Returns immediately with job_id. Use get_indexing_status(job_id) to poll progress. Args: repo_path: Absolute path to repository (validated) project_id: Optional project identifier (resolved via 4-tier chain) force_reindex: If True, re-index even if already indexed ctx: FastMCP Context for session-based project resolution Returns: { "job_id": "uuid", "status": "pending", "message": "Indexing job started", "project_id": "resolved_project_id", "database_name": "cb_proj_xxx" } Raises: ValueError: If repo_path validation fails (path traversal, not absolute) """ # Resolve project_id resolved_id, database_name = await resolve_project_id( explicit_id=project_id, ctx=ctx, ) # Validate input (includes path traversal check) job_input = IndexingJobCreate( repo_path=repo_path, repo_name=Path(repo_path).name, project_id=resolved_id, force_reindex=force_reindex, ) # Create job record async with get_session(project_id=resolved_id, ctx=ctx) as db: result = await db.execute( text(""" INSERT INTO indexing_jobs (repo_path, repo_name, project_id, force_reindex, status, progress_message) VALUES (:repo_path, :repo_name, :project_id, :force_reindex, 'pending', 'Job queued') RETURNING id """), { "repo_path": job_input.repo_path, "repo_name": job_input.repo_name, "project_id": job_input.project_id, "force_reindex": job_input.force_reindex, } ) row = result.fetchone() job_id = row[0] await db.commit() # Start background worker asyncio.create_task( _background_indexing_worker( job_id=job_id, repo_path=job_input.repo_path, repo_name=job_input.repo_name, project_id=resolved_id, force_reindex=job_input.force_reindex, ) ) logger.info(f"Indexing job created: {job_id}") if ctx: await ctx.info(f"Indexing started in background. Job ID: {job_id}") return { "job_id": str(job_id), "status": "pending", "message": "Indexing job started", "project_id": resolved_id, "database_name": database_name, } ``` **Risks**: - Worker task might fail silently if not properly monitored - Path validation might have edge cases --- #### T010: Implement get_indexing_status MCP tool [Implementation] **Dependencies**: T009 **Estimated Time**: 45 minutes **Parallelization**: Cannot parallelize (depends on T009) **User Story**: [US1] Core background indexing **Description**: Implement the status query MCP tool. **Deliverables**: - [ ] Add @mcp.tool() to get_indexing_status - [ ] Query indexing_jobs table by job_id - [ ] Return comprehensive progress information - [ ] Handle invalid job_id errors - [ ] Add docstring **Acceptance Criteria**: - [ ] get_indexing_status returns all progress fields - [ ] get_indexing_status handles missing job_id gracefully - [ ] Tool resolves project_id correctly - [ ] Tool passes mypy --strict **Constitutional Principles**: - Principle XI: FastMCP Foundation (@mcp.tool() decorator) - Principle VIII: Type Safety (complete type hints) - Principle IV: Performance (simple SELECT query) **Technical Notes**: - Reference architecture doc lines 667-838 - get_indexing_status is read-only (no transaction) - Use IndexingJobProgress model for response **Function Signature**: ```python @mcp.tool() async def get_indexing_status( job_id: str, project_id: str | None = None, ctx: Context | None = None, ) -> dict[str, Any]: """Get status of a background indexing job. Returns: { "job_id": "uuid", "status": "running", "progress_percentage": 45, "progress_message": "Generating embeddings...", "files_scanned": 10585, "files_indexed": 5000, "chunks_created": 45000, ... } """ resolved_id, _ = await resolve_project_id(explicit_id=project_id, ctx=ctx) async with get_session(project_id=resolved_id, ctx=ctx) as db: result = await db.execute( text("SELECT * FROM indexing_jobs WHERE id = :job_id"), {"job_id": job_id} ) row = result.fetchone() if row is None: raise ValueError(f"Job not found: {job_id}") # Convert to dict and return return {...} ``` **Risks**: - Status query might be stale if cached --- ### Phase 6: Enhanced Monitoring (User Story 2) 🆕 **Checkpoint**: Enhanced monitoring with list, ETA, and detailed progress ✅ **Delivers**: [US2] Detailed progress monitoring capabilities --- #### T017-test: Write tests for list_background_jobs filtering [Testing] **Dependencies**: T002 (schema exists) **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with other tests **User Story**: [US2] Enhanced monitoring **Description**: Write tests for the list_background_jobs MCP tool with filtering and pagination. **Deliverables**: - [ ] Create test file: `tests/contract/test_list_background_jobs.py` - [ ] Test list without filters (all jobs) - [ ] Test filter by status - [ ] Test filter by project_id - [ ] Test pagination (limit/offset) - [ ] Test ordering by created_at DESC **Acceptance Criteria**: - [ ] Tests initially fail (tool doesn't exist) - [ ] Test validates response schema (list of jobs) - [ ] Test validates filtering works correctly - [ ] Test validates pagination limits enforced (max 100) - [ ] Test validates ordering is newest first **Constitutional Principles**: - Principle VII: TDD (tests before implementation) - Principle IV: Performance (query <200ms target) **Technical Notes**: - Use test fixtures to create multiple jobs - Test boundary conditions (empty results, max results) - Validate response times meet <200ms target **Test Structure**: ```python @pytest.mark.contract @pytest.mark.asyncio async def test_list_background_jobs_filter_by_status(): """Test filtering jobs by status.""" # Create jobs with different statuses running_job = await create_test_job(status="running") completed_job = await create_test_job(status="completed") result = await list_background_jobs(status="running", project_id="test") assert len(result["jobs"]) == 1 assert result["jobs"][0]["id"] == str(running_job) assert result["total_count"] == 1 ``` **Risks**: - Test might be flaky without proper isolation - Query performance might degrade with many jobs --- #### T017: Implement list_background_jobs MCP tool [Implementation] **Dependencies**: T017-test, T010 (status tool exists) **Estimated Time**: 45 minutes **Parallelization**: Cannot parallelize (depends on T017-test) **User Story**: [US2] Enhanced monitoring **Description**: Implement the list_background_jobs FastMCP tool that queries jobs with filtering and pagination. **Deliverables**: - [ ] Add @mcp.tool() decorator to list_background_jobs - [ ] Implement status filter - [ ] Implement project_id filter - [ ] Implement pagination (limit, offset) - [ ] Order by created_at DESC (newest first) - [ ] Enforce max limit of 100 results - [ ] Add comprehensive docstring **Acceptance Criteria**: - [ ] Tool registered in FastMCP server - [ ] Filters work correctly (status, project_id) - [ ] Pagination enforced (default limit=10, max=100) - [ ] Results ordered newest first - [ ] Query completes in <200ms p95 - [ ] Response includes total_count for pagination **Constitutional Principles**: - Principle XI: FastMCP Foundation (@mcp.tool() decorator) - Principle IV: Performance (<200ms query target) - Principle VIII: Type Safety (Pydantic models) **Technical Notes**: - Use database indexes created in T001 for performance - Dynamic WHERE clause based on filters - Reference spec.md FR-007 for requirements **Function Structure**: ```python @mcp.tool() async def list_background_jobs( project_id: str | None = None, status: str | None = None, limit: int = 10, offset: int = 0, ctx: Context | None = None, ) -> dict[str, Any]: """List background indexing jobs with filtering and pagination. Args: project_id: Filter by project (resolved via 4-tier chain if None) status: Filter by status (pending, running, completed, failed, cancelled) limit: Max results to return (default 10, max 100) offset: Number of results to skip for pagination Returns: { "jobs": [...], # List of job objects "total_count": 42, "limit": 10, "offset": 0 } """ resolved_id, _ = await resolve_project_id(explicit_id=project_id, ctx=ctx) # Enforce max limit limit = min(limit, 100) # Build dynamic WHERE clause where_clauses = ["project_id = :project_id"] params = {"project_id": resolved_id, "limit": limit, "offset": offset} if status: where_clauses.append("status = :status") params["status"] = status where_sql = " AND ".join(where_clauses) async with get_session(project_id=resolved_id, ctx=ctx) as db: # Get total count count_result = await db.execute( text(f"SELECT COUNT(*) FROM indexing_jobs WHERE {where_sql}"), params ) total_count = count_result.scalar() # Get paginated results result = await db.execute( text(f""" SELECT * FROM indexing_jobs WHERE {where_sql} ORDER BY created_at DESC LIMIT :limit OFFSET :offset """), params ) rows = result.fetchall() jobs = [dict(row._mapping) for row in rows] return { "jobs": jobs, "total_count": total_count, "limit": limit, "offset": offset, } ``` **Risks**: - Query performance might degrade without proper indexes - Large result sets might cause memory issues --- #### T018-test: Write tests for ETA calculation [Testing] **Dependencies**: T010 (status tool exists) **Estimated Time**: 20 minutes **Parallelization**: [P] Can run parallel with other tests **User Story**: [US2] Enhanced monitoring **Description**: Write tests for estimated completion time calculation logic. **Deliverables**: - [ ] Add tests to `tests/unit/test_background_indexing.py` - [ ] Test ETA calculation based on progress rate - [ ] Test ETA when no progress yet (return None) - [ ] Test ETA accuracy validation (within 20% of actual) **Acceptance Criteria**: - [ ] Tests validate ETA formula correctness - [ ] Tests handle edge cases (0% progress, 100% progress) - [ ] Tests validate ETA is reasonable (not negative, not infinite) **Constitutional Principles**: - Principle VII: TDD (tests before implementation) - Principle IV: Performance (ETA calculation <10ms) **Technical Notes**: - ETA = (elapsed_time / progress_percentage) * (100 - progress_percentage) - Return None if progress_percentage == 0 - Return 0 if progress_percentage == 100 **Test Structure**: ```python def test_calculate_eta_mid_progress(): """Test ETA calculation at 50% progress.""" elapsed_seconds = 60 # 1 minute elapsed progress_percentage = 50 eta = calculate_eta(elapsed_seconds, progress_percentage) assert eta == 60 # 1 more minute remaining def test_calculate_eta_no_progress(): """Test ETA returns None when no progress yet.""" eta = calculate_eta(elapsed_seconds=30, progress_percentage=0) assert eta is None ``` **Risks**: - ETA might be inaccurate if processing rate varies significantly --- #### T018: Implement estimated completion time in get_indexing_status [Implementation] **Dependencies**: T018-test, T010 (status tool exists) **Estimated Time**: 30 minutes **Parallelization**: Cannot parallelize (depends on T018-test) **User Story**: [US2] Enhanced monitoring **Description**: Add estimated completion time calculation to get_indexing_status response. **Deliverables**: - [ ] Add calculate_eta() helper function - [ ] Integrate ETA calculation into get_indexing_status - [ ] Add estimated_completion_at field to response - [ ] Add estimated_seconds_remaining field to response **Acceptance Criteria**: - [ ] All tests from T018-test pass - [ ] ETA calculated based on elapsed time and progress - [ ] ETA returns None if progress_percentage == 0 - [ ] ETA returns 0 if progress_percentage == 100 - [ ] ETA accuracy within 20% of actual (validated in integration tests) **Constitutional Principles**: - Principle IV: Performance (calculation overhead <10ms) - Principle VIII: Type Safety (proper typing for optional fields) **Technical Notes**: - Reference spec.md FR-005 for requirements - Use datetime arithmetic for ETA calculation - Handle edge cases gracefully **Implementation**: ```python def calculate_eta(elapsed_seconds: float, progress_percentage: int) -> int | None: """Calculate estimated seconds remaining. Returns None if no progress yet, 0 if complete. """ if progress_percentage == 0: return None if progress_percentage == 100: return 0 rate = elapsed_seconds / progress_percentage remaining = rate * (100 - progress_percentage) return int(remaining) # In get_indexing_status: if row["started_at"] and row["status"] == "running": elapsed = (datetime.now(timezone.utc) - row["started_at"]).total_seconds() eta_seconds = calculate_eta(elapsed, row["progress_percentage"]) if eta_seconds is not None: result["estimated_seconds_remaining"] = eta_seconds result["estimated_completion_at"] = ( datetime.now(timezone.utc) + timedelta(seconds=eta_seconds) ).isoformat() ``` **Risks**: - ETA might fluctuate if processing rate varies - Users might rely too heavily on estimates --- #### T019-test: Write tests for phase-specific progress messages [Testing] **Dependencies**: T008 (worker exists) **Estimated Time**: 15 minutes **Parallelization**: [P] Can run parallel with other tests **User Story**: [US2] Enhanced monitoring **Description**: Write tests to validate phase-specific progress messages are meaningful and accurate. **Deliverables**: - [ ] Add tests to `tests/unit/test_indexer.py` - [ ] Test scanning phase message: "Scanning repository..." - [ ] Test chunking phase message: "Chunking files: X/Y" - [ ] Test embedding phase message: "Generating embeddings: X/Y" - [ ] Test writing phase message: "Writing to database..." **Acceptance Criteria**: - [ ] Tests validate message format and content - [ ] Tests validate file counts in messages are accurate - [ ] Tests validate messages update as work progresses **Constitutional Principles**: - Principle VII: TDD (tests before implementation) - Principle VIII: Type Safety (message string validation) **Technical Notes**: - Use mock progress callback to capture messages - Validate messages at different progress percentages **Test Structure**: ```python @pytest.mark.asyncio async def test_progress_messages_during_chunking(): """Test chunking phase shows file counts.""" messages = [] async def mock_callback(message: str, percentage: int, **kwargs): messages.append(message) await index_repository( repo_path=test_repo, name="test", db=test_db, project_id="test", progress_callback=mock_callback, ) # Find chunking messages chunking_msgs = [m for m in messages if "Chunking files" in m] assert len(chunking_msgs) > 0 assert "100/100" in chunking_msgs[-1] # Final count ``` **Risks**: - Message formats might change during implementation - Tests might be brittle if tied to exact wording --- #### T019: Add phase-specific progress messages to indexer [Implementation] **Dependencies**: T019-test, T011 (progress callback exists) **Estimated Time**: 30 minutes **Parallelization**: Cannot parallelize (depends on T019-test) **User Story**: [US2] Enhanced monitoring **Description**: Enhance progress callback invocations in indexer to include phase-specific messages with work counts. **Deliverables**: - [ ] Update scanning phase message: "Scanning repository..." - [ ] Update chunking phase message: "Chunking files: {current}/{total}" - [ ] Update embedding phase message: "Generating embeddings: {current}/{total}" - [ ] Update writing phase message: "Writing to database..." **Acceptance Criteria**: - [ ] All tests from T019-test pass - [ ] Messages clearly indicate current phase - [ ] Messages include progress counts where applicable - [ ] Messages are user-friendly and actionable **Constitutional Principles**: - Principle V: Production Quality (clear user communication) - Principle VIII: Type Safety (string formatting safety) **Technical Notes**: - Reference spec.md User Story 2 for requirements - Use f-strings for message formatting - Keep messages concise (<80 chars) **Implementation**: ```python # In index_repository(): # After scanning (10%) if progress_callback: await progress_callback( f"Scanning repository...", 10, files_scanned=len(all_files) ) # During chunking (10-50%) for i, batch in enumerate(batches): percentage = 10 + int(40 * (i / total_batches)) if progress_callback: await progress_callback( f"Chunking files: {i * batch_size}/{total_files}", percentage, files_indexed=i * batch_size ) # During embedding (50-90%) for i, batch in enumerate(embedding_batches): percentage = 50 + int(40 * (i / total_batches)) if progress_callback: await progress_callback( f"Generating embeddings: {i * batch_size}/{total_chunks}", percentage, chunks_created=i * batch_size ) # Before database writes (95%) if progress_callback: await progress_callback( "Writing to database...", 95, files_indexed=total_files, chunks_created=total_chunks ) ``` **Risks**: - Message generation might add overhead to indexing - Users might expect more granular updates --- #### T020-test: Write tests for phase percentage calculation [Testing] **Dependencies**: T008 (worker exists) **Estimated Time**: 15 minutes **Parallelization**: [P] Can run parallel with other tests **User Story**: [US2] Enhanced monitoring **Description**: Write tests to validate phase percentage calculations are accurate and monotonically increasing. **Deliverables**: - [ ] Add tests to `tests/unit/test_indexer.py` - [ ] Test scanning phase: 0-10% - [ ] Test chunking phase: 10-50% - [ ] Test embedding phase: 50-90% - [ ] Test writing phase: 90-100% - [ ] Test percentages are monotonically increasing **Acceptance Criteria**: - [ ] Tests validate percentage ranges for each phase - [ ] Tests validate percentages never decrease - [ ] Tests validate final percentage is exactly 100 **Constitutional Principles**: - Principle VII: TDD (tests before implementation) - Principle IV: Performance (calculation overhead minimal) **Technical Notes**: - Capture all progress callbacks - Validate percentage progression - Check for gaps or overlaps in ranges **Test Structure**: ```python @pytest.mark.asyncio async def test_progress_percentages_monotonic(): """Test progress percentages always increase.""" percentages = [] async def mock_callback(message: str, percentage: int, **kwargs): percentages.append(percentage) await index_repository( repo_path=test_repo, name="test", db=test_db, project_id="test", progress_callback=mock_callback, ) # Validate monotonic increase for i in range(1, len(percentages)): assert percentages[i] >= percentages[i-1] # Validate final is 100 assert percentages[-1] == 100 ``` **Risks**: - Rounding errors might cause percentage jumps - Phase boundaries might overlap --- #### T020: Implement phase percentage calculation in indexer [Implementation] **Dependencies**: T020-test, T019 (phase messages exist) **Estimated Time**: 30 minutes **Parallelization**: Cannot parallelize (depends on T020-test) **User Story**: [US2] Enhanced monitoring **Description**: Implement accurate phase percentage calculations in indexer with smooth progression. **Deliverables**: - [ ] Calculate scanning phase percentages: 0-10% - [ ] Calculate chunking phase percentages: 10-50% - [ ] Calculate embedding phase percentages: 50-90% - [ ] Calculate writing phase percentages: 90-100% - [ ] Ensure monotonic progression **Acceptance Criteria**: - [ ] All tests from T020-test pass - [ ] Percentages are accurate for each phase - [ ] Percentages never decrease - [ ] Final percentage is exactly 100 **Constitutional Principles**: - Principle IV: Performance (calculation overhead <1ms) - Principle VIII: Type Safety (integer percentage validation) **Technical Notes**: - Use integer arithmetic to avoid floating point errors - Reference spec.md for phase ranges - Validate percentages at phase boundaries **Implementation**: ```python # Phase percentage ranges PHASE_SCANNING = (0, 10) PHASE_CHUNKING = (10, 50) PHASE_EMBEDDING = (50, 90) PHASE_WRITING = (90, 100) def calculate_phase_percentage( phase: tuple[int, int], current: int, total: int ) -> int: """Calculate percentage within a phase range.""" start, end = phase range_size = end - start if total == 0: return start progress = min(current / total, 1.0) # Cap at 100% return start + int(range_size * progress) # In index_repository(): # Scanning (0-10%) percentage = calculate_phase_percentage(PHASE_SCANNING, len(scanned_files), total_files) # Chunking (10-50%) percentage = calculate_phase_percentage(PHASE_CHUNKING, files_chunked, total_files) # Embedding (50-90%) percentage = calculate_phase_percentage(PHASE_EMBEDDING, chunks_embedded, total_chunks) # Writing (90-100%) percentage = calculate_phase_percentage(PHASE_WRITING, rows_written, total_rows) ``` **Risks**: - Rounding might cause small gaps in progression - Phase boundaries might not align perfectly --- #### T021: Verify database indexes for list queries [Validation] **Dependencies**: T001 (migration created), T017 (list tool exists) **Estimated Time**: 15 minutes **Parallelization**: [P] Can run parallel with other tasks **User Story**: [US2] Enhanced monitoring **Description**: Verify that database indexes created in T001 migration support efficient list queries. **Deliverables**: - [ ] Run EXPLAIN ANALYZE on list queries - [ ] Verify indexes are used (not sequential scans) - [ ] Validate query times <200ms p95 - [ ] Document index usage patterns **Acceptance Criteria**: - [ ] idx_indexing_jobs_status used for status filter - [ ] idx_indexing_jobs_project_id used for project filter - [ ] idx_indexing_jobs_created_at used for ordering - [ ] Query plan shows Index Scan, not Seq Scan - [ ] Query times consistently <200ms **Constitutional Principles**: - Principle IV: Performance (<200ms query target) - Principle V: Production Quality (validation before deployment) **Technical Notes**: - Use PostgreSQL EXPLAIN ANALYZE - Test with realistic data volumes (100+ jobs) - Verify partial index effectiveness **Validation Script**: ```sql -- Test status filter performance EXPLAIN ANALYZE SELECT * FROM indexing_jobs WHERE status = 'running' ORDER BY created_at DESC LIMIT 10; -- Expected: Index Scan using idx_indexing_jobs_status -- Expected execution time: <50ms -- Test project filter performance EXPLAIN ANALYZE SELECT * FROM indexing_jobs WHERE project_id = 'test-project' ORDER BY created_at DESC LIMIT 10; -- Expected: Index Scan using idx_indexing_jobs_project_id -- Expected execution time: <50ms ``` **Risks**: - Indexes might not be effective with small datasets - Query planner might choose sequential scan for small tables --- ### Phase 7: Indexer Enhancement (Parallel with Phase 5-6) **Checkpoint**: Existing indexer service supports progress callbacks ✅ **Serves**: [US1] + [US2] (dual-story support) --- #### T011-test: Write tests for progress callback integration [Testing] **Dependencies**: T002 (schema exists) **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with Phase 5 tests **User Story**: [US1] + [US2] (dual label) **Description**: Write tests for progress callback integration in the index_repository service. **Deliverables**: - [ ] Add tests to `tests/unit/test_indexer.py` - [ ] Test callback invoked at scanning milestone - [ ] Test callback invoked during chunking - [ ] Test callback invoked during embedding - [ ] Test callback receives correct percentages - [ ] Test callback receives counters (files_scanned, etc.) **Acceptance Criteria**: - [ ] Tests initially fail (callback parameter doesn't exist) - [ ] Test callback invoked at 10% (scanning complete) - [ ] Test callback invoked during 10-50% (chunking) - [ ] Test callback invoked during 50-90% (embedding) - [ ] Test callback invoked at 95% (writing) - [ ] Test callback receives files_scanned count - [ ] Test indexer works without callback (optional parameter) **Constitutional Principles**: - Principle VII: TDD (tests before implementation) - Principle IV: Performance (validate callback overhead minimal) **Technical Notes**: - Use mock callback to capture invocations - Validate percentage progression: 10% → 50% → 90% → 100% - Ensure callback is optional (backward compatibility) - Reference architecture doc lines 842-910 **Test Structure**: ```python @pytest.mark.asyncio async def test_progress_callback_invoked(test_repo_path, test_db): """Test progress callback invoked at key milestones.""" invocations = [] async def mock_callback(message: str, percentage: int, **kwargs): invocations.append({ "message": message, "percentage": percentage, "kwargs": kwargs, }) result = await index_repository( repo_path=test_repo_path, name="test-repo", db=test_db, project_id="test", progress_callback=mock_callback, ) # Verify callback invocations assert len(invocations) >= 3 # At least 3 milestones assert invocations[0]["percentage"] == 10 # Scanning assert "files_scanned" in invocations[0]["kwargs"] ``` **Risks**: - Callback overhead might impact performance (measure with benchmarks) - Callback exceptions might crash indexing (needs error handling) --- #### T011: Add progress_callback parameter to index_repository [Implementation] **Dependencies**: T011-test **Estimated Time**: 1 hour **Parallelization**: [P] Can run parallel with Phase 5-6 work **User Story**: [US1] + [US2] (dual label) **Description**: Enhance the existing index_repository function to accept an optional progress_callback parameter and invoke it at key milestones. **Deliverables**: - [ ] Add progress_callback parameter to index_repository signature - [ ] Invoke callback after scanning (10%, files_scanned) - [ ] Invoke callback during chunking (10-50%, files_indexed) - [ ] Invoke callback during embedding (50-90%, chunks_created) - [ ] Invoke callback before database writes (95%) - [ ] Handle callback exceptions gracefully - [ ] Maintain backward compatibility (callback optional) **Acceptance Criteria**: - [ ] All tests from T011-test pass - [ ] Callback invoked at 4+ milestones - [ ] Progress percentages match specification (10, 50, 90, 95) - [ ] Counters passed to callback are accurate - [ ] Callback exceptions logged but don't crash indexing - [ ] Indexing works without callback (None default) **Constitutional Principles**: - Principle IV: Performance (callback overhead <5%) - Principle V: Production Quality (error handling for callback failures) - Principle VIII: Type Safety (proper callback signature typing) **Technical Notes**: - Reference architecture doc lines 842-910 - Callback signature: `Callable[[str, int], Awaitable[None]]` - Calculate percentages based on batch progress - Use try/except around callback invocations - Log callback exceptions but continue indexing **Code Changes**: ```python # In src/services/indexer.py from typing import Callable, Awaitable async def index_repository( repo_path: Path, name: str, db: AsyncSession, project_id: str, force_reindex: bool = False, progress_callback: Callable[[str, int], Awaitable[None]] | None = None, # NEW ) -> IndexResult: """Index repository with progress callbacks. Args: ... progress_callback: Optional async function(message: str, percentage: int, **kwargs) Called at key milestones with progress updates """ # After scanning if progress_callback: try: await progress_callback( "Scanning repository...", 10, files_scanned=len(all_files) ) except Exception as e: logger.error(f"Progress callback failed: {e}") # During chunking for i, batch in enumerate(batches): percentage = 10 + int(40 * (i / total_batches)) if progress_callback: try: await progress_callback( f"Chunking files: {i * batch_size}/{total_files}", percentage, files_indexed=i * batch_size ) except Exception as e: logger.error(f"Progress callback failed: {e}") # During embedding (50-90%) # Before database writes (95%) # Continue existing logic... ``` **Risks**: - Callback overhead might slow down indexing (measure with benchmarks) - Callback exceptions might lose progress tracking --- ### Phase 8: Integration Testing **Checkpoint**: End-to-end workflows validated ✅ **Independent Test**: Start indexing on 15K file repository, verify user receives immediate confirmation (<1s), poll status until completion, verify job completes successfully with accurate summary. --- #### T012: Write integration test for complete workflow [Testing] **Dependencies**: T021 (all Phase 6 tasks complete) **Estimated Time**: 1 hour **Parallelization**: Cannot parallelize (requires full system) **User Story**: [US1] + [US2] (validates both stories) **Description**: Write comprehensive integration test for the complete background indexing workflow: start → poll → complete. **Deliverables**: - [ ] Create test file: `tests/integration/test_background_indexing_workflow.py` - [ ] Test small repository (4 files, fast completion) - [ ] Test polling loop with timeout - [ ] Verify final status and counters - [ ] Test force_reindex flag behavior - [ ] Test error handling (invalid repo path) **Acceptance Criteria**: - [ ] Test creates job via start_indexing_background - [ ] Test polls get_indexing_status every 2 seconds - [ ] Test verifies status transitions: pending → running → completed - [ ] Test verifies files_indexed > 0 and chunks_created > 0 - [ ] Test completes within 30 seconds (small repo) - [ ] Test validates progress_percentage reaches 100 - [ ] Test verifies repository_id populated in final status - [ ] Test validates ETA calculation (US2 requirement) **Constitutional Principles**: - Principle VII: TDD (comprehensive integration testing) - Principle IV: Performance (validate 60s target for 10K files) - Principle V: Production Quality (end-to-end validation) **Technical Notes**: - Use pytest fixtures for test repository (4-5 Python files) - Poll with asyncio.sleep(2) between status checks - Use asyncio.wait_for() for timeout safety (30s max) - Clean up job records after test **Test Structure**: ```python @pytest.mark.integration @pytest.mark.asyncio async def test_background_indexing_complete_workflow(test_repo_path): """Test complete background indexing workflow.""" # Start job result = await start_indexing_background( repo_path=str(test_repo_path), project_id="test", ) job_id = result["job_id"] # Poll until completion max_attempts = 15 # 30 seconds max (2s polling interval) for attempt in range(max_attempts): status = await get_indexing_status(job_id=job_id, project_id="test") if status["status"] in ["completed", "failed"]: break await asyncio.sleep(2) else: pytest.fail("Job did not complete within 30 seconds") # Verify completion assert status["status"] == "completed" assert status["progress_percentage"] == 100 assert status["files_indexed"] > 0 assert status["chunks_created"] > 0 assert status["completed_at"] is not None # US2: Verify ETA was provided during progress # (This validates T018 integration) ``` **Risks**: - Test might be flaky if system under load - Timeout might be too short for slower systems --- #### T013: Write integration test for cancellation [Testing] **Dependencies**: T021 **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with T012 **User Story**: [US3] Cancellation (preview for Phase 5) **Description**: Write integration test for job cancellation during execution. **Deliverables**: - [ ] Add test to `tests/integration/test_background_indexing_workflow.py` - [ ] Start job with large repository (slow indexing) - [ ] Cancel job mid-execution - [ ] Verify status transitions to cancelled - [ ] Verify worker aborts cleanly **Acceptance Criteria**: - [ ] Test starts indexing job - [ ] Test waits 1 second for job to start - [ ] Test calls cancel_indexing_background (tool implemented later in US3) - [ ] Test verifies status becomes 'cancelled' within 10 seconds - [ ] Test verifies cancelled_at timestamp set - [ ] Test verifies no hanging connections or tasks **Constitutional Principles**: - Principle VII: TDD (edge case testing) - Principle V: Production Quality (clean cancellation) **Technical Notes**: - Use larger test repository to ensure job runs long enough - Verify worker detects cancellation within 5 seconds (progress update interval) - Clean up test database after cancellation **Test Structure**: ```python @pytest.mark.integration @pytest.mark.asyncio async def test_background_indexing_cancellation(): """Test job cancellation during execution.""" # Start job result = await start_indexing_background( repo_path=str(large_test_repo), project_id="test", ) job_id = result["job_id"] # Wait for job to start await asyncio.sleep(1) # Cancel job (Note: cancel tool implemented in Phase 5 - US3) cancel_result = await cancel_indexing_background( job_id=job_id, project_id="test", ) assert cancel_result["status"] == "cancelled" # Wait for worker to detect cancellation await asyncio.sleep(6) # Verify cancellation status = await get_indexing_status(job_id=job_id, project_id="test") assert status["status"] == "cancelled" assert status["cancelled_at"] is not None ``` **Risks**: - Cancellation might not propagate quickly enough - Worker might not clean up resources properly **Note**: This test will initially be skipped until US3 cancellation tool is implemented. --- #### T014: Write integration test for list queries [Testing] **Dependencies**: T017 (list tool exists) **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with T012 **User Story**: [US2] Enhanced monitoring **Description**: Write integration test for list_background_jobs filtering and pagination. **Deliverables**: - [ ] Add test to `tests/integration/test_background_indexing_workflow.py` - [ ] Create multiple jobs with different statuses - [ ] Test status filtering - [ ] Test pagination - [ ] Verify ordering by created_at DESC **Acceptance Criteria**: - [ ] Test creates 5 jobs with various statuses - [ ] Test filters by status='running' returns correct subset - [ ] Test pagination with limit=2 returns 2 results - [ ] Test offset=2 skips first 2 results - [ ] Test ordering shows newest jobs first - [ ] Test query completes in <200ms **Constitutional Principles**: - Principle VII: TDD (comprehensive testing) - Principle IV: Performance (<200ms target) **Technical Notes**: - Use fixtures to create test jobs - Validate response structure matches spec - Measure query execution time **Test Structure**: ```python @pytest.mark.integration @pytest.mark.asyncio async def test_list_background_jobs_filtering(): """Test list with filtering and pagination.""" # Create test jobs jobs = [] for i in range(5): result = await start_indexing_background( repo_path=str(test_repo), project_id="test", ) jobs.append(result["job_id"]) # Update some jobs to 'running' # ... (update logic) # Test status filter result = await list_background_jobs( project_id="test", status="running", ) assert result["total_count"] >= 0 assert all(job["status"] == "running" for job in result["jobs"]) # Test pagination result = await list_background_jobs( project_id="test", limit=2, offset=0, ) assert len(result["jobs"]) <= 2 assert result["limit"] == 2 assert result["offset"] == 0 ``` **Risks**: - Test might be affected by concurrent test runs - Query performance validation requires realistic data volume --- #### T015: Write test for state persistence across restarts [Testing] **Dependencies**: T021 **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with other tests **User Story**: [US4] Resumption (preview for future phase) **Description**: Write test that validates job state persists across server restarts (simulated connection pool closure). **Deliverables**: - [ ] Add test to `tests/integration/test_background_indexing_workflow.py` - [ ] Create job record - [ ] Simulate restart (close connection pools) - [ ] Verify job still queryable after restart - [ ] Verify job status preserved **Acceptance Criteria**: - [ ] Test creates job in database - [ ] Test closes all connection pools - [ ] Test reinitializes connection pools - [ ] Test queries job status successfully - [ ] Job status matches pre-restart state **Constitutional Principles**: - Principle V: Production Quality (state persistence) - Principle II: Local-First (PostgreSQL reliability) **Technical Notes**: - Don't start actual worker (just test persistence) - Close pools via provisioning.close_all_pools() - Reinitialize via get_session() **Test Structure**: ```python @pytest.mark.integration @pytest.mark.asyncio async def test_job_state_persistence(): """Test job state persists across server restarts.""" # Create job async with get_session(project_id="test") as db: result = await db.execute( text("INSERT INTO indexing_jobs (...) VALUES (...) RETURNING id") ) job_id = result.fetchone()[0] await db.commit() # Simulate restart (close pools) await close_all_pools() # Reinitialize and query status = await get_indexing_status(job_id=str(job_id), project_id="test") assert status["job_id"] == str(job_id) assert status["status"] == "pending" ``` **Risks**: - Test might not accurately simulate restart - Connection pool state might not fully reset --- ### Phase 9: Documentation & Polish **Checkpoint**: User documentation complete, feature ready for production ✅ --- #### T016: Update .env.example with background indexing config [Documentation] **Dependencies**: All previous phases **Estimated Time**: 15 minutes **Parallelization**: [P] Can run parallel with other docs **User Story**: All (cross-cutting) **Description**: Add configuration examples for background indexing to .env.example. **Deliverables**: - [ ] Add section "Background Indexing" to .env.example - [ ] Document MAX_CONCURRENT_INDEXING_JOBS - [ ] Document INDEXING_JOB_TIMEOUT_SECONDS - [ ] Document JOB_STATUS_RETENTION_DAYS - [ ] Document MAX_REPO_SIZE_GB - [ ] Document PROGRESS_UPDATE_INTERVAL_SECONDS - [ ] Add comments explaining each option **Acceptance Criteria**: - [ ] All configuration options documented - [ ] Defaults match architecture spec - [ ] Comments explain purpose and impact - [ ] Examples show reasonable production values **Constitutional Principles**: - Principle V: Production Quality (clear configuration) - Principle VI: Specification-First (documentation matches spec) **Technical Notes**: - Reference architecture doc lines 1020-1036 - Include both development and production examples - Explain performance implications of settings **Configuration Template**: ```bash # Background Indexing Configuration # Maximum number of concurrent background indexing jobs MAX_CONCURRENT_INDEXING_JOBS=2 # Timeout per indexing job (seconds) INDEXING_JOB_TIMEOUT_SECONDS=3600 # Cleanup completed jobs older than N days JOB_STATUS_RETENTION_DAYS=7 # Maximum repository size (GB) - reject larger repos MAX_REPO_SIZE_GB=10 # Progress update frequency (seconds) - don't set too low PROGRESS_UPDATE_INTERVAL_SECONDS=2 # Cancellation check interval (seconds) CANCELLATION_CHECK_INTERVAL_SECONDS=5 ``` **Risks**: - None (documentation only) --- #### T022: Update CLAUDE.md with background indexing usage [Documentation] **Dependencies**: T016 **Estimated Time**: 30 minutes **Parallelization**: [P] Can run parallel with T023 **User Story**: All (cross-cutting) **Description**: Add usage patterns and examples for background indexing to CLAUDE.md project instructions. **Deliverables**: - [ ] Add "Background Indexing" section to CLAUDE.md - [ ] Document when to use background vs. foreground indexing - [ ] Provide usage examples (start, poll, cancel patterns) - [ ] Document troubleshooting tips - [ ] Link to architecture doc **Acceptance Criteria**: - [ ] Section explains background indexing purpose - [ ] 3 usage patterns documented (start-and-poll, fire-and-forget, monitoring with list) - [ ] Examples use correct MCP tool syntax - [ ] Troubleshooting covers common issues (timeout, cancellation, failure) - [ ] Links to architecture doc for details **Constitutional Principles**: - Principle VI: Specification-First (documentation from spec) - Principle XI: FastMCP Foundation (tool usage examples) **Technical Notes**: - Reference architecture doc lines 912-965 for usage patterns - Include code examples in Python - Explain when background indexing is required (10K+ files) **Documentation Template**: ```markdown ## Background Indexing Large repositories (10,000+ files) require 5-10 minutes to index, which exceeds typical MCP request timeouts. Use background indexing for these repositories. ### When to Use Background Indexing - **Foreground (default)**: Repositories with <5,000 files (completes in <60 seconds) - **Background**: Repositories with 10,000+ files (requires 5-10 minutes) ### Usage Pattern 1: Start and Poll ```python # Start indexing result = await start_indexing_background( repo_path="/path/to/large/repo", ctx=ctx ) job_id = result["job_id"] # Poll for completion while True: status = await get_indexing_status(job_id=job_id, ctx=ctx) if status["status"] in ["completed", "failed", "cancelled"]: break print(f"Progress: {status['progress_percentage']}% - {status['progress_message']}") print(f"ETA: {status.get('estimated_seconds_remaining', 'calculating...')}s") await asyncio.sleep(2) if status["status"] == "completed": print(f"✅ Indexed {status['files_indexed']} files!") ``` ### Usage Pattern 2: List and Monitor Multiple Jobs ```python # Start multiple jobs job_ids = [] for repo in large_repos: result = await start_indexing_background(repo_path=repo, ctx=ctx) job_ids.append(result["job_id"]) # Monitor all running jobs result = await list_background_jobs(status="running", ctx=ctx) for job in result["jobs"]: print(f"{job['repo_name']}: {job['progress_percentage']}%") ``` ### Troubleshooting **Job stuck in 'running' status**: - Check server logs: `/tmp/codebase-mcp.log` - Verify Ollama is running: `curl http://localhost:11434/api/tags` - Cancel and restart: `await cancel_indexing_background(job_id=...)` **Job fails immediately**: - Verify repository path exists and is accessible - Check path is absolute (not relative) - Ensure sufficient disk space **Slow progress or stalled**: - Check ETA in status response - Verify embedding service responsiveness - Review progress_message for current phase For detailed architecture, see: `docs/architecture/background-indexing.md` ``` **Risks**: - None (documentation only) --- #### T023: Create integration examples in docs/ [Documentation] **Dependencies**: T015 (tests exist as examples) **Estimated Time**: 15 minutes **Parallelization**: [P] Can run parallel with T022 **User Story**: All (cross-cutting) **Description**: Create standalone example scripts demonstrating background indexing integration. **Deliverables**: - [ ] Create file: `docs/examples/background_indexing_example.py` - [ ] Example 1: Basic start-and-poll workflow - [ ] Example 2: List and monitor multiple jobs (US2) - [ ] Example 3: Fire-and-forget workflow - [ ] Add README.md explaining examples **Acceptance Criteria**: - [ ] Examples are runnable scripts - [ ] Each example has clear comments - [ ] README explains how to run examples - [ ] Examples cover main use cases **Constitutional Principles**: - Principle VI: Specification-First (examples from spec) - Principle XI: FastMCP Foundation (correct tool usage) **Technical Notes**: - Copy patterns from integration tests - Simplify for readability - Include error handling **Risks**: - None (documentation only) --- #### T024: Validate quickstart.md scenarios [Testing] **Dependencies**: T022 (documentation exists) **Estimated Time**: 30 minutes **Parallelization**: Cannot parallelize (final validation) **User Story**: All (cross-cutting) **Description**: Manually validate all quickstart scenarios documented in CLAUDE.md work correctly. **Deliverables**: - [ ] Run start-and-poll example - [ ] Run list-and-monitor example (US2) - [ ] Run fire-and-forget example - [ ] Verify all examples complete without errors - [ ] Document any issues found **Acceptance Criteria**: - [ ] All 3 usage patterns execute successfully - [ ] Progress updates visible in logs - [ ] ETA calculation visible in status (US2) - [ ] No errors in `/tmp/codebase-mcp.log` - [ ] Database state clean after completion **Constitutional Principles**: - Principle VII: TDD (validation before release) - Principle V: Production Quality (end-to-end verification) **Technical Notes**: - Use real MCP server (not mocks) - Test with realistic repository (1,000-5,000 files) - Verify against architecture specification **Risks**: - Quickstart might reveal issues not caught by unit/integration tests --- ## Critical Path Analysis Tasks that cannot be delayed (blocking other work): 1. **T001** → T002 → T003: Database schema (blocks all other work) 2. **T004-test** → T004: Models (blocks utilities) 3. **T006-test** → T006 → T007: Database utilities (blocks worker) 4. **T008-test** → T008: Background worker (blocks tools) 5. **T009-test** → T009 → T010: MCP tools US1 (blocks US2) 6. **T017-T021**: US2 enhancements (blocks integration tests) 7. **T012-T015**: Integration testing (blocks release) 8. **T024**: Quickstart validation (final gate) **Total Critical Path Time**: ~11 hours (updated with US2) **Critical Path Sequence**: ``` T001 → T002 → T003 → T004 → T006 → T007 → T008 → T009 → T010 → T017 → T018 → T019 → T020 → T021 → T012 → T024 ``` --- ## Parallel Work Opportunities These tasks can run simultaneously (marked with [P]): ### Phase 2 Parallelization - T004-test (model tests) [P] - T005-test (path validation tests) [P] ### Phase 6 Parallelization (US2) - T017-test (list tests) [P] - T018-test (ETA tests) [P] - T019-test (message tests) [P] - T020-test (percentage tests) [P] ### Phase 7 Parallelization - T011-test (progress callback tests) [P] - while T008-T010 in progress - T011 (indexer enhancement) [P] - parallel with T017-T021 ### Phase 8 Parallelization - T012 (workflow test) - T013 (cancellation test) [P] - requires US3 cancel tool - T014 (list test) [P] - T015 (persistence test) [P] ### Phase 9 Parallelization - T016 (env docs) [P] - T022 (CLAUDE.md) [P] - T023 (examples) [P] **Savings from Parallelization**: ~2 hours --- ## Risk Register ### High Priority Risks 1. **Path Traversal Validation Edge Cases** - Risk: Validation might not catch all attack vectors - Mitigation: Comprehensive test coverage (T005-test), security review - Owner: T004, T005 2. **Worker Resource Leaks** - Risk: Background workers might not clean up connections - Mitigation: finally blocks in worker, connection pool monitoring - Owner: T008 3. **Race Conditions in Job State** - Risk: Concurrent updates might cause inconsistency - Mitigation: PostgreSQL transaction isolation, row-level locking - Owner: T006, T007 4. **Migration Rollback Safety** - Risk: Downgrade might leave database in bad state - Mitigation: Test downgrade path thoroughly (T003) - Owner: T001, T003 ### Medium Priority Risks 5. **Progress Callback Performance** - Risk: Callback overhead might slow indexing >5% - Mitigation: Benchmark before/after (T011), optimize if needed - Owner: T011 6. **Cancellation Latency** - Risk: Worker might not detect cancellation quickly enough - Mitigation: Check cancellation on every progress update (~2s interval) - Owner: T008, T013 7. **Test Flakiness** - Risk: Integration tests might be timing-dependent - Mitigation: Generous timeouts, proper test isolation - Owner: T012-T015 8. **ETA Calculation Accuracy (US2)** - Risk: ETA might be wildly inaccurate with variable processing rates - Mitigation: Use conservative estimates, include disclaimer in docs - Owner: T018 ### Low Priority Risks 9. **Documentation Drift** - Risk: Docs might not match implementation - Mitigation: Generate docs from code, validate quickstart (T024) - Owner: T016, T022, T023 10. **List Query Performance Degradation (US2)** - Risk: Query performance might degrade with many jobs - Mitigation: Database indexes (T001), query validation (T021) - Owner: T017, T021 --- ## Success Metrics **Feature Complete When:** 1. ✅ All acceptance criteria pass (updated count with US2) 2. ✅ All tests pass (unit, integration, contract) 3. ✅ mypy --strict passes with no errors 4. ✅ Migration applies cleanly (upgrade and downgrade) 5. ✅ All MCP tools registered and functional (start, status, list) 6. ✅ Background indexing completes within 60s for 10K files 7. ✅ Job state persists across server restarts 8. ✅ Cancellation works within 5 seconds (US3 - future phase) 9. ✅ No connection pool leaks or resource leaks 10. ✅ Documentation complete (CLAUDE.md, examples, .env) 11. ✅ Quickstart validation passes (T024) 12. ✅ No CRITICAL constitutional violations 13. ✅ ETA calculation within 20% accuracy (US2) 14. ✅ List queries complete in <200ms p95 (US2) **Performance Targets:** - Indexing 10K files: <60 seconds (maintained from baseline) - Job creation: <1s (non-blocking) - Status query: <100ms (simple SELECT) - List query: <200ms p95 (US2) - Cancellation: <5s propagation time - Progress updates: <10ms per update - ETA calculation: <10ms overhead (US2) **Quality Targets:** - Test coverage: >80% (existing project standard) - Type safety: 100% mypy --strict compliance - Security: 0 path traversal vulnerabilities - Reliability: State persists across restarts **User Story Coverage:** - [US1] Core background indexing: COMPLETE (Phase 5) - [US2] Enhanced monitoring: COMPLETE (Phase 6) - [US3] Cancellation: DEFERRED (future implementation) - [US4] Resumption: DEFERRED (future implementation) --- ## Implementation Notes ### Git Micro-Commit Strategy After completing each task (or logical subtask), make an atomic commit: ```bash # After T001 (migration) git add migrations/versions/008_add_indexing_jobs.py git commit -m "feat(indexing): add indexing_jobs table migration - Add indexing_jobs table with 18 columns - Add 4 performance indexes with WHERE clauses (including US2 list query indexes) - Add CHECK constraints for status and progress range - Implement upgrade and downgrade functions Constitutional Compliance: Principle V (Production Quality), Principle VIII (Type Safety) User Stories: Infrastructure (blocks US1, US2, US3, US4) References: docs/architecture/background-indexing.md lines 73-129" # After T004 (models) git add src/models/indexing_job.py src/models/__init__.py git commit -m "feat(models): add IndexingJob Pydantic models - Add IndexingJobStatus enum (5 states) - Add IndexingJobCreate with path traversal validation - Add IndexingJobProgress (immutable output model) - Validator prevents relative paths and ../ sequences Constitutional Compliance: Principle VIII (Type Safety), Principle V (Production Quality) User Stories: Infrastructure (blocks all stories) References: docs/architecture/background-indexing.md lines 136-193" # After T017 (US2 list tool) git add src/mcp/tools/background_indexing.py tests/contract/test_list_background_jobs.py git commit -m "feat(indexing): implement list_background_jobs MCP tool [US2] - Add filtering by status and project_id - Add pagination with max limit 100 - Order by created_at DESC (newest first) - Use database indexes for <200ms performance Constitutional Compliance: Principle XI (FastMCP), Principle IV (Performance) User Stories: [US2] Enhanced monitoring References: specs/014-add-background-indexing/spec.md FR-007" # Continue pattern for each task... ``` ### Testing Strategy **Test Execution Order:** 1. Unit tests first (T004-test, T005-test, T006-test, T011-test, T018-test, T019-test, T020-test) 2. Integration tests second (T008-test, T012-T015) 3. Contract tests third (T009-test, T017-test) 4. Manual validation last (T021, T024) **Test Isolation:** - Each test creates its own test project database - Clean up job records after each test - Use pytest fixtures for database setup/teardown ### Constitutional Compliance Checkpoints **After Phase 1** (Database Schema): - ✅ Principle V: Indexes for performance (including US2 list query indexes) - ✅ Principle VIII: Explicit column types **After Phase 2** (Models): - ✅ Principle VIII: Pydantic models, mypy --strict - ✅ Principle V: Path traversal prevention **After Phase 4** (Worker): - ✅ Principle V: Comprehensive error handling - ✅ Principle IV: Non-blocking execution **After Phase 5** (US1 Tools): - ✅ Principle XI: FastMCP decorators - ✅ Principle III: MCP protocol compliance - ✅ Delivers: [US1] Core background indexing **After Phase 6** (US2 Tools): - ✅ Principle IV: Performance (<200ms list queries, <10ms ETA calculation) - ✅ Principle V: Clear progress communication - ✅ Delivers: [US2] Enhanced monitoring **After Phase 8** (Testing): - ✅ Principle VII: TDD approach throughout - ✅ Principle IV: Performance validation --- ## MVP Scope (Recommended Initial Delivery) **Phase 1-5**: Core Background Indexing (US1) - Database schema (T001-T003) - Models (T004-T005) - Utilities (T006-T007) - Worker (T008) - Core tools (T009-T010) - Basic integration test (T012) **Total MVP Time**: ~8 hours (Phase 1-5 + T012) **Delivers**: - Background indexing for large repositories - Job status tracking - Real-time progress updates - State persistence **Deferred to Next Iteration**: - Enhanced monitoring with list, ETA, detailed messages (US2) - Phase 6 - Cancellation (US3) - Future implementation - Resumption (US4) - Future implementation --- ## Post-MVP Enhancement: User Story 2 **Phase 6**: Enhanced Monitoring (US2) - List tool (T017) - ETA calculation (T018) - Phase messages (T019) - Phase percentages (T020) - Index validation (T021) - Integration tests (T014) **Total Enhancement Time**: ~2 hours **Delivers**: - List all background jobs with filtering - Accurate ETA estimates - Phase-specific progress messages - Smooth percentage progression --- ## Post-Implementation Tasks (Future Enhancements) **Not required for US1/US2 release, but documented for future work:** 1. **User Story 3: Cancellation** (Priority P3) - Implement cancel_indexing_background MCP tool - Add cancellation detection in worker - Graceful shutdown within 5 seconds - Estimated: 3 hours 2. **User Story 4: Resumption** (Priority P2) - Implement checkpoint save/load logic - Auto-resume on server startup - <1% duplicate work guarantee - Estimated: 4 hours 3. **Job History Cleanup** (Optional) - Implement background task to clean up old completed jobs - Reference: architecture doc lines 1390-1419 4. **LISTEN/NOTIFY** (Optional) - Add real-time notifications via PostgreSQL LISTEN/NOTIFY - For advanced clients that want push-based updates - Reference: architecture doc lines 1351-1386 5. **Resource Limits Enforcement** (Nice-to-have) - Enforce MAX_CONCURRENT_INDEXING_JOBS - Add repository size validation - Job timeout via asyncio.wait_for() --- ## Completion Summary **When MVP (US1) complete:** - ✅ PostgreSQL-native background indexing fully functional - ✅ Job state persists across server restarts - ✅ Path traversal attacks prevented - ✅ All constitutional principles maintained - ✅ Comprehensive test coverage (unit, integration, contract) - ✅ Documentation complete and validated - ✅ Production-ready reliability and security **When US2 enhancement complete:** - ✅ All MVP features PLUS: - ✅ List all background jobs with filtering and pagination - ✅ Accurate ETA calculation (within 20%) - ✅ Phase-specific progress messages - ✅ Smooth percentage progression (0-100%) - ✅ <200ms list query performance **Ready for merge to main branch when:** - All tasks for target scope marked [X] (MVP = T001-T012, Full = T001-T024) - All acceptance criteria validated - Quickstart scenarios pass (T024) - No CRITICAL constitutional violations - Code review approved - CI/CD pipeline green --- **End of Merged Tasks Document** Total Tasks: 25 (T001-T024, renumbered from original 19+5) Total Estimated Time: 12.5-14.5 hours Critical Path: 11 hours Parallelization Savings: 2 hours **MVP Scope**: T001-T012 (8 hours, US1 only) **Full Scope**: T001-T024 (12.5-14.5 hours, US1 + US2) **Next Step**: Create feature branch and start with T001 (migration) ```bash git checkout -b 014-add-background-indexing git push -u origin 014-add-background-indexing ```

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Ravenight13/codebase-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

background-indexing-tasks-merged.md•94.7 KiB