Markdown RAG Documentation

Overview Schema Related Servers Score Discussions

22-index-snapshot-system.md•28.2 KiB

# 22. Index Snapshot System **Version:** 1.0.0 **Date:** 2026-01-23 **Status:** Implemented --- ## Executive Summary **Purpose:** Document the version-based index snapshot system that enables zero-downtime index synchronization between worker and main processes in multiprocess mode. **Scope:** This specification covers the snapshot publishing, versioning, atomic updates, cleanup, and hot-reload mechanisms that allow the worker process to persist indices and the main process to load them without blocking operations. **Decision:** Use file-based versioned snapshots with atomic version file updates rather than shared memory or direct IPC serialization. --- ## 1. Goals & Non-Goals ### Goals 1. **Zero-Downtime Updates:** Main process can reload indices without blocking queries 2. **Fault Isolation:** Worker crashes don't corrupt main process indices 3. **Atomic Consistency:** Snapshots are either complete or not visible (no partial states) 4. **Version Tracking:** Explicit version numbering prevents stale reads 5. **Storage Efficiency:** Automatic cleanup of old snapshots to prevent unbounded growth 6. **Portability:** Works across all platforms (Linux, macOS, Windows) ### Non-Goals 1. **Instant Propagation:** Minor latency (100ms polling) is acceptable 2. **Deduplication:** No content-addressable storage (each snapshot is independent) 3. **Compression:** Raw indices stored as-is (optimization not required) 4. **Point-in-Time Rollback:** No multi-version history (only keeps last N) 5. **Concurrent Writers:** Only worker process publishes snapshots --- ## 2. Architecture Overview ### 2.1. System Context The snapshot system operates within the broader multiprocess architecture: ``` ┌──────────────────────────────────────────────────────────┐ │ MAIN PROCESS │ │ ┌────────────────────┐ ┌────────────────────┐ │ │ │ ReadOnlyContext │◀─────▶│ IndexSyncReceiver │ │ │ │ - vector │ │ - polls version │ │ │ │ - keyword │ │ - hot-reloads │ │ │ │ - graph │ └────────────────────┘ │ │ └────────────────────┘ │ │ │ ▲ │ │ └─────────┼───────────────────────────────┼──────────────┘ │ │ │ reload indices │ poll │ │ ┌─────────┼───────────────────────────────┼──────────────┐ │ │ ▼ │ │ ┌──────┴──────────────┐ ┌────────────────────┐ │ │ │ Mutable Indices │ │ IndexSyncPublisher │ │ │ │ - vector │──────▶│ - creates v{N} │ │ │ │ - keyword │persist│ - writes version │ │ │ │ - graph │ │ - cleans old │ │ │ └─────────────────────┘ └────────────────────┘ │ │ WORKER PROCESS │ └────────────────────────────────────────────────────────┘ ┌────────────────────┐ │ FILESYSTEM │ │ snapshots/ │ │ ├── version.bin │ │ ├── v1/ │ │ │ ├── vector/ │ │ │ ├── keyword/ │ │ │ └── graph/ │ │ └── v2/ │ │ ├── vector/ │ │ ├── keyword/ │ │ └── graph/ │ └────────────────────┘ ``` ### 2.2. Core Components | Component | Location | Process | Responsibility | |-----------|----------|---------|---------------| | `IndexSyncPublisher` | `src/ipc/index_sync.py` | Worker | Create versioned snapshots, manage cleanup | | `IndexSyncReceiver` | `src/ipc/index_sync.py` | Main | Poll for updates, trigger hot-reload | | `ReadOnlyContext` | `src/reader/context.py` | Main | Hold read-only indices, coordinate reload | | `WorkerState` | `src/worker/process.py` | Worker | Trigger publishes when indices change | --- ## 3. Snapshot Structure ### 3.1. Directory Layout ``` {index_path}/snapshots/ ├── version.bin # Binary: uint32 current version ├── v1/ # Snapshot version 1 │ ├── vector/ │ │ ├── docstore.json │ │ ├── index_store.json │ │ ├── faiss_index.bin │ │ ├── doc_id_mapping.json │ │ └── chunk_id_mapping.json │ ├── keyword/ │ │ ├── MAIN_*.toc │ │ ├── MAIN_*.seg │ │ └── _MAIN_*.pos │ ├── graph/ │ │ ├── graph.json │ │ └── communities.json │ └── memory/ # If memory enabled │ ├── vector/ │ ├── keyword/ │ └── graph/ └── v2/ # Snapshot version 2 ├── vector/ ├── keyword/ ├── graph/ └── memory/ ``` ### 3.2. Version File Format **File:** `version.bin` **Format:** Binary little-endian uint32 ```python import struct # Writing version_data = struct.pack("<I", version_number) # 4 bytes version_file.write_bytes(version_data) # Reading data = version_file.read_bytes() version = struct.unpack("<I", data[:4])[0] ``` **Properties:** - **Atomic writes:** Always written to `.tmp` and renamed (atomic on POSIX) - **Immutable:** Once written, never modified (replaced entirely) - **Minimal:** 4 bytes total, fast to read/parse ### 3.3. Snapshot Naming Convention - **Format:** `v{N}` where N is monotonically increasing integer - **Starting point:** v1 (not v0) for clarity - **Gaps allowed:** Cleaned-up versions leave gaps (e.g., v1, v5, v7) - **No padding:** No zero-padding (not `v001`), simplifies parsing --- ## 4. Publishing Flow ### 4.1. Worker Process Publishing **Trigger conditions:** 1. Initial indexing complete (`_initialize_worker`) 2. File watcher detects changes and indexing finishes (`_check_for_index_updates`) 3. Manual reindex command (if implemented) **Publishing sequence:** ```python def _publish_snapshot(state: WorkerState) -> int: """Publish current indices to new snapshot version.""" def persist_callback(snapshot_dir: Path) -> None: # Worker's IndexManager persists all indices state.index_manager.persist_to(snapshot_dir) # Also persist memory indices if enabled if state.memory_manager: memory_dir = snapshot_dir / "memory" memory_dir.mkdir() state.memory_manager.persist_to(memory_dir) # IndexSyncPublisher handles versioning + atomicity version = state.sync_publisher.publish(persist_callback) state.last_index_time = time.time() return version ``` ### 4.2. IndexSyncPublisher.publish() **Implementation:** [src/ipc/index_sync.py#L28-L52](../../src/ipc/index_sync.py) **Algorithm:** 1. **Increment version:** `new_version = self._version + 1` 2. **Create snapshot directory:** `snapshots/v{new_version}/` 3. **Call persist callback:** Indices write themselves to snapshot directory 4. **Atomic version update:** - Write to `version.bin.tmp` - Rename to `version.bin` (atomic on POSIX, near-atomic on Windows) 5. **Update internal state:** `self._version = new_version` 6. **Cleanup old snapshots:** Keep only last 2 versions 7. **Return version number:** For notification **Error handling:** - On failure, delete incomplete snapshot directory - Version file not updated (main process sees old version) - Re-raise exception to caller ### 4.3. Atomicity Guarantees **Problem:** Prevent main process from loading partial snapshots **Solution layers:** 1. **Snapshot directory isolation:** `v{N}/` created fully before version file updated 2. **Atomic version file write:** `rename()` syscall guarantees atomicity 3. **Read ordering:** Main process checks version → then loads snapshot 4. **No in-place updates:** Never modify existing snapshot directories **Race condition prevention:** | Scenario | Outcome | |----------|---------| | Worker writes snapshot, crashes before version update | Main process never sees new version (old data served) | | Worker updates version, crashes during cleanup | Old snapshots retained (space leak but no corruption) | | Main process reads version mid-update | Reads either old or new (atomic rename) | | Multiple workers (unsupported) | Only one worker allowed (enforced by config) | --- ## 5. Loading Flow ### 5.1. Main Process Initialization **Entry point:** `ReadOnlyContext.create()` [src/reader/context.py#L39](../../src/reader/context.py) **Sequence:** 1. **Create empty indices:** VectorIndex, KeywordIndex, GraphStore 2. **Find latest snapshot:** Read `version.bin`, resolve `v{N}/` path 3. **Load indices (if snapshot exists):** ```python if latest_snapshot: await asyncio.to_thread( _load_indices_from_snapshot, vector, keyword, graph, latest_snapshot ) ``` 4. **Create IndexSyncReceiver:** Register reload callback 5. **Start background watcher:** Poll for updates ### 5.2. IndexSyncReceiver.reload_if_needed() **Implementation:** [src/ipc/index_sync.py#L105-L122](../../src/ipc/index_sync.py) **Algorithm:** 1. **Read published version:** Parse `version.bin` 2. **Compare versions:** `published_version > self._current_version`? 3. **Validate snapshot:** Check `v{published_version}/` exists 4. **Call reload callback:** ```python self._reload_callback(snapshot_dir, published_version) # Callback reloads VectorIndex, KeywordIndex, GraphStore ``` 5. **Update tracking:** `self._current_version = published_version` 6. **Log success:** Informational log for observability ### 5.3. Hot-Reload Mechanism **Callback implementation:** [src/reader/context.py#L100-L102](../../src/reader/context.py) ```python def reload_callback(snapshot_dir: Path, version: int) -> None: _load_indices_from_snapshot(vector, keyword, graph, snapshot_dir) logger.info("Reloaded indices from snapshot v%d", version) ``` **Index replacement strategy:** | Index Type | Reload Method | Thread Safety | |-----------|---------------|---------------| | VectorIndex | `load_from()` | Lock-protected (`_index_lock`) | | KeywordIndex | `load_from()` | Whoosh reader reopened | | GraphStore | `load_from()` | NetworkX graph replaced atomically | **Query consistency:** - Queries in flight complete using old indices - New queries after reload use new indices - No mixed-version queries (all-or-nothing reload) --- ## 6. Event-Driven Index Sync ### 6.1. IndexSyncReceiver.watch() **Implementation:** [src/ipc/index_sync.py#L127-L157](../../src/ipc/index_sync.py) ```python async def watch(self) -> None: """Watch for index updates using filesystem events.""" from watchdog.events import FileSystemEventHandler from watchdog.observers import Observer event_queue: asyncio.Queue[None] = asyncio.Queue() loop = asyncio.get_running_loop() class VersionHandler(FileSystemEventHandler): def on_modified(self, event): if not event.is_directory and str(event.src_path).endswith("version.bin"): loop.call_soon_threadsafe(event_queue.put_nowait, None) observer = Observer() observer.schedule(VersionHandler(), str(self._snapshot_base), recursive=False) observer.start() try: while True: await event_queue.get() try: if self.check_for_update(): await asyncio.to_thread(self.reload_if_needed) except Exception: logger.exception("Error checking for index updates") finally: observer.stop() observer.join(timeout=1.0) ``` **Characteristics:** - **Event-driven:** Uses watchdog inotify (Linux) / FSEvents (macOS) / ReadDirectoryChangesW (Windows) - **Zero CPU when idle:** No polling loop, blocks on filesystem events - **Non-blocking:** Runs as asyncio task - **Error tolerance:** Exceptions logged, watching continues - **Graceful shutdown:** Task cancelled on `ReadOnlyContext.stop()`, observer cleaned up in finally block **Latency analysis:** | Event | Latency | Explanation | |-------|---------|-------------| | File change → Worker index | ~100ms | Debounced file watcher | | Index → Snapshot publish | ~50-500ms | Depends on index size | | Snapshot → Event delivery | <10ms | inotify/FSEvents instant notification | | Detect → Reload complete | ~50-200ms | Index loading | | **Total:** | **~200-800ms** | End-to-end propagation | --- ## 7. Snapshot Cleanup ### 7.1. IndexSyncPublisher._cleanup_old_snapshots() **Implementation:** [src/ipc/index_sync.py#L54-L72](../../src/ipc/index_sync.py) **Algorithm:** 1. **Enumerate snapshots:** Find all `v{N}/` directories 2. **Parse versions:** Extract integer from directory name 3. **Sort descending:** Newest versions first 4. **Keep top N:** Default `keep=2` (configurable) 5. **Delete remainder:** `shutil.rmtree()` on old directories **Retention policy:** | Configuration | Behavior | |---------------|----------| | `keep=1` | Only current version (aggressive cleanup) | | `keep=2` | Current + previous (default, allows rollback) | | `keep=3+` | Multiple historical versions (debugging/analysis) | **Safety:** - Cleanup runs **after** version file updated - Main process already notified of new version - Even if main hasn't reloaded yet, new version available ### 7.2. Storage Impact **Typical snapshot size:** - Small docs (<10 files): ~1-5 MB per snapshot - Medium docs (100 files): ~10-50 MB per snapshot - Large docs (1000+ files): ~100-500 MB per snapshot **With `keep=2` default:** - Storage overhead: 2x index size - Temporary spike: 3x during publish (old + new + version) --- ## 8. Thread Safety & Concurrency ### 8.1. Worker Process (Publisher) **Constraints:** - **Single writer:** Only worker process publishes - **Sequential publishes:** No concurrent `publish()` calls - **No index mutation during publish:** File watcher paused **Enforcement:** - Config validation: `worker.enabled` must be true/false (not multi-worker) - Command loop: Processes commands sequentially - Persist callback: Blocks until complete ### 8.2. Main Process (Receiver) **Constraints:** - **Single poller:** One `watch()` task per ReadOnlyContext - **Reload serialization:** `asyncio.to_thread()` queues reloads - **Index locking:** VectorIndex uses `_index_lock` for state updates **Query handling during reload:** ```python # Queries before reload query_1 → VectorIndex v5 → results_1 # Reload starts (background thread) reload_callback → load v6 into new index # Queries during reload (old index still accessible) query_2 → VectorIndex v5 → results_2 # Reload completes (atomic swap) vector._index = new_index_v6 # Queries after reload query_3 → VectorIndex v6 → results_3 ``` **No query blocking:** Reload happens in background thread, queries continue --- ## 9. Error Handling & Recovery ### 9.1. Corruption Detection **Symptoms:** - `version.bin` exists but unreadable - Snapshot directory missing/incomplete - Index files corrupted within snapshot **Publisher recovery:** ```python # IndexSyncPublisher._load_current_version() try: self._version = struct.unpack("<I", data[:4])[0] except (OSError, struct.error): logger.warning("Failed to load version file, starting from 0") self._version = 0 # Start fresh ``` **Receiver recovery:** ```python # IndexSyncReceiver.reload_if_needed() if not snapshot_dir.exists(): logger.warning("Snapshot directory for v%d not found", version) return False # Skip this reload, continue polling try: self._reload_callback(snapshot_dir, version) except Exception: logger.exception("Failed to reload index from snapshot v%d", version) return False # Keep current version, retry next poll ``` ### 9.2. Incomplete Snapshot Handling **Scenario:** Worker crashes mid-publish **Protection:** 1. Incomplete snapshot in `v{N}/` (orphaned directory) 2. `version.bin` still points to previous version 3. Main process never sees incomplete snapshot 4. Worker restart detects last good version, continues **Cleanup:** Orphaned directories cleaned up on next successful publish ### 9.3. Version File Corruption **Scenario:** `version.bin` corrupted/deleted **Impact:** - Publisher: Starts from v0, creates v1 (may overwrite old v1) - Receiver: Sees no version, treats as no snapshots available - Indices: Continue with last loaded state **Recovery:** Automatic on next successful publish --- ## 10. Performance Characteristics ### 10.1. Publish Performance **Measured on typical documentation corpus (100 files, 5MB total):** | Operation | Time | Explanation | |-----------|------|-------------| | Vector persist | ~100-200ms | FAISS index write | | Keyword persist | ~50-100ms | Whoosh segment write | | Graph persist | ~10-50ms | JSON serialization | | Version file write | ~1ms | Atomic 4-byte write | | **Total publish:** | **~160-350ms** | Per snapshot | **Scaling:** - Linear with document count (indexing time dominates) - Sublinear with document size (chunking limits growth) ### 10.2. Reload Performance **Measured on same corpus:** | Operation | Time | Explanation | |-----------|------|-------------| | Vector load | ~50-100ms | FAISS index mmap | | Keyword load | ~20-50ms | Whoosh reader open | | Graph load | ~5-20ms | JSON parsing | | **Total reload:** | **~75-170ms** | Per reload | **Impact on queries:** - Reload happens in background thread - Queries continue serving from old indices - No blocking observed in benchmarks ### 10.3. Polling Overhead **CPU usage:** <0.1% (file stat only, no disk read) **Latency:** 100ms poll interval = max 100ms detection delay **Memory:** Negligible (no buffering) --- ## 11. Configuration ### 11.1. Worker Configuration **File:** `pyproject.toml` or `config.toml` ```toml [tool.ragdocs.worker] enabled = true # Enable multiprocess mode health_check_interval = 1.0 # Worker health check (seconds) restart_on_failure = true # Auto-restart on crash ``` **Environment variables:** ```bash RAGDOCS_WORKER_ENABLED=true RAGDOCS_WORKER_HEALTH_CHECK_INTERVAL=1.0 ``` ### 11.2. Snapshot Configuration **Hardcoded defaults** (no user configuration yet): | Parameter | Value | Location | |-----------|-------|----------| | Snapshot base | `{index_path}/snapshots` | Lifecycle coordinator | | Poll interval | 100ms | `IndexSyncReceiver.watch()` | | Retention count | 2 | `_cleanup_old_snapshots(keep=2)` | | Version format | uint32 little-endian | `struct.pack("<I", ...)` | **Future configuration options:** - `snapshot_retention_count` - `snapshot_poll_interval_ms` - `snapshot_compression_enabled` --- ## 12. Observability ### 12.1. Logging **Publisher events:** ``` INFO - Published index snapshot v3 to /path/to/snapshots/v3 DEBUG - Cleaned up old snapshot: /path/to/snapshots/v1 WARNING - Failed to clean up snapshot /path/to/snapshots/v2: Permission denied ``` **Receiver events:** ``` INFO - Loading indices from snapshot: /path/to/snapshots/v3 INFO - Reloaded indices from snapshot v3 INFO - Index sync watcher started WARNING - Snapshot directory for v4 not found ERROR - Failed to reload index from snapshot v5 ``` ### 12.2. Metrics **Tracked by worker:** - `last_index_time`: Timestamp of last successful index - `sync_publisher.version`: Current published version **Tracked by main:** - `sync_receiver.current_version`: Currently loaded version - `is_ready()`: Whether any snapshot loaded **Exposed via:** - `HealthStatusResponse` (IPC command) - `/status` endpoint (REST API if enabled) ### 12.3. Debugging **Check current state:** ```bash # Check version file hexdump -C {index_path}/snapshots/version.bin # Output: 00 00 00 03 (version 3) # List snapshots ls -lh {index_path}/snapshots/ # v1/ v2/ v3/ version.bin # Check snapshot contents ls -lR {index_path}/snapshots/v3/ # vector/ keyword/ graph/ ``` **Common issues:** | Symptom | Cause | Resolution | |---------|-------|------------| | Version stuck at 0 | Worker not publishing | Check worker logs, file watcher | | Version increments but not reloaded | Main process not polling | Check sync watcher started | | Snapshots accumulating | Cleanup disabled/failing | Verify write permissions | | Reload fails | Corrupted snapshot | Delete bad snapshot, worker republishes | --- ## 13. Testing Strategy ### 13.1. Unit Tests **File:** [tests/unit/test_index_sync.py](../../tests/unit/test_index_sync.py) **Coverage:** | Test | Validates | |------|-----------| | `test_publish_creates_version_file` | Version file format correct | | `test_publish_creates_snapshot_directory` | Directory naming convention | | `test_publish_increments_version` | Monotonic version growth | | `test_publish_calls_persist_callback` | Callback receives correct path | | `test_cleanup_old_snapshots` | Retention policy enforced | | `test_receiver_detects_update` | Poll detection works | | `test_receiver_reload_callback` | Reload callback invoked | | `test_publisher_receiver_integration` | End-to-end flow | ### 13.2. Integration Tests **File:** [tests/integration/test_index_sync_e2e.py](../../tests/integration/test_index_sync_e2e.py) (hypothetical) **Scenarios:** 1. **Worker publishes, main reloads:** - Start worker, index documents - Verify main process detects and loads snapshot - Query returns indexed content 2. **Multiple publishes:** - Add documents, trigger publish - Add more documents, trigger publish - Verify main process reloads both times 3. **Worker crash recovery:** - Publish snapshot v1 - Crash worker mid-publish of v2 - Restart worker, verify continues from v1 4. **Snapshot cleanup:** - Publish v1, v2, v3 - Verify only v2, v3 retained ### 13.3. Performance Tests **File:** [tests/performance/test_snapshot_latency.py](../../tests/performance/test_snapshot_latency.py) (hypothetical) **Benchmarks:** | Benchmark | Target | Actual | |-----------|--------|--------| | Publish latency (100 docs) | <500ms | ~250ms | | Reload latency (100 docs) | <200ms | ~120ms | | Poll overhead (CPU) | <1% | <0.1% | | End-to-end propagation | <1s | ~600ms | --- ## 14. Architecture Decision Records ### ADR-1: File-Based Snapshots vs. Shared Memory **Status:** Accepted **Context:** Indices must be accessible to both processes without blocking. **Decision:** Use file-based snapshots with version tracking. **Alternatives Considered:** | Option | Pros | Cons | |--------|------|------| | **Snapshots (selected)** | Fault isolation, simple, portable | 2x memory, I/O latency | | Shared memory (mmap) | Zero-copy, instant updates | Complex sync, crash risk | | IPC serialization | No disk usage | Huge message overhead | **Rationale:** Snapshots provide complete fault isolation. Worker crashes don't affect main process. The ~100ms reload latency is acceptable for file-change-driven updates. --- ### ADR-2: Binary Version File vs. JSON Metadata **Status:** Accepted **Context:** Need to track current snapshot version efficiently. **Decision:** Use 4-byte binary uint32 in `version.bin`. **Alternatives Considered:** | Option | Pros | Cons | |--------|------|------| | **Binary uint32 (selected)** | Fast, atomic, minimal | Not human-readable | | JSON metadata | Extensible, readable | Not atomic, slower | | Filename (e.g., `current.txt`) | Human-readable | Rename not atomic on Windows | **Rationale:** 4-byte binary file reads are atomic on all platforms. Version number is sufficient metadata. Extensibility not needed (snapshots are ephemeral). --- ### ADR-3: Polling vs. File System Events **Status:** Accepted **Context:** Main process needs to detect new snapshots. **Decision:** Poll `version.bin` every 100ms. **Alternatives Considered:** | Option | Pros | Cons | |--------|------|------| | **Polling (selected)** | Simple, cross-platform | Slight latency, CPU overhead | | inotify/FSEvents | Instant notification | Platform-specific, complex | | IPC notification | Direct communication | Adds message type, queue dependency | **Rationale:** Polling is simple and reliable. 100ms latency is acceptable. CPU overhead (<0.1%) is negligible. Queue notifications provide faster path but polling is backup. --- ### ADR-4: Keep N Snapshots vs. Keep Duration **Status:** Accepted **Context:** Old snapshots must be cleaned up to prevent unbounded growth. **Decision:** Keep last N versions (N=2 default). **Alternatives Considered:** | Option | Pros | Cons | |--------|------|------| | **Keep N versions (selected)** | Predictable storage, simple | No time-based reasoning | | Keep by duration (e.g., 1 hour) | Intuitive | Unpredictable count | | Keep indefinitely | Debugging history | Storage leak | **Rationale:** Version count is predictable (2x index size). Duration-based cleanup is complex (need timestamps). N=2 allows previous version fallback. --- ## 15. Future Enhancements ### Considered but Not Implemented **1. Snapshot Compression** - **Benefit:** Reduce disk usage by 50-70% - **Cost:** CPU overhead on publish/load - **Decision:** Defer until storage is a bottleneck **2. Content-Addressable Storage** - **Benefit:** Deduplicate unchanged index segments - **Cost:** Significant complexity (ref counting, GC) - **Decision:** Defer until profiling shows redundancy **3. Multi-Version History** - **Benefit:** Point-in-time rollback for debugging - **Cost:** Storage overhead, complex UX - **Decision:** Defer until requested by users **4. Configurable Poll Interval** - **Benefit:** Tune latency vs. CPU trade-off - **Cost:** Another configuration knob - **Decision:** Defer until benchmarking shows need **5. Delta Snapshots** - **Benefit:** Only persist changed indices - **Cost:** Complex change tracking, consistency issues - **Decision:** Defer indefinitely (full snapshots simpler) --- ## 16. Implementation Files | File | Role | LOC | |------|------|-----| | [src/ipc/index_sync.py](../../src/ipc/index_sync.py) | Publisher and receiver classes | ~140 | | [src/reader/context.py](../../src/reader/context.py) | ReadOnlyContext with snapshot loading | ~200 | | [src/worker/process.py](../../src/worker/process.py) | Worker snapshot publishing | ~50 | | [src/lifecycle.py](../../src/lifecycle.py) | Snapshot path configuration | ~20 | | [tests/unit/test_index_sync.py](../../tests/unit/test_index_sync.py) | Unit tests | ~200 | | [tests/unit/test_reader_context.py](../../tests/unit/test_reader_context.py) | Context loading tests | ~100 | **Total implementation:** ~710 LOC --- ## 17. References ### Internal Documentation 1. **Multiprocess Architecture:** [docs/specs/21-multiprocess-architecture.md](./21-multiprocess-architecture.md) 2. **Architecture Overview:** [docs/architecture.md](../architecture.md) (IndexSyncPublisher/Receiver section) 3. **Self-Healing Indices:** [docs/specs/19-self-healing-indices.md](./19-self-healing-indices.md) (Corruption recovery) ### External References 1. **POSIX Atomicity:** `rename(2)` syscall atomicity guarantees 2. **Python multiprocessing:** Official documentation on Queue and Process 3. **Atomic file operations:** Martin Kleppmann, "Designing Data-Intensive Applications" (Chapter 3: Storage and Retrieval) --- ## 18. Changelog | Version | Date | Changes | |---------|------|---------| | 1.0.0 | 2026-01-23 | Initial post-hoc specification | --- **Document Maintainer:** AI Agents (lead, docs) **Review Cycle:** As-needed (feature is stable) **Last Updated:** 2026-01-23

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/andnp/ragdocs-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

22-index-snapshot-system.md•28.2 KiB