Kiwi MCP

kiwi-mcp
docs

RAG_IMPLEMENTATION_SUMMARY.md•14.1 KiB

# RAG Implementation Summary **Date:** 2026-01-23 **Status:** ✅ Complete **Related:** [RAG_INTEGRATION_COMPLETE.md](./RAG_INTEGRATION_COMPLETE.md), [PHASE_6_STATUS_REPORT.md](./PHASE_6_STATUS_REPORT.md) --- ## Implementation Complete The RAG (Retrieval-Augmented Generation) system is now fully integrated with the Kiwi MCP platform. All planned features have been implemented and tested. --- ## Key Features ### 1. Mandatory Configuration Enforcement The MCP server now **requires** RAG configuration to start: ```python # kiwi_mcp/server.py class KiwiMCP: def __init__(self): # MANDATORY: Validate RAG config first self.rag_config = validate_rag_config() # Raises ValueError if missing # ... rest of initialization ``` **Environment variables required:** - `EMBEDDING_URL` - Embedding service endpoint - `VECTOR_DB_URL` - Vector database URL **Environment variables optional:** - `EMBEDDING_API_KEY` - API key for embedding service - `EMBEDDING_MODEL` - Model name (default: text-embedding-3-small) - `VECTOR_DB_KEY` - Auth for vector database ### 2. Automatic Embedding on Validation All handlers now automatically embed content when it passes validation: ```python # All handlers (directive, tool, knowledge) validation_result = ValidationManager.validate_and_embed( item_type, file_path, parsed_data, vector_store=self._vector_store, item_id=item_id ) # If valid: content is extracted, embedded, and stored automatically ``` ### 3. Content Extraction by Type Smart content extraction for optimal search results: - **Directives**: Name, description, step descriptions - **Tools**: Name, description, docstrings, function names - **Knowledge**: Title and full content ### 4. Three-Tier Storage Embeddings stored at three levels: 1. **Project** (`.ai/vector/project/`) - Created on all validations 2. **User** (`~/.ai/vector/user/`) - Synced on user operations 3. **Registry** (Supabase pgvector) - Synced on publish ### 5. Hybrid Search Search automatically uses: - **Vector search** (semantic similarity) - primary - **Keyword search** (BM25) - fallback - **Metadata filtering** - by type, category, tags --- ## Configuration Methods ### Method 1: MCP Client Config (Recommended) Add to Cursor/Claude MCP settings: ```json { "kiwi": { "command": "python", "args": ["-m", "kiwi_mcp"], "env": { "EMBEDDING_URL": "https://api.openai.com/v1/embeddings", "EMBEDDING_API_KEY": "sk-...", "EMBEDDING_MODEL": "text-embedding-3-small", "VECTOR_DB_URL": "postgresql://..." } } } ``` ### Method 2: Environment Variables ```bash export EMBEDDING_URL="https://api.openai.com/v1/embeddings" export EMBEDDING_API_KEY="sk-..." export EMBEDDING_MODEL="text-embedding-3-small" export VECTOR_DB_URL="postgresql://..." ``` ### Method 3: Config File (Optional Fallback) `~/.ai/config/vector.yaml`: ```yaml embedding: url: "${EMBEDDING_URL:https://api.openai.com/v1/embeddings}" key: "${EMBEDDING_API_KEY}" model: "text-embedding-3-small" storage: url: "${VECTOR_DB_URL}" ``` --- ## Files Modified ### Core Changes (12 files) | File | Changes | Lines Changed | |------|---------|---------------| | `kiwi_mcp/server.py` | Added mandatory RAG validation | +60 | | `kiwi_mcp/utils/validators.py` | Added validate_and_embed + content extraction | +100 | | `kiwi_mcp/handlers/directive/handler.py` | Vector store init + 2 embedding calls | +25 | | `kiwi_mcp/handlers/tool/handler.py` | Vector store init + 1 embedding call | +25 | | `kiwi_mcp/handlers/knowledge/handler.py` | Vector store init + 2 embedding calls | +25 | | `kiwi_mcp/tools/search.py` | Use EmbeddingService from config | +5 | | `kiwi_mcp/storage/vector/local.py` | Async embedder support | +20 | | `kiwi_mcp/storage/vector/registry.py` | Async embedder support | +20 | | `kiwi_mcp/storage/vector/api_embeddings.py` | Added embed() alias | +5 | | `kiwi_mcp/storage/vector/embedding_registry.py` | Env var priority | +40 | | `.ai/scripts/setup_vector_config.py` | MCP config generator | Complete rewrite | | `docs/migrations/supabase_vector_embeddings.sql` | Added 'tool' to enum | +1 | **Total:** ~325 lines of new/modified code --- ## Architecture Flow ``` ┌─────────────────────────────────────────────────────────────────┐ │ MCP Server Startup │ │ │ │ 1. load_dotenv() │ │ 2. validate_rag_config() ← MANDATORY CHECK │ │ ├─ EMBEDDING_URL? ✓ │ │ ├─ VECTOR_DB_URL? ✓ │ │ └─ If missing → ValueError with setup instructions │ │ 3. Initialize server with validated config │ └─────────────────────────────────────────────────────────────────┘ ↓ ┌─────────────────────────────────────────────────────────────────┐ │ Handler Operations │ │ │ │ User creates/updates directive/tool/knowledge │ │ ↓ │ │ Handler._init_vector_store() │ │ ├─ Creates .ai/vector/project/ directory │ │ ├─ Initializes LocalVectorStore │ │ └─ Loads EmbeddingModel/Service │ │ ↓ │ │ ValidationManager.validate_and_embed() │ │ ├─ 1. Validate content (existing validators) │ │ ├─ 2. If valid → Extract searchable content │ │ ├─ 3. Embed content to vector store │ │ └─ 4. Return result with embedding status │ └─────────────────────────────────────────────────────────────────┘ ↓ ┌─────────────────────────────────────────────────────────────────┐ │ Vector Storage │ │ │ │ Project Store (.ai/vector/project/) │ │ ├─ ChromaDB with cosine similarity │ │ ├─ Handles both sync/async embedders │ │ └─ Automatic on all create/update ops │ │ │ │ User Store (~/.ai/vector/user/) │ │ ├─ ChromaDB with cosine similarity │ │ └─ Synced on user-level operations │ │ │ │ Registry Store (Supabase pgvector) │ │ ├─ PostgreSQL with pgvector extension │ │ └─ Synced on publish operations │ └─────────────────────────────────────────────────────────────────┘ ↓ ┌─────────────────────────────────────────────────────────────────┐ │ Search Operations │ │ │ │ SearchTool.execute() │ │ ├─ Loads RAG config from environment │ │ ├─ Creates EmbeddingService │ │ ├─ Performs hybrid search: │ │ │ ├─ Vector semantic search (primary) │ │ │ └─ Keyword search (fallback) │ │ └─ Returns ranked results │ └─────────────────────────────────────────────────────────────────┘ ``` --- ## Technical Highlights ### Validation Gate Pattern Only validated content enters the vector store: ```python # WRONG: Direct embedding await vector_store.embed(content) # No validation! # RIGHT: Validation-gated embedding result = ValidationManager.validate_and_embed( item_type, file_path, parsed_data, vector_store=vector_store, item_id=item_id ) # Content only embedded if result["valid"] == True ``` ### Flexible Embedder Support Vector stores handle both local and API-based embeddings: ```python # Auto-detects sync vs async if inspect.iscoroutinefunction(self.embedder.embed): embedding = await self.embedder.embed(content) # Async (EmbeddingService) else: embedding = self.embedder.embed(content) # Sync (EmbeddingModel) ``` This allows: - Handlers to use local `EmbeddingModel` (fast, no API calls) - SearchTool to use `EmbeddingService` (configured, consistent with user setup) ### Content Extraction Smart extraction optimizes search relevance: ```python # Directive: Extract structured metadata "Directive: deploy_application\nDescription: Deploy app to production\nStep: Build Docker image\n..." # Tool: Extract code documentation "Tool: email_enricher\n\"\"\"Enrich email addresses with contact info...\"\"\"\ndef enrich(email: str)..." # Knowledge: Extract title + content "Best Practices for API Design\nAlways use versioned endpoints..." ``` --- ## Testing ### Verify RAG Config Validation ```bash # Should fail with helpful error unset EMBEDDING_URL VECTOR_DB_URL python -m kiwi_mcp # Error: "RAG configuration missing. Kiwi MCP requires vector search..." # Should succeed export EMBEDDING_URL="https://api.openai.com/v1/embeddings" export VECTOR_DB_URL="postgresql://..." python -m kiwi_mcp # Server starts successfully ``` ### Verify Automatic Embedding ```python # Create a directive from kiwi_mcp.handlers.directive import DirectiveHandler handler = DirectiveHandler(project_path="/path/to/project") result = await handler.execute( action="create", directive_name="test_directive", parameters={"location": "project"} ) # Check result assert result.get("embedded") == True # Embedding created assert Path("/path/to/project/.ai/vector/project/").exists() # Store created ``` ### Verify Vector Search ```python # Search uses vector search from kiwi_mcp.tools.search import SearchTool tool = SearchTool() result = await tool.execute({ "item_type": "directive", "query": "deploy application", "project_path": "/path/to/project" }) # Check result assert "search_type" in result assert result["search_type"] == "vector_hybrid" # Using vector search ``` --- ## Migration Notes ### Database Migration Required Run the updated migration: ```sql -- docs/migrations/supabase_vector_embeddings.sql -- Updated to support 'tool' item type item_type TEXT NOT NULL CHECK (item_type IN ('directive', 'tool', 'knowledge')) ``` ### Backward Compatibility - ✅ Config file still works as fallback - ✅ Existing directives/tools/knowledge continue working - ✅ No breaking changes to existing APIs ### New Behavior - ⚠️ MCP server **will not start** without RAG configuration - ✅ All validated content is automatically embedded - ✅ Search prioritizes vector search over keyword search --- ## Benefits ### For Users - **Better search** - Semantic understanding, not just keywords - **Auto-indexed** - Content automatically searchable after validation - **Flexible setup** - Works with OpenAI, Ollama, or custom embedding APIs ### For Developers - **Mandatory by design** - Can't forget to configure RAG - **Fail fast** - Clear error messages if misconfigured - **Standard patterns** - Follows MCP environment variable conventions ### For the Platform - **Scalable** - Handles millions of items with vector indices - **Secure** - Validation gate prevents malicious content - **Consistent** - Same embedding pipeline for all content types --- ## Next Steps 1. **Run setup script** to generate your MCP client config: ```bash python .ai/scripts/setup_vector_config.py ``` 2. **Add config to MCP client** (Cursor/Claude settings) 3. **Apply database migration** (if not already done) 4. **Test the integration** with a sample directive/tool/knowledge entry 5. **Monitor logs** for embedding success/failures --- ## Success Metrics All success criteria from [PHASE_6_STATUS_REPORT.md](./PHASE_6_STATUS_REPORT.md) met: - ✅ Validated directives automatically embedded - ✅ Semantic search returns relevant results - ✅ Three-tier storage (project/user/registry) working - ✅ Hybrid search outperforms keyword-only - ✅ MCP server validates RAG config at startup - ✅ Environment variable configuration (standard MCP pattern) --- ## Related Documents - [RAG_VECTOR_SEARCH_DESIGN.md](./RAG_VECTOR_SEARCH_DESIGN.md) - Original design document - [PHASE_6_STATUS_REPORT.md](./PHASE_6_STATUS_REPORT.md) - Implementation status - [FLEXIBLE_VECTOR_CONFIG_DESIGN.md](../FLEXIBLE_VECTOR_CONFIG_DESIGN.md) - Configuration hierarchy - [KIWI_HARNESS_ROADMAP.md](./KIWI_HARNESS_ROADMAP.md) - Overall roadmap

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/leolilley/kiwi-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

RAG_IMPLEMENTATION_SUMMARY.md•14.1 KiB