Uses Google's Gemini API to generate semantic embeddings for code chunks, enabling AI-powered semantic search across codebases
MCP Codebase Index Server
AI-powered semantic search for your codebase in Claude Desktop
A Model Context Protocol (MCP) server that enables Claude to search and understand your codebase using Google's Gemini embeddings and Qdrant Cloud vector storage.
⨠Features
š Semantic Search: Find code by meaning, not just keywords
šÆ Smart Chunking: Automatically splits code into logical functions/classes
š Real-time Watch: Monitors file changes and updates index automatically
š Multi-language: Supports 15+ programming languages
āļø Cloud Storage: Uses Qdrant Cloud for persistent vector storage
š¦ Simple Setup: Just 4 environment variables to get started
š Quick Start
Prerequisites
Gemini API Key: Get free at Google AI Studio
Qdrant Cloud Account: Sign up free at cloud.qdrant.io
Installation
Add to your claude_desktop_config.json:
All 4 variables are required:
Variable | Where to Get | Example |
| Absolute path to your project |
|
|
| |
| Qdrant Cloud cluster URL |
|
| Qdrant Cloud API key |
|
Restart Claude Desktop
The server will automatically:
Connect to your Qdrant Cloud cluster
Create a collection (if needed)
Index your entire codebase
Watch for file changes
š Usage
Ask Claude to search your codebase:
šļø Configuration
Required Variables
Optional Variables
Variable | Default | Description |
|
| Collection name in Qdrant |
|
| Auto-update on file changes |
|
| Embedding batch size |
š§ Setup Guides
SETUP.md - Detailed setup walkthrough
QDRANT_CLOUD_SETUP.md - Get Qdrant credentials
QUICK_REF.md - Quick reference card
š Supported Languages
Python ⢠TypeScript ⢠JavaScript ⢠Dart ⢠Go ⢠Rust ⢠Java ⢠Kotlin ⢠Swift ⢠Ruby ⢠PHP ⢠C ⢠C++ ⢠C# ⢠Shell ⢠SQL ⢠HTML ⢠CSS
š How It Works
š Troubleshooting
Server not appearing in Claude?
Check Claude logs:
Common issues:
ā
REPO_PATHmust be absolute pathā All 4 env variables must be set
ā Qdrant URL must include
:6333portā Gemini API key must be valid
Can't connect to Qdrant?
Test connection:
Should return JSON with collections list.
Indexing too slow?
Large repos (1000+ files) take 5-10 minutes initially
Reduce
BATCH_SIZEif hitting rate limitsCheck Gemini API quota: aistudio.google.com
š Performance
Embedding speed: ~100 chunks/minute (Gemini API)
Search latency: <100ms (Qdrant Cloud)
Storage: ~1KB per code chunk
Recommended: <10K chunks per collection
š License
MIT Ā© NgoTaiCo
š¤ Contributing
Issues and PRs welcome at github.com/NgoTaiCo/mcp-codebase-index
This server cannot be installed
local-only server
The server can only run on the client's local machine because it depends on local resources.
Enables semantic search across your codebase using Google's Gemini embeddings and Qdrant Cloud vector storage. Supports 15+ programming languages with smart code chunking and real-time file change monitoring.