Enriches code chunks with authorship, timestamps, churn metrics, and task IDs extracted from commit history and git blame data.
Supports extracting GitHub task IDs from commit messages to provide context and linking between code and project issues.
Enables extraction of JIRA task IDs from commit messages to associate indexed code chunks with specific project tickets.
Integrates with Ollama for local, privacy-first embedding generation and semantic codebase search.
Supports OpenAI embedding models for semantic vectorization and high-performance code search.
Provides specialized Ruby AST-aware chunking to improve the accuracy and relevance of semantic search in Ruby codebases.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Tea Rags MCPsearch for where user authentication is implemented"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP server for semantic code search with git trajectory reranking. AST-aware chunking, incremental indexing, millions of LOC. Reranks results using authorship, churn, bug-fix rates, and 19 other signals β not just embedding similarity. Built on Qdrant. Works with Ollama (local) or cloud providers (OpenAI, Cohere, Voyage).
π Full documentation β 15-minute quickstart, agent workflows, architecture deep dives.
𧬠Trajectory Enrichment
Standard code RAG retrieves by similarity alone. Trajectory enrichment augments each chunk with signals about how code evolves β at the function level, not just file level.
π Git trajectory β churn, authorship, volatility, bug-fix rates, task traceability. 19 signals feed composable rerank presets (
hotspots,ownership,techDebt,securityAudit...)πΈοΈ Topological trajectory (planned) β symbol graphs, cross-file coupling, blast radius
Opt-in via CODE_ENABLE_GIT_METADATA=true. Without it β standard semantic search with AST-aware chunking.
π‘ An agent can find stable templates, avoid anti-patterns, match domain owner's style, and assess modification risk β all backed by empirical data. Read more β
π Quick Start
Then ask your agent: "Index this codebase for semantic search"
π Documentation
Section | What's inside | |
π | Installation, first index & query | |
βοΈ | Env vars, providers, tuning | |
π€ | Prompt strategies, generation modes, deep analysis | |
ποΈ | Pipeline, data model, reranker internals |
π€ Contributing
See CONTRIBUTING.md for workflow and conventions.
π Acknowledgments
Built on a fork of mhalder/qdrant-mcp-server β clean architecture, solid tests, open-source spirit. And its ancestor qdrant/mcp-server-qdrant. Code vectorization inspired by claude-context (Zilliz).
Feel free to fork this fork. It's forks all the way down. π’