Semantic Search MCP Server

Overview Schema Related Servers Score Discussions

codesight
specs

002-embedding-model-config.md•1.55 KiB

# Spec 002: Embedding Model Configuration **Status:** Planned **Target Version:** v0.2 ## Summary Allow users to configure the embedding model via environment variable. Add `nomic-embed-text-v1.5` and `jina-embeddings-v2-base-code` as options for better retrieval quality on code-heavy repos. ## Acceptance Criteria - [ ] `SEMANTIC_SEARCH_EMBEDDING_MODEL` env var selects the model - [ ] Supported values: `all-MiniLM-L6-v2` (default), `nomic-embed-text-v1.5`, `jina-embeddings-v2-base-code` - [ ] Invalid model name fails fast with clear error message - [ ] Changing the model invalidates existing index (different dims/space) - [ ] Model name stored in index metadata for staleness detection - [ ] File preamble (imports + docstring) prepended to chunk context before embedding ## API / Tool Surface Environment variable — no tool API change. ```bash SEMANTIC_SEARCH_EMBEDDING_MODEL=nomic-embed-text-v1.5 python -m semantic_search_mcp ``` ## Edge Cases - User changes model after indexing: auto-rebuild index with new model - Model not installed: clear error with `pip install` command - Dim mismatch between stored vectors and loaded model: detected via metadata ## Out of Scope - API-based embedding models (OpenAI, etc.) — local-only for now - Multiple models simultaneously — one model per index ## Test Plan - `tests/test_embeddings.py` — all three models load and produce correct-dimension vectors - `tests/test_config.py` — invalid model name raises ValueError with helpful message - Integration: index with model A, change to model B, verify auto-rebuild

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/camilojourney/codesight'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

002-embedding-model-config.md•1.55 KiB