Semantic Search MCP Server

by camilojourney

CodeSight

AI-powered document search engine: hybrid BM25 and vector retrieval merged with reciprocal rank fusion (RRF), plus pluggable LLM answer synthesis.

Quick Start

# Install
pip install -e ".[dev]"

# Install with AST chunking support (Python/JS/TS — higher MRR for code)
pip install -e ".[dev,ast]"

# Index a folder of documents
python -m codesight index /path/to/documents

# Search (hybrid BM25 + vector)
python -m codesight search "payment terms" /path/to/documents

# Filter by file type
python -m codesight search "auth" /path/to/code --glob '*.py'

# Ask a question (requires LLM API key — see Configuration)
python -m codesight ask "What are the payment terms?" /path/to/documents

# Machine-readable output
python -m codesight search "query" /path --json

# Check index status
python -m codesight status /path/to/documents

# Launch the web chat UI
pip install -e ".[demo]"
python -m codesight demo

Python API

from codesight import CodeSight

engine = CodeSight("/path/to/documents")
engine.index()                                     # Index all files
results = engine.search("payment terms")           # Hybrid search
answer = engine.ask("What are the payment terms?") # Search + LLM answer
status = engine.status()                           # Index freshness check

The package root exports CodeSight, ServerConfig, Answer, IndexStats, RepoStatus, and SearchResult for stable public imports.

Supported Formats

Format	Extension	Parser
PDF	`.pdf`	pymupdf
Word	`.docx`	python-docx
PowerPoint	`.pptx`	python-pptx
Code	`.py`, `.js`, `.ts`, `.go`, `.rs`, etc.	AST-based (tree-sitter) + regex fallback
Text	`.md`, `.txt`, `.csv`	Built-in

Architecture

Document Parsing: PDF, DOCX, PPTX text extraction with page/section metadata
Chunking: AST-based (tree-sitter) for Python/JS/TS — function/class boundaries preserve semantic units. Regex fallback for other languages. Paragraph-aware splitting for documents.
Embeddings: voyage-code-3 (API, code files) / all-MiniLM-L6-v2 (local, docs). Auto-detected via VOYAGE_API_KEY.
Vector Store: LanceDB (serverless, file-based)
Keyword Search: SQLite FTS5 sidecar
Retrieval: Hybrid BM25 + vector + code-vector with RRF merge → metadata filename boost → optional reranker
Reranker: voyage rerank-2 (code-aware, auto-enabled with VOYAGE_API_KEY). Local ms-marco cross-encoder opt-in only.
Answer Synthesis: Pluggable LLM backend (Claude, Azure OpenAI, OpenAI, Ollama)
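
The chunking step above splits code at function and class boundaries so each chunk is a complete semantic unit. CodeSight does this with tree-sitter. As a dependency-free illustration of the same idea, the sketch below uses Python's standard-library `ast` module instead, so it only handles Python source. The function name `chunk_python_source` and the chunk dictionary keys are hypothetical and are not CodeSight's API.

```python
import ast


def chunk_python_source(source: str) -> list[dict]:
    """Return one chunk per top-level function or class in `source`.

    Illustrative stand-in for tree-sitter chunking: uses the stdlib `ast`
    module, so it works for Python source only.
    """
    tree = ast.parse(source)
    lines = source.splitlines()
    chunks = []
    for node in tree.body:
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            # lineno and end_lineno are 1-based and inclusive.
            text = "\n".join(lines[node.lineno - 1 : node.end_lineno])
            chunks.append(
                {
                    "name": node.name,
                    "start_line": node.lineno,
                    "end_line": node.end_lineno,
                    "text": text,
                }
            )
    return chunks


SAMPLE = '''\
import os

def load(path):
    with open(path) as f:
        return f.read()

class Index:
    def add(self, doc):
        self.docs.append(doc)
'''

for chunk in chunk_python_source(SAMPLE):
    print(chunk["name"], chunk["start_line"], chunk["end_line"])
```

Module-level statements such as the `import os` line fall outside every chunk here. A real chunker would need a policy for that leftover text, for example a fixed-window or regex fallback like the one the list above mentions for other languages.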

See ARCHITECTURE.md for the full system tour.
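
The retrieval step merges the BM25 and vector rankings with reciprocal rank fusion (RRF). A minimal sketch of that merge follows, assuming each retriever returns an ordered list of document IDs. The function name `rrf_merge` is hypothetical, and `k = 60` is the constant commonly used for RRF; the value CodeSight actually uses is not stated here.

```python
from collections import defaultdict


def rrf_merge(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """Fuse several ranked lists into one using reciprocal rank fusion.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    where rank is 1-based. k = 60 is an assumed, conventional default.
    """
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


bm25_hits = ["a.py", "b.py", "c.py"]      # hypothetical keyword ranking
vector_hits = ["b.py", "d.py", "a.py"]    # hypothetical embedding ranking

for doc_id, score in rrf_merge([bm25_hits, vector_hits]):
    print(f"{doc_id}\t{score:.4f}")
```

Because RRF works on ranks rather than raw scores, the BM25 and vector scores never need to be normalized onto a common scale, which is why it suits hybrid retrieval.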

Performance

Measured on the holusight codebase (96 files, 20 representative queries):

Configuration	Hit Rate	MRR@10
Baseline (fixed windows, no reranker)	52.5%	0.352
+ VPRF + voyage reranker	100%	0.599
+ AST chunking (tree-sitter)	100%	0.823
+ voyage-code-3 + voyage rerank-2	100%	0.793

AST chunking is the largest single lever: it raises MRR@10 from 0.599 to 0.823 (+0.224). The local ms-marco cross-encoder hurts code retrieval, so enable it only explicitly.
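
For readers unfamiliar with the metrics in the table, here is a minimal sketch of how hit rate and MRR@10 are commonly computed. The README does not define its exact evaluation procedure, so the per-query definition of hit rate, the function names, and the sample data below are all assumptions and do not reproduce the benchmark.

```python
def reciprocal_rank(ranked: list[str], relevant: set[str], k: int = 10) -> float:
    """Return 1 / rank of the first relevant result in the top k, else 0.0."""
    for rank, doc_id in enumerate(ranked[:k], start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def evaluate(runs: list[tuple[list[str], set[str]]], k: int = 10) -> tuple[float, float]:
    """Compute (hit rate, MRR@k) over (ranked results, relevant IDs) pairs."""
    rr = [reciprocal_rank(ranked, relevant, k) for ranked, relevant in runs]
    hit_rate = sum(1 for value in rr if value > 0) / len(rr)
    mrr = sum(rr) / len(rr)
    return hit_rate, mrr


# Hypothetical queries: (ranked result IDs, set of relevant IDs).
runs = [
    (["a", "b", "c"], {"b"}),       # first relevant at rank 2 -> 0.5
    (["x", "y"], {"z"}),            # no relevant result       -> 0.0
    (["m"], {"m"}),                 # first relevant at rank 1 -> 1.0
    (["p", "q", "r", "s"], {"s"}),  # first relevant at rank 4 -> 0.25
]

hit_rate, mrr = evaluate(runs)
print(f"hit rate = {hit_rate:.2%}, MRR@10 = {mrr:.4f}")
```

Hit rate only records whether a relevant result appears in the top k, while MRR also rewards ranking it higher. That is how a change can lift MRR while hit rate stays at 100%, as in the table above.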

Configuration

Variable	Default	Description
`ANTHROPIC_API_KEY`	—	Required for Claude backend (`ask()`)
`VOYAGE_API_KEY`	—	Enables voyage-code-3 embeddings + voyage rerank-2 (recommended for code)
`CODESIGHT_LLM_BACKEND`	`claude`	LLM backend: `claude`, `azure`, `openai`, `ollama`
`CODESIGHT_DATA_DIR`	`~/.codesight/data`	Where indexes are stored
`CODESIGHT_EMBEDDING_MODEL`	`all-MiniLM-L6-v2`	Embedding model (replaced by voyage-code-3 for code files when `VOYAGE_API_KEY` is set)
`CODESIGHT_LLM_MODEL`	`claude-sonnet-4-20250514`	LLM model for answers
`CODESIGHT_RERANKER`	`true` (if VOYAGE_API_KEY set)	Enable reranker
`CODESIGHT_RERANKER_BACKEND`	`voyage` (if key set)	Reranker backend: `voyage` or `local`
`CODESIGHT_STALE_SECONDS`	`300`	Index freshness threshold (seconds)
`LOG_LEVEL`	`INFO`	Logging verbosity

See .env.example for all options.
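
Several defaults in the table depend on whether `VOYAGE_API_KEY` is set. The sketch below shows one way such settings could be resolved from the environment, using only the variable names and defaults listed above. The `Settings` class, the `load_settings` function, and the set of accepted truthy strings are hypothetical; CodeSight's own `ServerConfig` may behave differently. What the reranker backend defaults to when no key is set is not stated, so the sketch leaves it as `None`.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

TRUTHY = {"1", "true", "yes", "on"}  # assumed spellings for "enabled"


@dataclass(frozen=True)
class Settings:
    llm_backend: str
    data_dir: str
    embedding_model: str
    reranker_enabled: bool
    reranker_backend: Optional[str]
    stale_seconds: int


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Resolve settings from `env` (defaults to os.environ) per the table."""
    env = os.environ if env is None else env
    has_voyage_key = bool(env.get("VOYAGE_API_KEY"))

    # The reranker defaults to on only when a Voyage key is present.
    raw_reranker = env.get("CODESIGHT_RERANKER")
    if raw_reranker is None:
        reranker_enabled = has_voyage_key
    else:
        reranker_enabled = raw_reranker.strip().lower() in TRUTHY

    default_backend = "voyage" if has_voyage_key else None
    return Settings(
        llm_backend=env.get("CODESIGHT_LLM_BACKEND", "claude"),
        data_dir=os.path.expanduser(env.get("CODESIGHT_DATA_DIR", "~/.codesight/data")),
        embedding_model=env.get("CODESIGHT_EMBEDDING_MODEL", "all-MiniLM-L6-v2"),
        reranker_enabled=reranker_enabled,
        reranker_backend=env.get("CODESIGHT_RERANKER_BACKEND", default_backend),
        stale_seconds=int(env.get("CODESIGHT_STALE_SECONDS", "300")),
    )


settings = load_settings({"VOYAGE_API_KEY": "example-key"})
print(settings.reranker_enabled, settings.reranker_backend, settings.stale_seconds)
```

Passing a plain dictionary instead of reading `os.environ` directly keeps the resolution logic easy to test without touching the real environment.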

Stack

Python 3.11+
LanceDB + SQLite FTS5
sentence-transformers + voyage-code-3 (optional)
tree-sitter (optional — AST chunking for Python/JS/TS)
Anthropic Claude API / Azure OpenAI / OpenAI / Ollama
Streamlit (web chat UI)
pymupdf, python-docx, python-pptx (document parsing)
