Semantic Search MCP Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Semantic Search MCP Server find where we handle JWT token validation".
That's it! The server will respond to your query, and you can continue using it as needed.
CodeSight
AI-powered document search engine — hybrid BM25 + vector + RRF retrieval with pluggable LLM answer synthesis.
Quick Start
# Install
pip install -e ".[dev]"
# Index a folder of documents
python -m codesight index /path/to/documents
# Search (hybrid BM25 + vector)
python -m codesight search "payment terms" /path/to/documents
# Filter by file type
python -m codesight search "auth" /path/to/code --glob '*.py'
# Ask a question (requires LLM API key — see Configuration)
python -m codesight ask "What are the payment terms?" /path/to/documents
# Machine-readable output
python -m codesight search "query" /path --json
# Check index status
python -m codesight status /path/to/documents
# Launch the web chat UI
pip install -e ".[demo]"
python -m codesight demo

Python API
from codesight import CodeSight
engine = CodeSight("/path/to/documents")
engine.index() # Index all files
results = engine.search("payment terms") # Hybrid search
answer = engine.ask("What are the payment terms?") # Search + LLM answer
status = engine.status() # Index freshness check

Supported Formats
| Format | Extension | Parser |
| --- | --- | --- |
| PDF | .pdf | pymupdf |
| Word | .docx | python-docx |
| PowerPoint | .pptx | python-pptx |
| Code | — | Built-in (10 languages) |
| Text | — | Built-in |
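The table above implies extension-based parser dispatch. A minimal sketch of that idea, assuming a lookup keyed on file suffix (the names here are hypothetical, not CodeSight's internals):

```python
from pathlib import Path

# Hypothetical dispatch table for illustration only; the real parser
# selection lives inside CodeSight, not in this snippet.
PARSER_BY_SUFFIX = {
    ".pdf": "pymupdf",
    ".docx": "python-docx",
    ".pptx": "python-pptx",
}

def parser_for(path: str) -> str:
    # Anything not in the table falls back to the built-in text/code parser.
    return PARSER_BY_SUFFIX.get(Path(path).suffix.lower(), "built-in")
```

Lowercasing the suffix keeps the lookup case-insensitive, so `report.PDF` resolves the same way as `report.pdf`.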
Architecture
Document Parsing: PDF, DOCX, PPTX text extraction with page/section metadata
Chunking: Language-aware regex splitting (code) + paragraph-aware splitting (documents)
Embeddings: all-MiniLM-L6-v2 via sentence-transformers (local, no API key)
Vector Store: LanceDB (serverless, file-based)
Keyword Search: SQLite FTS5 sidecar
Retrieval: Hybrid BM25 + vector with RRF merge
Answer Synthesis: Pluggable LLM backend (Claude, Azure OpenAI, OpenAI, Ollama)
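The retrieval step merges the BM25 and vector result lists with reciprocal rank fusion. A minimal sketch, assuming each retriever returns document IDs in rank order; the constant k=60 is the common RRF default, not necessarily the value CodeSight uses:

```python
def rrf_merge(bm25_ids, vector_ids, k=60):
    """Merge two ranked ID lists with reciprocal rank fusion."""
    scores = {}
    for ranked in (bm25_ids, vector_ids):
        for rank, doc_id in enumerate(ranked):
            # Each list contributes 1 / (k + rank); ranks are 1-based here.
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    # Highest fused score first.
    return sorted(scores, key=scores.get, reverse=True)
```

Documents that appear near the top of both lists accumulate the largest fused scores, which is why RRF is a robust way to combine keyword and semantic rankings without score normalization.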
See ARCHITECTURE.md for the full system tour.
Configuration
| Variable | Default | Description |
| --- | --- | --- |
| — | — | Required for Claude backend |
| — | — | LLM backend |
| — | — | Where indexes are stored |
| — | — | Embedding model |
| — | — | LLM model for answers |
| — | — | Index freshness threshold (seconds) |
| — | — | Logging verbosity |
See .env.example for all options.
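Backend selection is environment-driven. A hedged sketch of what reading such a setting typically looks like; the variable name CODESIGHT_LLM_BACKEND is hypothetical, so check .env.example for the project's actual names:

```python
import os

# Hypothetical variable name for illustration; the real setting names
# are listed in .env.example.
backend = os.environ.get("CODESIGHT_LLM_BACKEND", "claude")

# The supported backends per the Architecture section.
if backend not in {"claude", "azure-openai", "openai", "ollama"}:
    raise ValueError(f"unknown LLM backend: {backend}")
```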
Stack
Python 3.11+
LanceDB + SQLite FTS5
sentence-transformers
Anthropic Claude API / Azure OpenAI / OpenAI / Ollama
Streamlit (web chat UI)
pymupdf, python-docx, python-pptx (document parsing)