Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Semantic Search MCP Serverfind where we handle JWT token validation"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
codesight
AI-powered document search engine — hybrid BM25 + vector + RRF retrieval with Claude answer synthesis.
Quick Start
# Install
pip install -e ".[dev]"
# Index a folder of documents
python -m codesight index /path/to/documents
# Search
python -m codesight search "payment terms" /path/to/documents
# Ask a question (requires ANTHROPIC_API_KEY)
python -m codesight ask "What are the payment terms?" /path/to/documents
# Launch the web chat UI
pip install -e ".[demo]"
python -m codesight demoPython API
from codesight import CodeSight
engine = CodeSight("/path/to/documents")
engine.index() # Index all files
results = engine.search("payment terms") # Hybrid search
answer = engine.ask("What are the payment terms?") # Search + Claude answer
status = engine.status() # Index freshness checkSupported Formats
Format | Extension | Parser |
| pymupdf | |
Word |
| python-docx |
PowerPoint |
| python-pptx |
Code |
| Built-in (10 languages) |
Text |
| Built-in |
Architecture
Document Parsing: PDF, DOCX, PPTX text extraction with page/section metadata
Chunking: Language-aware regex splitting (code) + paragraph-aware splitting (documents)
Embeddings:
all-MiniLM-L6-v2via sentence-transformers (local, no API key)Vector Store: LanceDB (serverless, file-based)
Keyword Search: SQLite FTS5 sidecar
Retrieval: Hybrid BM25 + vector with RRF merge
Answer Synthesis: Claude API generates answers with source citations
See ARCHITECTURE.md for the full system tour.
Configuration
Variable | Default | Description |
| — | Required for |
|
| Where indexes are stored |
|
| Embedding model |
|
| Claude model for answers |
|
| Index freshness threshold |
|
| Logging verbosity |
See .env.example for all options.
Workflow: Explore → Plan → Execute → Review
Opus in VS Code plans and launches autonomous CLI agents in the background — the user never leaves the conversation. Agents run via env -u CLAUDECODE claude --dangerously-skip-permissions --model [model] -p '...' with output redirected to files. Multiple cycles ensure quality: Sonnet implements, Opus reviews. See .claude/rules/workflow.md for full details.
Stack
Python 3.11+
LanceDB + SQLite FTS5
sentence-transformers
Anthropic Claude API
Streamlit (web chat UI)
pymupdf, python-docx, python-pptx (document parsing)
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.