How do I use semantic-search-mcp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@semantic-search-mcp find code for user authentication" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

semantic-search-mcp

by Zulaxy

Overview Schema Related Servers Score Discussions

JavaScript

Local

semantic-search-mcp

Semantic code search for AI coding agents. Local embeddings. No API keys. No data leaves your machine.

Your AI agent (opencode, Claude) can grep for exact words - but semantic-search-mcp lets it find code by meaning. Ask "where do we handle authentication?" and it returns auth.controller.ts, login.component.jsx, auth.config.php - even if the word "handle" doesn't appear in any of them.

80MB model. Runs 100% locally. Powered by bge-small-en-v1.5.

Grep vs. Semantic Search

On a 6,900-file codebase:

Query	Grep	semantic-search-mcp
"where users upload avatars"	30+ results, unsorted, mixed noise	5 ranked, best match first (0.835)
"how error logs are sent"	0 results (no file contains "sent" + "logs")	5 results across handlers, mailers, config
"scheduled task for cleanup"	2 results (only exact matches)	5 results - cron jobs, queues, commands
Time	~30s searching + scanning	2 seconds from cache

Quick Start (3 steps)

1. Install

npm install -g semantic-search-mcp

2. Index your project

cd /path/to/your-project
semantic-search-mcp index

The folder you run this from gets indexed. Shows live progress:

████████████████░░░░░░ 70% (5200/7368) - ~120s remaining
██████████████████████ Done! 7368 chunks in 726s.

First run downloads the model (~80MB, one-time) + indexes your code (5-15 min depending on project size). After that, the cache is saved and restarts are instant.

Multiple projects? Run cd /project-a && semantic-search-mcp index, then cd /project-b && semantic-search-mcp index. Each project gets its own cache automatically.

3. Connect your AI agent

Add this to your opencode.json (or opencode.jsonc) in the project root:

{
  "mcp": {
    "semantic-search": {
      "type": "local",
      "command": ["npx", "-y", "semantic-search-mcp"],
      "enabled": true
    }
  }
}

Claude Desktop - add to claude_desktop_config.json:

{
  "mcpServers": {
    "semantic-search": {
      "command": "npx",
      "args": ["-y", "semantic-search-mcp"]
    }
  }
}

Claude Code (CLI) - add .mcp.json to your project root:

{
  "mcpServers": {
    "semantic-search": {
      "command": "npx",
      "args": ["-y", "semantic-search-mcp"]
    }
  }
}

Restart your AI agent. Done. Searches are instant - cache was already built.

Related MCP server: CodeGrok MCP

FAQ

Which folder gets indexed?

The folder you cd into before running semantic-search-mcp index. It's your current working directory. When opencode or Claude starts the MCP server, that same folder gets used automatically.

I have 3 projects. Do I index each one?

Yes. Each project has its own cache:

project-a/.semantic-search/cache/index.json
project-b/.semantic-search/cache/index.json
project-c/.semantic-search/cache/index.json

Where is the cache stored?

{your-project}/.semantic-search/cache/index.json

About 50-100MB per project. Survives PC restarts, Git pulls, everything. It's just files on disk. Only cleared if you run semantic-search-mcp clean.

How do I remove the cache?

semantic-search-mcp clean

Do I need to re-index after code changes?

No. But if you add many new files or want fresh results: semantic-search-mcp clean && semantic-search-mcp index.

What model does it use?

Xenova/bge-small-en-v1.5 by default (80MB, 384-dim, retrieval-optimized). You can switch models via semantic-search-mcp config.

Is my code sent anywhere?

No. Everything runs on your machine - model, embeddings, search. Zero network calls after model download.

CLI Commands

semantic-search-mcp index    # Index current folder (live progress bar)
semantic-search-mcp config   # Interactive TUI to pick extensions, model, thresholds
semantic-search-mcp clean    # Remove index cache
semantic-search-mcp init     # Print opencode/Claude config snippet
semantic-search-mcp           # Start the MCP server (used by AI agents)
semantic-search-mcp --help   # All commands

Configuration

Run semantic-search-mcp config for interactive setup (checkboxes for extensions, searchable model picker, number inputs).

Or create .semantic-search.json in your project root:

{
  "extensions": [".php", ".js", ".jsx", ".ts", ".tsx"],
  "skipDirs": ["node_modules", "vendor", ".git", "dist"],
  "model": "Xenova/bge-small-en-v1.5",
  "chunkThreshold": 300,
  "maxChunksPerFile": 4
}

Or env vars: SEMANTIC_SEARCH_EXTENSIONS=.php,.js, SEMANTIC_SEARCH_MODEL=Xenova/bge-small-en-v1.5

All options

Key	Default
`extensions`	20+ code extensions	File types to index
`skipDirs`	node_modules, vendor, .git, ...	Directories to skip
`model`	Xenova/bge-small-en-v1.5	HuggingFace embedding model
`cacheDir`	`.semantic-search/cache`	Where cache is stored (per project)
`chunkThreshold`	300	Lines before splitting file
`maxChunksPerFile`	4	Max chunks per large file
`maxResults`	50	Max search results
`defaultLimit`	10	Default results per query

How It Works

Scan - walk your project, find code files
Extract - split at function/class boundaries (PHP, JS, TS, Python, Go, Rust, Java)
Embed - run each chunk through a local ONNX model (384-dim vectors)
Cache - save everything to disk
Search - embed your query, find closest matches via cosine similarity

License

MIT

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Related MCP Servers

PAMPA
RAG Systems Search Developer Tools
tecnomanu
A
license
-
quality
C
maintenance
Provides semantic code search and retrieval capabilities for AI agents, enabling them to query codebases using natural language with automatic learning, hybrid search, and intelligent chunking of functions and classes.
Last updated 2025-09-25
12
29
ISC
CodeGrok MCP
Code Analysis Developer Tools Search
dondetir
A
license
-
quality
F
maintenance
Enables semantic code search for AI assistants by indexing codebases with embeddings and Tree-sitter, returning relevant snippets via natural language queries.
Last updated 2026-03-15
15
MIT
punt-quarry
RAG Systems Search Vector Databases
punt-labs
A
license
-
quality
A
maintenance
Enables local semantic search over documents (PDFs, code, etc.) using local embedding models, allowing AI agents and users to find information by meaning without API keys or cloud services.
Last updated 2026-07-31
3
MIT
smart-coding-mcp
Code Analysis Developer Tools
amalikn
A
license
-
quality
D
maintenance
Enables AI assistants to perform intelligent semantic code search across codebases using local AI embeddings for meaning-based retrieval.
Last updated 2026-03-29
24
MIT

View all related MCP servers

Related MCP Connectors

agent-memory
Persistent semantic memory for AI agents: store and recall text by meaning (RAG). x402
ref-tools-mcp
Token-efficient search for coding agents over public and private documentation.
agentbay-mcp
Persistent memory and knowledge management for AI agents with semantic search and 50+ tools.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Zulaxy/semantic-search-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

semantic-search-mcp

Grep vs. Semantic Search

Quick Start (3 steps)

1. Install

2. Index your project

3. Connect your AI agent

FAQ

Which folder gets indexed?

I have 3 projects. Do I index each one?

Where is the cache stored?

How do I remove the cache?

Do I need to re-index after code changes?

What model does it use?

Is my code sent anywhere?

CLI Commands

Configuration

All options

How It Works

License

Maintenance

Resources

Looking for Admin?

Related MCP Servers

PAMPA

CodeGrok MCP

punt-quarry

smart-coding-mcp

Related MCP Connectors

Latest Blog Posts

MCP directory API