What can you do with this server?

SIFS is a fast local code-search MCP server that lets AI agents search, inspect, and navigate codebases using natural language, semantic, or exact-symbol queries. Search & Query * search — Search using natural language, code snippets, or exact symbols with three modes: hybrid (default), bm25 (exact identifiers/literals), or semantic (conceptual queries). Supports filtering by language or file paths and optional ranking explanations. * find_related — Find code chunks semantically similar to a known file path + line number, useful for exploring code after an initial search hit. Codebase Management * index_status — Inspect the active index: file/chunk counts, language coverage, and cache state. * refresh_index — Rebuild and replace the in-memory index cache after file changes. * clear_index — Remove a source from the in-memory cache to trigger a fresh rebuild on next call. * list_files — List all repository-relative file paths in the active index. * get_chunk — Retrieve the indexed code chunk at a specific file path and line number. Agent Integration & Configuration * agent_context — Retrieve the versioned SIFS CLI/MCP contract (commands, flags, tools). * agent_print — Render agent-specific artifacts (skills, snippets, MCP guidance) for targets like Codex, Claude Code, OpenClaw, or Hermes, without writing files. * agent_doctor — Inspect the readiness of SIFS agent artifacts for one or all supported targets. * init_agent — Create the SIFS Claude agent file in the shared workspace (e.g., .claude/agents/sifs-search.md). * profile_list / profile_show — List and inspect saved profiles (stored defaults for source, mode, etc.). Feedback * feedback_create — Record local feedback about agent interaction friction, optionally tied to a query and expected result. * feedback_list — List previously recorded local feedback entries.

Which integrations are available for this server?

Enables Hermes agents to perform efficient code search using SIFS's indexing and ranking capabilities.

de en es ja ko ru zh

sifs

by tristanmanchester

Overview Schema Related Servers Score Discussions

Rust

Local

SIFS (SIFS Is Fast Search) is a local code-search engine for AI coding agents and the developers who work through them. Point it at a repository and ask questions like "where is authentication handled?", "what validates session tokens?", or "which code builds the MCP handshake?". SIFS returns ranked file paths, line ranges, and code chunks fast enough for an agent to search before reading half the tree.

It runs as a CLI, a Rust crate, or a local MCP server. BM25 mode runs offline with no model files; hybrid and semantic modes run locally once the embedding model is cached.

Across 63 repositories and 1,251 annotated tasks, SIFS builds a cold sparse index in 167 ms, answers warm queries in 2.7 ms, and scores NDCG@10 = 0.8471.

Use SIFS to:

Find the code behind a natural-language question.
Look up exact symbols and identifiers without warming an IDE.
Hand an LLM a compact context pack instead of a whole repository.
Give Codex, Claude Code, Cursor, OpenClaw, Hermes, or another agent a search tool it can call before broad file reads.

Quickstart

Install SIFS with Homebrew:

brew install tristanmanchester/tap/sifs

Or install it with Cargo:

cargo install --locked sifs

Then search any local project:

cd /path/to/project
sifs search "where is authentication handled" --mode bm25 --offline --limit 5

BM25 mode runs offline with no model download. Results include the matching file, line range, score, ranking mode, and code chunk.

For semantic search, cache the local model and use the default hybrid mode:

sifs model pull
sifs search "what checks whether a session is expired" --limit 5
sifs pack "how login sessions are created and validated" --budget-tokens 6000 --json

The default search mode is hybrid (semantic + BM25). Omit --source to search the current directory, or pass a local path or Git URL explicitly:

sifs search "parse JWT claims" --source /path/to/project --mode bm25 --offline --limit 10
sifs symbol SessionToken --source /path/to/project --offline --json
sifs outline src/auth/session.rs --source /path/to/project --offline \
  --symbols-limit 200 --chunks-limit 100 --json
sifs find-related src/auth/session.rs 42 --source /path/to/project --limit 8
sifs search "stream upload backpressure" --source https://github.com/owner/project --limit 5

Common entry points:

Goal	Command
Search without downloads	`sifs search "query" --mode bm25 --offline`
Use semantic + lexical ranking	`sifs model pull`, then `sifs search "query"`
Build an offline context pack	`sifs pack "query" --mode bm25 --offline --budget-tokens 6000 --json`
Build a hybrid context pack	`sifs model pull`, then `sifs pack "query" --budget-tokens 6000 --json`
Inspect what was indexed	`sifs status --json`, `sifs list-files --json`, `sifs symbol <name> --json`, `sifs outline <file> --json`
Teach an agent to use SIFS	`sifs agent install --target codex --artifact snippet --file AGENTS.md`

Related MCP server: codeix

Agent integration

SIFS is most useful when agents know they can search first. Install a project instruction snippet or local skill so Codex, Claude Code, OpenClaw, Hermes, or any skill-aware agent uses SIFS before broad file reads:

sifs agent print --target codex --artifact snippet
sifs agent install --target codex --artifact snippet --file AGENTS.md --dry-run --json
sifs agent install --target codex --artifact snippet --file AGENTS.md
sifs agent doctor --target codex --json

Generated guidance is CLI-first. Agents use MCP tools when they're visible in the current session and otherwise fall back to shell commands like sifs search, sifs pack, sifs list-files, sifs get, and sifs agent-context --json.

Full integration reference: docs/agent-integration.md.

Features

Fast local search. 167 ms cold sparse index, 2.7 ms warm query, 4.9 µs cached repeat. Rust, CPU-only.
Strong cross-language quality. NDCG@10 of 0.8471 across 63 repositories, 19 languages, and 1,251 annotated tasks.
Three search modes. hybrid for most queries, semantic for natural language, bm25 for symbols and identifiers. Switch per query.
Offline-capable. BM25 needs no model. Hybrid and semantic work offline once the model is cached.
MCP server. Stdio server for Claude Code, Codex, Cursor, and any MCP-compatible agent. Sources index on demand and refresh on request.
Structural inspection. Browse indexed paths, symbols, file outlines, chunks, related code, and context packs.
Agent skills and snippets. Render, install, inspect, and remove SIFS guidance with sifs agent.
Local and remote sources. Pass a local path or Git URL with --source.
Machine-readable contract. sifs agent-context --json describes every command, flag, and tool.
Profiles and feedback. Save defaults for repeated sessions and log friction with sifs feedback.
Benchmark diagnostics. Run quality and latency benchmarks with the diagnostics feature.

Install

# crates.io
cargo install --locked sifs

# Homebrew
brew install tristanmanchester/tap/sifs

# From source
cargo build --release
target/release/sifs search "authentication flow" --source . --mode bm25 --offline

Keep installed binaries current with:

sifs update --check
sifs update --dry-run
sifs update

sifs update delegates to Cargo or Homebrew only when the current binary is owned by that package manager. For copied, development, or ambiguous binaries, it prints manual next actions rather than touching an unrelated install.

The sifs-benchmark and sifs-embed diagnostic binaries require the diagnostics feature:

cargo build --release --features diagnostics --bins

Run the test suite after changing indexing, chunking, ranking, model loading, or MCP behavior:

cargo test

MCP server

Install SIFS as a local stdio MCP server in two commands:

sifs daemon install-agent
sifs mcp install --client all

This registers a reusable server. Tool calls pass source to target a specific local checkout or Git URL.

To pin the server to a single source:

sifs mcp install --client all --source /path/to/project
sifs mcp install --client codex --source /path/to/project
sifs mcp install --client claude --scope local --source /path/to/project

You can also start the server directly. Without --source, the server uses its working directory as the default. Passing --source pins the server to that source, so MCP clients can call search and find_related without sending a source on every tool call.

sifs mcp
sifs mcp --source /path/to/project

The installer calls the client CLIs when they're available:

codex mcp add sifs -- /absolute/path/to/sifs mcp
claude mcp add-json sifs '{"type":"stdio","command":"/absolute/path/to/sifs","args":["mcp"],"env":{}}' --scope local

If a client CLI isn't available, sifs mcp install --dry-run prints the config to paste manually.

Codex (~/.codex/config.toml):

[mcp_servers.sifs]
command = "/absolute/path/to/sifs"
args = ["mcp"]
startup_timeout_sec = 20
tool_timeout_sec = 60

Claude Code (.mcp.json in your project):

{
  "mcpServers": {
    "sifs": {
      "type": "stdio",
      "command": "/absolute/path/to/sifs",
      "args": ["mcp"],
      "env": {}
    }
  }
}

Only commit a project-scoped .mcp.json to repositories you trust. It grants read access to whatever local paths tool calls pass in.

To run the daemon directly:

sifs daemon run --replace-existing-socket
sifs daemon ping
sifs daemon status --json

CLI

# Search the current directory
sifs search "where is authentication handled"

# Search a local project with hybrid ranking
sifs search "parse oauth callback" --source /path/to/project --mode hybrid --limit 10

# Use model-free offline BM25 search
sifs search "SessionToken" --source /path/to/project --mode bm25 --offline --limit 10

# Search a remote Git repository
sifs search "stream upload backpressure" --source https://github.com/owner/project

# Find code related to a known location
sifs find-related src/auth/session.rs 42 --source /path/to/project --limit 8

Use --json, --jsonl, or --format for structured output. Use --language, --filter-path, and --context-lines when an agent needs narrower results.

Use profiles for repeated agent sessions:

sifs profile save current --source /path/to/project --mode bm25 --offline --json
sifs search "mcp startup" --profile current --json

Index caches live in platform cache directories by default (~/Library/Caches/sifs on macOS, ${XDG_CACHE_HOME:-~/.cache}/sifs on Linux). Override with --cache-dir, disable with --no-cache, or opt into a repo-local .sifs/ cache with --project-cache.

Full CLI reference: docs/cli.md.

Platform support

Direct CLI search, library use, and MCP stdio work on macOS and Linux. The shared sifs daemon uses same-user Unix sockets, so daemon mode runs on Unix only. On Windows, use direct CLI or MCP stdio. sifs doctor --json reports daemon platform status.

Rust library

use sifs::{SearchMode, SearchOptions, SifsIndex};

fn main() -> anyhow::Result<()> {
    let index = SifsIndex::from_path("/path/to/project")?;
    let results = index.search_with(
        "where is authentication handled",
        &SearchOptions::new(5).with_mode(SearchMode::Hybrid),
    )?;

    for result in results {
        println!("{} {}", result.chunk.location(), result.score);
    }

    Ok(())
}

Use SifsIndex::from_path_sparse for a BM25-only index that never touches semantic state. Use SifsIndex::from_git for remote repositories. Full API docs, model policy, filters, and chunk-level construction: docs/library.md.

How it works

SIFS walks a repo with .gitignore-aware file selection, splits files into code chunks, builds a sparse BM25 index, and loads semantic state lazily when a semantic or hybrid query needs it.

bm25 — sparse lexical search. Good for identifiers, symbols, and exact terms. No model files required.

semantic — embedding similarity using minishlab/potion-code-16M through a local Model2Vec loader. Tensors and tokenizer files load directly into the Rust process and stay on the machine.

hybrid — the default. Semantic and BM25 rankings fuse with reciprocal rank fusion, then rerank. Symbol-like queries lean on BM25; natural-language questions keep more semantic weight.

Query-aware mode weighting. Symbol queries (Foo::bar, getUserById) get more BM25 weight. Natural-language queries stay balanced.
Definition boosts. A chunk that defines the queried symbol (class, fn, def) ranks above chunks that only reference it.
Identifier stemming. Query tokens are stemmed and matched against identifier stems, so parse config boosts chunks containing parseConfig, ConfigParser, or config_parser.
File coherence. When multiple chunks from the same file match, the file is boosted so results reflect file-level relevance.
Noise penalties. Test files, compat//legacy/ shims, example code, and .d.ts stubs are down-ranked so canonical implementations surface first.

Use sifs model pull (or its alias sifs model fetch) to pre-download the default model. Use sifs doctor to confirm semantic search is ready for offline use.

Benchmarks

Benchmarks run across 63 pinned open-source repositories, 19 languages, and 1,251 annotated search tasks.

SIFS search quality versus warm uncached query latency

Method	NDCG@10	Cold index	Warm query	Cached repeat
CodeRankEmbed Hybrid	0.8617	57.3 s	16.9 ms	n/a
Semble	0.8544	439.4 ms	1.3 ms	n/a
SIFS	0.8471	167.0 ms	2.7 ms	0.0049 ms
CodeRankEmbed	0.7648	57.3 s	13.3 ms	n/a
ColGREP	0.6925	3.9 s	979.3 ms	n/a
grepai	0.5606	35.0 s	47.7 ms	n/a
probe	0.3872	—	207.1 ms	n/a
ripgrep	0.1257	—	8.8 ms	n/a

SIFS reports separate timing fields so caching effects stay legible:

cold_index_ms — fresh sparse/chunk index, no persistent cache
cold_semantic_build_or_load_ms — first semantic embedding build or load
cold_first_search_ms — first search, including semantic first-use cost
warm_uncached_query_ms — normal query after the index exists (use this for comparisons)
warm_cached_repeat_query_ms — repeated identical query in the same process

Quality by query type

SIFS is strongest on symbol queries but holds up well on semantic and architecture questions too.

Query type	NDCG@10
symbol	0.9711
semantic	0.8412
architecture	0.7857

SIFS quality by query type and search mode

Context efficiency

The chart below tracks how quickly annotated relevant files enter an agent's context as retrieved chunks are added to the prompt budget.

SIFS context efficiency: recall versus retrieved context tokens

Full methodology, per-language breakdown, ablations, and benchmark artifacts: docs/benchmark-report.md.

File coverage

SIFS indexes code files by default and skips generated files, dependency directories, and caches. It uses the ignore crate, so .gitignore files, Git excludes, global ignores, and hidden files behave the same as in ripgrep or fd.

Recognized extensions: Python, JavaScript, TypeScript, Go, Rust, Java, Kotlin, Ruby, PHP, C, C++, C#, Swift, Scala, Elixir, Dart, Lua, SQL, Bash, Zig, Haskell, Markdown, YAML, TOML, JSON.

Pass --include-docs to add Markdown, YAML, TOML, JSON, and plain text. Use --extension (repeatable) to add custom file types.

Documentation

CLI usage — every command and flag
Rust library — SifsIndex, search modes, filters, indexing options
MCP server — stdio protocol and tool schemas
Agent-native scorecard — agent-facing contract and readiness evidence
Benchmarking — quality, latency, embedding, and smoke benchmarks
Architecture — file selection, chunking, embedding, sparse search, dense search, hybrid ranking

License

MIT

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

2dRelease cycle

10Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Tools

View all tools

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/tristanmanchester/sifs'

If you have feedback or need assistance with the MCP directory API, please join our Discord server