How do I use fsq-codebase?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@fsq-codebase find the rate limiting implementation" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

fsq-codebase

by cprepos

Overview Schema Related Servers Score Discussions

Python

Local

fsq-codebase

Zero-config codebase indexer with FSQ embeddings for fast semantic code search. Why Finite Scalar Quantization to compress? Because nobody has tried it before, that's why. Also FSQ still loosely maintains the shape of the vector and does not need a codebook, in case I ever wanted to round trip the embeddings back to code (for example for previewing purposes, like a jpg thumbnail).

Features

Fast semantic search: 10x compression with int8 embeddings, 2.7x faster search
Multi-language: Python, JavaScript, TypeScript, Go, Rust, Java, and more
Zero-config: Just point at a directory and search
MCP server: Claude Code integration via Model Context Protocol

Related MCP server: Claude Context Local

Installation

Work in progress. For now you would need to build the model yourself. ANd

Quick Start

Python API

from fsq_codebase import CodebaseIndex, FSQEmbedder

# Index a codebase
index = CodebaseIndex.create("./my-project")
results = index.query("add rate limiting", top_k=10)
print(results.tree())

# Or use the embedder directly
embedder = FSQEmbedder.from_bundled("codet5plus-96d")
embeddings = embedder.encode(["def hello(): pass", "function greet() {}"])

MCP Server (Claude Code)

# Start the MCP server
fsq-codebase --index ./codebase.index

Configure in Claude Code's .mcp.json:

{
  "mcpServers": {
    "fsq-codebase": {
      "command": "fsq-codebase",
      "args": ["--index", "./codebase.index", "--verbose"]
    }
  }
}

Bundled Models

Model	Encoder	FSQ Dim	Size
`codet5plus-96d`	CodeT5+ 110M	96	268 KB
`unixcoder-96d`	UniXcoder	96	652 KB

The encoder (CodeT5+ or UniXcoder) downloads automatically from HuggingFace on first use (~440MB).

Performance

Compared to CodeT5+ baseline on CoIR benchmark:

Model	MRR	Storage	Search Speed
CodeT5+ baseline	0.9699	1024B	0.39ms
fsq-codebase	0.9706	96B	0.14ms

10.7x compression with 2.7x faster search while maintaining accuracy.

License

MIT

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/cprepos/codeinfuse'

If you have feedback or need assistance with the MCP directory API, please join our Discord server