fsq-codebase
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@fsq-codebasefind the rate limiting implementation"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
fsq-codebase
Zero-config codebase indexer with FSQ embeddings for fast semantic code search. Why Finite Scalar Quantization to compress? Because nobody has tried it before, that's why. Also FSQ still loosely maintains the shape of the vector and does not need a codebook, in case I ever wanted to round trip the embeddings back to code (for example for previewing purposes, like a jpg thumbnail).
Features
Fast semantic search: 10x compression with int8 embeddings, 2.7x faster search
Multi-language: Python, JavaScript, TypeScript, Go, Rust, Java, and more
Zero-config: Just point at a directory and search
MCP server: Claude Code integration via Model Context Protocol
Related MCP server: Claude Context Local
Installation
Work in progress. For now you would need to build the model yourself. ANd
Quick Start
Python API
from fsq_codebase import CodebaseIndex, FSQEmbedder
# Index a codebase
index = CodebaseIndex.create("./my-project")
results = index.query("add rate limiting", top_k=10)
print(results.tree())
# Or use the embedder directly
embedder = FSQEmbedder.from_bundled("codet5plus-96d")
embeddings = embedder.encode(["def hello(): pass", "function greet() {}"])MCP Server (Claude Code)
# Start the MCP server
fsq-codebase --index ./codebase.indexConfigure in Claude Code's .mcp.json:
{
"mcpServers": {
"fsq-codebase": {
"command": "fsq-codebase",
"args": ["--index", "./codebase.index", "--verbose"]
}
}
}Bundled Models
Model | Encoder | FSQ Dim | Size |
| CodeT5+ 110M | 96 | 268 KB |
| UniXcoder | 96 | 652 KB |
The encoder (CodeT5+ or UniXcoder) downloads automatically from HuggingFace on first use (~440MB).
Performance
Compared to CodeT5+ baseline on CoIR benchmark:
Model | MRR | Storage | Search Speed |
CodeT5+ baseline | 0.9699 | 1024B | 0.39ms |
fsq-codebase | 0.9706 | 96B | 0.14ms |
10.7x compression with 2.7x faster search while maintaining accuracy.
License
MIT
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
- Your AI Chatbot Just Exposed Your CEO's Salary to an InternBy Om-Shree-0709 on .Agent IdentityMCP SecurityOAuth Delegation
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/cprepos/codeinfuse'
If you have feedback or need assistance with the MCP directory API, please join our Discord server