Skip to main content
Glama

Omniscience is a highly optimized Model Context Protocol (MCP) server designed to give Large Language Models (LLMs) token-efficient, surgical access to massive codebases. Instead of flooding the LLM's context window with entire repositories, Omniscience uses a sophisticated Dual-Brain architecture to find exactly what the LLM needsβ€”and absolutely nothing more.

🧠 The Dual-Brain Architecture

graph TD
    A[Codebase] -->|Real-time watcher| B(Omniscience Scanner)
    B -->|Code| C{Dual-Brain Parser}
    
    subgraph Structural Brain
    C -->|AST Parsing| D[Tree-Sitter]
    D -->|Function Definitions & Calls| E[(SQLite Graph DB)]
    end
    
    subgraph Semantic Brain
    C -->|Text/Code| F[Voyage-4-nano]
    F -->|Local Embeddings| G[(LanceDB Vector DB)]
    end
    
    E -.->|Graph Query| H[MCP Client]
    G -.->|Semantic Search| H

1. Structural Brain (Tree-sitter)

Parses the AST (Abstract Syntax Tree) of your codebase in real-time. It maps out exact file locations, boundary lines for functions/classes, and automatically generates a complete Call-Graph (Caller -> Callee relationships) stored in a local SQLite database.

2. Semantic Brain (LanceDB & Voyage-4-nano)

Generates and stores high-quality semantic embeddings of every code symbol completely locally. Allows the LLM to search for abstract concepts ("how does the auth routing work?") using lightning-fast hybrid search.


Related MCP server: MCP Context Manager

πŸ› οΈ Exposed MCP Tools

The server exposes powerful tools to the AI, allowing it to navigate your project like a senior engineer.

Tool

Description

Token Impact

πŸ” semantic_search

Finds relevant code symbols based on a natural language query or keywords.

Low

πŸ•ΈοΈ graph_query

Returns the blast radius of a specific symbol based on the AST Call-Graph.

Low

πŸ“– surgical_read

Extracts only the exact code snippet for a single function or class.

Massive Savings

πŸ—οΈ apply_surgical_patch

Replaces an exact code symbol with new code and triggers a background re-index.

Low

πŸ”„ rebuild_index

Manually triggers a complete re-indexing of the entire workspace.

None


πŸš€ Installation & Setup

Omniscience is designed to be ridiculously fast. We use uv for lightning-fast dependency resolution.

# 1. Clone the repository
# git clone <repo-url>
cd mcp-omniscience

# 2. Run the Initialization Script (Downloads model, syncs env)
./init.sh

IDE Integration

Add Omniscience to your MCP client configuration (mcp_config.json, claude_desktop_config.json, etc.):

{
  "mcpServers": {
    "omniscience": {
      "command": "/path/to/mcp-omniscience/run_server.sh",
      "args": []
    }
  }
}
TIP

No initialization prompt required! When the MCP server starts in a new WORKSPACE_DIR, it automatically builds the vector and graph databases in the background.


πŸ’° Token Cost Analysis

Why use Omniscience over traditional whole-file reading?

  • Full File Read (server.py): ~911 Tokens

  • Omniscience Surgical Read (1 function): ~117 Tokens

  • Context Window Saved: 87.16% per interaction!

By isolating exactly what is needed, the LLM hallucinates less, replies faster, and drastically reduces API costs.


A
license - permissive license
-
quality - not tested
B
maintenance

Maintenance

–Maintainers
–Response time
–Release cycle
–Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/FreakyLetsFail/mcp-omniscience'

If you have feedback or need assistance with the MCP directory API, please join our Discord server