What can you do with this server?

The OpenCode MCP Server is an AI orchestration layer that refines prompts, manages semantic memory, and optimizes token usage to enhance AI model performance and reduce hallucinations. * Prompt Refinement (refine_prompt): Transforms vague prompts into detailed, technically precise instructions using semantic memory, local AI (Ollama), and optionally real-time documentation (Context7), injecting relevant context via structured XML tags. * Context Learning & Memorization (learn_context): Stores preferences, architectural decisions, technical rules, and coding patterns into a semantic vector memory (LanceDB), optionally organized by category (e.g., architecture, style, preference). * Semantic Memory Retrieval: Searches and retrieves the most relevant stored knowledge via vector embeddings during prompt refinement, surfacing only exact, relevant context. * Token Optimization: Reduces tokens sent to your LLM by pre-processing and enriching prompts locally, achieving up to ~88% token savings compared to full-context injection. * Hybrid Routing: Intelligently balances local memory hits and external documentation lookups based on a configurable confidence threshold to minimize token waste and latency. * Real-time Documentation Injection: Fetches and caches (LRU) the latest official documentation for inferred technologies (e.g., React, Node.js, Supabase) and injects them into refined prompts to eliminate hallucinations. * Project Indexing: Scans the project structure to build a structural map in memory for instant architectural awareness. * Semantic Dashboard: Provides a local dashboard to visualize memory health, knowledge distribution, and statistics.

How do I use OpenCode MCP Server?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@OpenCode MCP Server refine my prompt for adding login feature" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

de en es ja ko ru zh

OpenCode MCP Server

by marlondivino

Overview Schema Related Servers Score Discussions

TypeScript

Hybrid

This repository contains the OpenCode MCP Server, a high-performance orchestration layer based on the Model Context Protocol (MCP). It is designed to act as a Proactive Architectural Assistant for Antigravity, transforming how AI interacts with your codebase.

The TL;DR: Think of it like a Tech Startup 🏢

If your local AI setup was a company:

Ollama is the Muscle 🦾. It crunches the numbers, rewrites text, and generates vector embeddings.
LanceDB is the Archive 🗄️. It securely stores and instantly retrieves snippets of your past technical decisions.
Context7 is the Lead Researcher 🕵️. It fetches the latest official documentation whenever a specific technology is mentioned.
OpenCode is the Engineering Manager 👔. It receives your vague request, tells Ollama to rewrite it, searches the Archive for context, asks the Researcher for the latest docs, and packages it all into a perfect "Super Prompt" before handing it over to the CEO (your main LLM, like Claude or Antigravity) to write the final code.

The main goal of this MCP is to drastically save tokens and prevent hallucinations by ensuring your main AI model only processes highly refined, contextualized prompts.

Features

Prompt Refinement: Transforms vague prompts into detailed and technical instructions.
Real-time Documentation (Context7): Automatically fetches the latest documentation for technologies like Supabase, React, Tailwind, etc.
Development Support: Assists in bug fixing and implementing new features with a focus on efficiency.
Semantic Memory: Stores and retrieves technical context using Semantic Chunking, Category Filtering, and XML Formatting.
Hybrid Routing (NEW): Intelligently balances local memory hits and external documentation (Context7) based on confidence thresholds to minimize token waste.
Documentation Caching (NEW): Built-in LRU cache for external docs to reduce latency and redundant API calls.
Proactive Indexing: Automatically maps your project structure to memory for instant architectural awareness.
Memory Dashboard: Visualize your knowledge distribution and memory health.

Related MCP server: Mem0 MCP Server

Architecture

The OpenCode MCP Server acts as an orchestration layer between the AI Client and local specialized tools.

graph TD
    Client["AI Client (Antigravity/Claude)"] -- "MCP Protocol (Stdio/HTTP)" --> Server["OpenCode MCP Server"]

    subgraph "OpenCode Engine"
        Server --> Tools["Tools: refine_prompt / learn_context"]
        Tools --> Memory["Memory Manager"]
        Tools --> Cache[("LRU Cache (5m TTL)")]
        Cache --> Docs["Context7 Fetcher"]
    end

    subgraph "Local Infrastructure"
        Memory -- "Store/Search" --> LDB[("LanceDB Vector Store")]
        Memory -- "Generate Embeddings" --> OLL["Ollama: nomic-embed-text"]
    end

    subgraph "External Resources"
        Docs -- "HTTPS/SSE" --> C7["Context7 API (Upstash)"]
    end

    Server -- "Refined Prompt + Local/External Context" --> Client

The Process Flow

sequenceDiagram
    participant User as User / Developer
    participant AG as AI Client (Antigravity)
    participant OC as OpenCode MCP Server
    participant LDB as LanceDB (Memory)
    participant OLL as Ollama (Local AI)
    participant Cache as LRU Cache
    participant C7 as Context7 (API)

    User->>AG: "How do I fix the auth bug?"
    Note over AG: Rule 1: Notify User & Refine
    AG->>User: "Refining with OpenCode for precision..."
    AG->>OC: refine_prompt("How do I fix the auth bug?")

    OC->>OLL: Rewrite prompt & infer technologies (e.g., llama3)
    OLL-->>OC: JSON { refinedPrompt, technologies: ["react"] }

    OC->>OLL: Generate embedding for refined prompt (nomic-embed-text)
    OLL-->>OC: Vector representation

    OC->>LDB: Vector search for top-relevant context
    LDB-->>OC: Snippets: "Auth uses JWT" (Returns Confidence Score)

    opt If Confidence < Threshold & Tech Inferred
        OC->>Cache: Check for cached docs (e.g., "react")
        alt Cache Hit
            Cache-->>OC: Return cached docs
        else Cache Miss
            OC->>C7: Fetch documentation (Context7)
            C7-->>OC: Latest API reference & examples
            OC->>Cache: Store docs in LRU Cache (5m TTL)
        end
    end

    Note over OC: Merge Refined Prompt + Local Memory + External Docs
    OC-->>AG: "SUPER PROMPT + <semantic_memory> + <external_documentation>"

    Note over AG: Generate high-precision answer
    AG->>User: Technical solution with codebase context

Technology Stack

The solution is built using a modern and efficient stack designed for high performance and local privacy:

OpenCode: The core orchestration engine that manages tool execution, prompt refinement logic, and semantic memory integration.
Model Context Protocol (MCP): The standard protocol for connecting AI models to local/remote data and tools.
LanceDB: A serverless, high-performance vector database that allows for incredibly fast semantic searches without the overhead of a traditional database server.
Ollama: Orchestrates local AI models. We use nomic-embed-text to generate high-quality vector embeddings locally, ensuring your technical data never leaves your machine.
TypeScript & Node.js: Provides a type-safe and performant runtime environment for the server logic.
Express: Used for the Remote Mode (HTTP/SSE), providing a robust foundation for the Streamable HTTP transport.

Prerequisites

Before starting, you need to set up the development environment. We recommend using NVM (Node Version Manager) to manage Node.js versions on Windows.

1. NVM and Node.js Installation (Windows)

Download the nvm-setup.exe installer from nvm-windows.
Follow the installation instructions.
Open a new PowerShell terminal and install the recommended Node.js version:
```
nvm install 22
nvm use 22
```

2. Ollama Installation (For Local AI)

Ollama is heavily used by OpenCode MCP for two distinct local tasks:

Embeddings: Generating vectors for semantic memory (nomic-embed-text).
Local Refinement: Rewriting vague prompts and inferring technologies dynamically (llama3 by default).
Open PowerShell as Administrator and run:
```
winget install ollama
```

After installation, restart the terminal and download the required models:

# Model for Vector Embeddings (Required)
ollama pull nomic-embed-text

# Model for Prompt Refinement & Inference (Recommended: llama3, qwen2.5:0.5b, etc.)
ollama pull llama3

3. Verify Installation

Check if the tools are ready in PowerShell:

node -v # Should return v22.x.x or higher
ollama --version

Project Installation and Configuration

Follow the steps below to configure the OpenCode MCP Server using PowerShell:

Clone the repository:

git clone <repository-url>
cd open-code-as-mcp

Install dependencies:
```
npm install
```
Build the project:
```
npm run build
```

Antigravity Configuration

To integrate this MCP server with Antigravity, you must choose between Local mode (running on the same machine) or Remote mode (running on a server/cloud).

Option A: Local Configuration (Stdio)

Use this option if the server is on the same machine as the client.

Global Memory (Default)

Memory will be shared across all projects and stored in the server folder.

{
  "mcpServers": {
    "opencode": {
      "command": "node",
      "args": ["D:/IA/MCP/open-code-as-mcp/build/index.js"]
    }
  }
}

Per-Project Memory (Recommended)

For each project to have its own isolated memory inside the project's .mcp_memory folder.

IMPORTANT

Always useabsolute paths in the MCP_MEMORY_PATH environment variable when configuring the server in a global MCP config (like Claude Desktop). This ensures the server finds the correct folder regardless of the current working directory.

{
  "mcpServers": {
    "opencode": {
      "command": "node",
      "args": ["D:/IA/MCP/open-code-as-mcp/build/index.js"],
      "env": {
        "MCP_MEMORY_PATH": "D:/IA/MCP/open-code-as-mcp/.mcp_memory/vectors"
      }
    }
  }
}

Note: Be sure to add .mcp_memory/ to your .gitignore if you don't want to version the database.

TIP

EnsureOllama is running and you have downloaded the model with ollama pull nomic-embed-text.

Context7 Integration (Optional)

To enable real-time documentation retrieval, you can add your Context7 API key (get it at context7.com). OpenCode uses a local model via Ollama to intelligently rewrite your prompt and infer which technologies are mentioned before fetching their official documentation.

{
  "mcpServers": {
    "opencode": {
      "command": "node",
      "args": ["D:/IA/MCP/open-code-as-mcp/build/index.js"],
      "env": {
        "CONTEXT7_API_KEY": "your_api_key_here",
        "ENABLE_CONTEXT7": "true",
        "USE_HYBRID": "true",
        "LOCAL_CONFIDENCE_THRESHOLD": "0.7",
        "MCP_INFERENCE_MODEL": "llama3"
      }
    }
  }
}

New Hybrid Configuration Parameters:

Variable	Default	Description
`USE_HYBRID`	`true`	When enabled, only calls Context7 if local memory confidence is below the threshold.
`LOCAL_CONFIDENCE_THRESHOLD`	`0.7`	Value between 0-1. Higher values force more frequent external documentation lookups.

TIP

Ensure you have pulled the inference model configured inMCP_INFERENCE_MODEL (e.g., ollama pull llama3) to allow the local refinement step to work properly.

Option B: Remote Configuration (Streamable HTTP)

Use this option if the server is running remotely. The server uses the modern Streamable HTTP transport, which is more robust and efficient.

{
  "mcpServers": {
    "opencode": {
      "url": "http://your-remote-server:3000/mcp"
    }
  }
}

Note: The server also maintains backward compatibility for legacy clients at http://your-remote-server:3000/sse.

Automatic Usage and Global Rules

To ensure Antigravity consistently follows best practices, the Global Rules are stored in two key locations:

Global Level (Windows): Inside the GEMINI.md file, located in your user profile: %USERPROFILE%\.gemini\GEMINI.md (a copy is available in this repo as GEMINI.md).
Project Level: Inside the .cursorrules file in the root of this repository.

Access via Environment Variable

You can reference the global rules path by setting an environment variable in your terminal or system configuration:

$env:ANTIGRAVITY_RULES_PATH = "$HOME\.gemini\GEMINI.md"

The Rules

To ensure Antigravity uses this MCP correctly, configure the following rules in your System Prompt:

Antigravity Global Rules

Prompt Refinement: Whenever the user sends a request, first announce to the user: "Refining your request with OpenCode for technical precision...", then use opencode:refine_prompt.
Context Enrichment: Upon receiving the refined prompt, validate if there are technical terms or project patterns that require additional lookup in semantic memory. Mention if you are pulling specific context from OpenCode memory.
Continuous Learning: After successfully implementing a complex feature, use opencode:learn_context. Briefly inform the user that this knowledge is being persisted in OpenCode's semantic memory.

TIP

You can find the raw version of these rules in the.cursorrules or GEMINI.md file for easy copying into your System Prompt.

Available Tools

The OpenCode MCP provides the following tools:

1. `refine_prompt`

Refines a development prompt to make it clearer and more efficient, injecting targeted context via XML tags.

Arguments:
- prompt: (string) The original prompt that needs refinement.
- categoryFilter: (string, optional) Optional category to filter memories (e.g., 'architecture', 'style') to increase precision and reduce token usage.

2. `learn_context`

Memorizes important information (preference, technical rule, context) for future use in semantic memory.

Arguments:
- information: (string) The information to be remembered.
- category: (string, optional) Information category (e.g., 'preference', 'architecture', 'style').

3. `search_memory`

Directly queries the semantic memory without refining a prompt.

Arguments:
- query: (string) The search query.
- category: (string, optional) Filter results by category.
- limit: (number, optional) Number of results to return.

4. `index_codebase`

Performs a recursive scan of the project to build a structural map in memory.

Arguments:
- path: (string, optional) Root path to scan.

📊 Semantic Dashboard

You can visualize your memory health and stats using the local dashboard:

node dashboard.cjs

Remote Access (SSE)

The server supports remote access via SSE (Server-Sent Events). To run in remote mode in PowerShell, use:

Running in remote mode:

$env:MCP_MODE="sse"; $env:PORT="3000"; npm start

Development

To run the server in development mode with hot-reload in PowerShell:

npm run dev

Debugging

You can test the server locally by running in PowerShell:

node build/test-mcp.js

Token Efficiency Validation

A technical analysis was performed to measure the efficiency of semantic retrieval vs. full-context injection.

Test Scenario 1: Local Semantic Memory (Auth Middleware Migration)

Knowledge Base: Complex technical documentation for migrating session-based authentication to JWT, including security rules and legacy fallback patterns (~8,000 characters).
Query: "How to implement the JWT fallback for legacy session endpoints?"

Results (Local Memory)

Metric	Traditional (Full Context)	MCP (Semantic Retrieval)	Efficiency Gain
Characters Sent	~8,000	~950	~88% Savings
Tokens (Est. 1:4)	~2,000	~238	~88% Savings
Response Accuracy	Medium (Noise risk)	High (Exact context)	Qualitative Boost

Test Scenario 2: Context7 Real-Time Documentation Impact

Scenario: A developer asks a poorly formulated, open-ended question about a specific technology that requires strong external context.
Query: "how do I create a server with nodejs? is there a fast way?"

Results (Context7 Integration)

Metric	Without Context7 (Local Only)	With Context7 Enabled	Impact
Tokens (Est.)	~285 tokens	~1,712 tokens	+1,427 tokens
Context Quality	Limited to local codebase memory	High (Injected official Node.js/Express docs)	Drastic Contextual Boost
Risk of Hallucination	High (Model relies on generic training data)	Zero (Grounded by official `<external_documentation>`)	Precise Answers

Test Scenario 3: Hybrid Routing Optimization (The "Smart Balance")

Scenario: Frequent technical queries where some are specific to the local repo and others are generic to the framework.
Query: "How do we handle S3 uploads in this project?" (Local info exists) vs "How does S3 multipart upload work?" (Needs external docs)

Results (Hybrid Mode)

Metric	Context7 Always On	Hybrid Mode (Local First)	Benefit
Avg. Tokens (Mixed Workload)	~1,800 tokens	~550 tokens	~70% Savings
Avg. Latency	~450ms (Network heavy)	~120ms (Local first)	~73% Faster
Documentation Quality	Maximum (Always fresh)	Optimized (Local patterns preferred)	Reduced "Context Bloat"

Conclusion: The Hybrid Routing strategy is the "Golden Ratio" of MCP performance. It ensures that when you ask a project-specific question, you aren't paying the token/latency tax of external lookups, while still providing a safety net for framework-level queries. Combined with the LRU Cache, it makes the OpenCode MCP one of the most cost-efficient orchestrators available.

Advanced Optimizations

To maximize efficiency, the server actively implements:

Semantic Chunking: Large knowledge blocks are automatically split into smaller, focused chunks before being embedded. This ensures only the exact relevant paragraph is retrieved.
Category Filtering: Queries can be scoped to specific categories (e.g., architecture or style), significantly reducing noise and allowing the result limit to be tightened.
XML Context Formatting: Retrieved memories are injected into the prompt using strict XML tags (<semantic_memory> and <context_item>). This aligns with how modern LLMs best parse context, eliminating attention dilution.

Benefits of OpenCode MCP

Token Savings: By refining prompts locally, we reduce the context load sent to Antigravity.
Enriched Context: OpenCode can access local files and provide richer context for Antigravity.
Agility: Fast responses for refinement tasks.

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Tools

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/marlondivino/open-code-as-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

The TL;DR: Think of it like a Tech Startup 🏢

Features

Architecture

The Process Flow

Technology Stack

Prerequisites

1. NVM and Node.js Installation (Windows)

2. Ollama Installation (For Local AI)

3. Verify Installation

Project Installation and Configuration

Antigravity Configuration

Option A: Local Configuration (Stdio)

Global Memory (Default)

Per-Project Memory (Recommended)

Context7 Integration (Optional)

New Hybrid Configuration Parameters:

Option B: Remote Configuration (Streamable HTTP)

Automatic Usage and Global Rules

Access via Environment Variable

The Rules

Antigravity Global Rules

Available Tools

1. refine_prompt

2. learn_context

3. search_memory

4. index_codebase

📊 Semantic Dashboard

Remote Access (SSE)

Running in remote mode:

Development

Debugging

Token Efficiency Validation

Test Scenario 1: Local Semantic Memory (Auth Middleware Migration)

Results (Local Memory)

Test Scenario 2: Context7 Real-Time Documentation Impact

Results (Context7 Integration)

Test Scenario 3: Hybrid Routing Optimization (The "Smart Balance")

Results (Hybrid Mode)

Advanced Optimizations

Benefits of OpenCode MCP

Maintenance

Resources

Looking for Admin?

Tools

Latest Blog Posts

MCP directory API

1. `refine_prompt`

2. `learn_context`

3. `search_memory`

4. `index_codebase`