Kontext MCP Server
Uses Azure OpenAI embeddings for semantic similarity search in the memory retrieval process.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Kontext MCP Serverremember that the user prefers dark mode"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Kontext MCP Server
Own your Kontext: portable, provider‑agnostic memory for AI agents. Never repeat yourself again.
Kontext transforms Azure Data Explorer (Kusto) into a sophisticated context engine that goes beyond simple vector storage. While traditional vector DBs only store embeddings, Kontext provides layered memory with rich temporal and usage signals—combining recency, frequency, semantic similarity, pins, and decay scoring.
Overview
Kontext provides two powerful MCP tools for intelligent memory management:
remember
remember(fact: str, type: str, scope: Optional[str] = "global") -> strStores a memory item in the Kusto-backed memory store with automatic embedding generation.
Parameters:
fact: Text to remembertype: Memory type ("fact","context", or"thought")scope: Memory scope (defaults to"global")
Returns: Unique ID of the stored memory
recall
recall(query: str, filters: Optional[Dict[str, Any]] = None, top_k: int = 10) -> List[Dict[str, Any]]Retrieves relevant memories using semantic similarity and KQL-powered ranking.
Parameters:
query: Search query for semantic matchingfilters: Optional filters (e.g.,{"type": "fact", "scope": "global"})top_k: Maximum number of results to return
Returns: List of memory objects with metadata (id, fact, type, scope, creation_time, sim)
Related MCP server: ogham-mcp
Why Kontext?
The Gap: Agents need intelligent memory that considers not just semantic similarity, but also temporal patterns, usage frequency, and contextual relevance. Most vector databases fall short by ignoring these rich signals and locking you into a single cloud provider.
The Solution: Kontext leverages Kusto's powerful query language (KQL) to score and rank memories using multiple dimensions:
// Conceptual query for scoring memories
Memory
| extend score = w_t * exp(-ago(ingest)/7d) *
w_f * log(1+hits) *
w_s * cosine_sim *
w_p * pin
| top 20 by scoreKey Benefits
Temporal Reasoning: Native timestamp handling, retention policies, and time-decay scoring
Semantic Retrieval: Built-in vector columns with cosine similarity search
Expressive Ranking: KQL enables complex scoring that weighs time, frequency, pins, and semantics
Cost Effective: Free tier with instant provisioning and predictable scaling
True Portability: Simple MCP API keeps your models and cloud providers interchangeable
Architecture
Agent ⇆ Kontext MCP
├── remember(fact, meta)
└── recall(query, meta)
↓
Azure KustoIngest: Text splitting → embedding generation → vector + metadata storage
Retrieve: KQL-powered scoring combines temporal, frequency, semantic, and pin signals
Quick Setup
Add Kontext to your MCP settings with the following configuration:
{
"servers": {
"kontext": {
"type": "stdio",
"command": "uvx",
"args": ["kontext-mcp"],
"env": {
"KUSTO_CLUSTER": "https://your-cluster.kusto.windows.net/",
"KUSTO_DATABASE": "your-database",
"KUSTO_TABLE": "Memory",
"EMBEDDING_URI": "https://your-openai.azure.com/openai/deployments/text-embedding-3-large/embeddings?api-version=2023-05-15;managed_identity=system"
}
}
}
}Environment Variables:
KUSTO_CLUSTER: Your Azure Data Explorer cluster URLKUSTO_DATABASE: Database name for storing memoriesKUSTO_TABLE: Table name for memory storage (default: "Memory")EMBEDDING_URI: Azure OpenAI endpoint for embedding generation
Current Features
remember: Store facts with automatic embedding generation using Kusto's
ai_embeddings()pluginrecall: Retrieve semantically similar facts using cosine similarity search
FastMCP Integration: Built on the FastMCP framework for easy tool registration and schema generation
Kusto Backend: Leverages Azure Data Explorer for scalable storage and querying
Roadmap
Advanced Scoring: Multi-dimensional ranking with temporal decay, frequency weighting, and pin support
Memory Tiers: Short-term context, working memory, and long-term fact storage
Hosted Embeddings: Optional E5 model hosting to reduce setup friction
Enhanced Caching: Multi-tier memory management and query optimization
License
MIT License - see LICENSE file for details.
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
This server cannot be installed
Maintenance
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/danield137/kontext-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server