The Vectara MCP server provides a Model Context Protocol (MCP) compliant interface for AI systems to access Vectara's Trusted RAG platform, enabling fast, reliable RAG with reduced hallucination.
Core Capabilities:
RAG Query with Generation: Run semantic search queries with AI-generated responses using the
ask_vectara
toolSemantic Search Only: Perform search queries without generation using the
search_vectara
toolHallucination Detection & Correction: Identify and fix hallucinations in generated text using Vectara's VHC API with the
correct_hallucinations
toolFactual Consistency Evaluation: Assess how well generated text matches source documents using the
eval_factual_consistency
toolAPI Key Management: Securely configure and manage Vectara API keys using
setup_vectara_api_key
andclear_vectara_api_key
Infrastructure Features:
Multiple Transport Modes: Support for HTTP (recommended), SSE, and STDIO transport protocols
Production-Ready Security: Built-in bearer token authentication, rate limiting, CORS protection, and HTTPS readiness
Flexible Configuration: Customizable through command-line arguments and environment variables
MCP Integration: Compatible with MCP clients including Claude Desktop
Vectara MCP Server
🔌 Compatible with
Vectara MCP is also compatible with any MCP client
The Model Context Protocol (MCP) is an open standard that enables AI systems to interact seamlessly with various data sources and tools, facilitating secure, two-way connections.
Vectara-MCP provides any agentic application with access to fast, reliable RAG with reduced hallucination, powered by Vectara's Trusted RAG platform, through the MCP protocol.
Installation
You can install the package directly from PyPI:
Available Tools
ask_vectara: Run a RAG query using Vectara, returning search results with a generated response.
Args:
query: str, The user query to run - required.
corpus_keys: list[str], List of Vectara corpus keys to use for the search - required. Please ask the user to provide one or more corpus keys.
api_key: str, The Vectara API key - required.
n_sentences_before: int, Number of sentences before the answer to include in the context - optional, default is 2.
n_sentences_after: int, Number of sentences after the answer to include in the context - optional, default is 2.
lexical_interpolation: float, The amount of lexical interpolation to use - optional, default is 0.005.
max_used_search_results: int, The maximum number of search results to use - optional, default is 10.
generation_preset_name: str, The name of the generation preset to use - optional, default is "vectara-summary-table-md-query-ext-jan-2025-gpt-4o".
response_language: str, The language of the response - optional, default is "eng".
Returns:
The response from Vectara, including the generated answer and the search results.
search_vectara: Run a semantic search query using Vectara, without generation.
Args:
query: str, The user query to run - required.
corpus_keys: list[str], List of Vectara corpus keys to use for the search - required. Please ask the user to provide one or more corpus keys.
api_key: str, The Vectara API key - required.
n_sentences_before: int, Number of sentences before the answer to include in the context - optional, default is 2.
n_sentences_after: int, Number of sentences after the answer to include in the context - optional, default is 2.
lexical_interpolation: float, The amount of lexical interpolation to use - optional, default is 0.005.
Returns:
The response from Vectara, including the matching search results.
Configuration with Claude Desktop
Add to your claude_desktop_config.json:
Usage in Claude Desktop App
Once the installation is complete, and the Claude desktop app is configured, you must completely close and re-open the Claude desktop app to see the Vectara-mcp server. You should see a hammer icon in the bottom left of the app, indicating available MCP tools, you can click on the hammer icon to see more detial on the Vectara-search and Vectara-extract tools.
Now claude will have complete access to the Vectara-mcp server, including the ask-vectara and search-vectara tools. When you issue the tools for the first time, Claude will ask you for your Vectara api key and corpus key (or keys if you want to use multiple corpora). After you set those, you will be ready to go. Here are some examples you can try (with the Vectara corpus that includes information from our website:
Vectara RAG Examples
Querying Vectara corpus:
Searching Vectara corpus:
Acknowledgments ✨
Model Context Protocol for the MCP specification
Anthropic for Claude Desktop
remote-capable server
The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.
Vectara MCP server
- Installation
- Available Tools
- Configuration with Claude Desktop
- Usage in Claude Desktop App
- Acknowledgments ✨
Related Resources
Related MCP Servers
- MIT License
- AsecurityFlicenseAqualityVictoriaMetrics-mcp-serverLast updated -617
- -securityAlicense-qualityTeradata MCP Server by CDataLast updated -MIT License