rag-retrieval-mcp
Provides embedding generation for semantic search using OpenAI's text-embedding models.
Enables vector similarity search in PostgreSQL via the pgvector extension.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@rag-retrieval-mcpfind relevant documents about renewable energy"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
MCP Server for RAG Retrieval
A generic Retrieval-Augmented Generation (RAG) Model Context Protocol (MCP) server with pluggable embedding providers and vector stores.
Why this server?
Vendor MCP servers usually only support their (own) integrated embedding models. If your index uses external embeddings (e.g., OpenAI), those servers can't query it. This server fills that gap — it embeds your query with the provider of your choice, then searches any supported vector store.
Currently Supports
Embedding Providers:
OpenAI (
text-embedding-3-small,text-embedding-3-large,text-embedding-ada-002, etc.)
Vector Stores:
Pinecone
pgvector (PostgreSQL)
Tools
retrieve
Search a knowledge base and return relevant content.
Parameters:
query(string, required) — The search query to find relevant content.
Returns a JSON array of results, each with text, score, and metadata fields.
Install & Run
Run directly with uvx (no install needed):
uvx rag-retrieval-mcp[all]Or install with pip:
pip install rag-retrieval-mcp[all]
rag-retrieval-mcpMCP client configuration
{
"mcpServers": {
"rag-retrieval": {
"command": "uvx",
"args": ["rag-retrieval-mcp[all]"],
"env": {
"OPENAI_API_KEY": "your-openai-api-key",
"PINECONE_API_KEY": "your-pinecone-api-key",
"PINECONE_HOST": "your-pinecone-index-host-url"
}
}
}
}Environment Variables
Variable | Required | Default | Description |
| No |
| Embedding provider to use |
| No |
| Vector store to use |
| No |
| Number of results to return |
| Yes (if using OpenAI) | OpenAI API key | |
| No |
| OpenAI embedding model |
| Yes (if using Pinecone) | Pinecone API key | |
| Yes (if using Pinecone) | Pinecone index host URL | |
| No |
| Metadata field containing text |
| Yes (if using pgvector) | PostgreSQL connection string | |
| No |
| Table name containing vectors |
| No |
| Column containing text content |
| No |
| Column containing embedding vectors |
Adding New Providers
Implement the EmbeddingProvider or VectorStore abstract base class and register it in server.py's factory function. See src/rag_retrieval_mcp/embedding_providers/base.py and src/rag_retrieval_mcp/vector_stores/base.py for the interfaces.
License
Apache License 2.0
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/MaryamZi/rag-retrieval-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server