Qurio MCP Server
Integrates with Google Gemini's API to generate embeddings for document chunks, enabling hybrid search that combines keyword and vector similarity for high-recall retrieval.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Qurio MCP Serversearch my knowledge base for Go error handling patterns"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
π About
Qurio is a self-hosted, open-source ingestion and retrieval engine that functions as a local Shared Library for AI coding assistants (like Gemini-CLI, Claude Code, Cursor, Windsurf, or custom scripts).
Unlike cloud-based RAG solutions that introduce latency and privacy risks, Qurio runs locally to ingest your handpicked heterogeneous documentation (web crawls, PDFs, Markdown) and serves it directly to your agents via the Model Context Protocol (MCP). This ensures your AI writes better code faster using only the context you trust.
Qurio features a custom structural chunker that respects code blocks, API definitions, and config files, preserving full code blocks and syntaxes.
Why Qurio?
Privacy First: Your data stays on your machine (
localhost).Precision: Retrieves grounded "truth" to prevent AI hallucinations.
Speed: Deploys in minutes with
docker-compose.Open Standards: Built on MCP, Weaviate, and PostgreSQL.
β¨ Key Features
π Universal Ingestion: Crawl documentation sites or upload files (PDF, DOCX, MD).
π§ Hybrid Search: Configurable BM25 keyword search with Vector embeddings for high-recall retrieval.
π― Configurable Reranking: Integrate Jina AI or Cohere for precision tuning.
π Native MCP Support: Exposes a standard JSON-RPC 2.0 endpoint for seamless integration with AI coding assistants.
πΈοΈ Smart Crawling: Recursive web crawling with depth control, regex exclusions, respect robot.txt, sitemap and
llms.txtllms-full.txtsupport.π OCR Pipeline: Automatically extracts text from scanned PDFs and images via Docling.
π₯οΈ Admin Dashboard: Manage sources, view ingestion status, and debug queries via a clean Vue.js interface.
ποΈ Architecture
Qurio is built as a set of microservices orchestrated by Docker Compose:
Backend (Go): Core orchestration, API, and MCP server.
Frontend (Vue.js): User interface for managing sources and settings.
Ingestion Worker (Python): Async ingestion engine handling crawling (
crawl4ai) and parsing (docling).Vector Store (Weaviate): Stores embeddings and handles hybrid search.
Database (PostgreSQL): Stores metadata, job status, and configuration.
Queue (NSQ): Manages asynchronous ingestion tasks.
π Getting Started
Prerequisites
Docker and Docker Compose
A Google Gemini API Key (for embeddings)
Installation
Clone the repository:
git clone https://github.com/irahardianto/qurio.git cd qurioConfigure Environment: Copy the example environment file and add your API key.
cp .env.example .envStart the System:
docker-compose up -dWait a minute for all services (Weaviate, Postgres) to initialize.
Access the Dashboard: Open http://localhost:3000 in your browser.
Add API Keys: Access http://localhost:3000/settings page in the dashboard, and add your Gemini and JinaAI/Cohoere(optional) API Keys
Configuration
Configuration is managed via the Settings page in the UI or environment variables.
Variable | Description | Default |
| Key for Google Gemini (Embeddings) | Required |
|
|
|
| API Key for selected provider | - |
| Hybrid search balance (0.0=Keyword, 1.0=Vector) |
|
| Max results to return |
|
π‘ Usage
Unlock the full potential of your Agent
Check out the Agent Prompting Guide for best practices, workflow examples, and system prompt templates (CLAUDE.md, GEMINI.md) to paste into your project.
1. Add Data Sources
Navigate to the Admin Dashboard (http://localhost:3000) and click "Add Source".
Web Crawl: Enter a documentation URL (e.g.,
https://docs.docker.com). Configure depth and exclusion patterns.File Upload: Drag and drop PDFs or Markdown files.
2. Connect Your AI Agent (MCP)
Configure your MCP-enabled editor (like Cursor/Gemini CLI) to connect to Qurio.
Add the following to your MCP settings:
{
"mcpServers": {
"qurio": {
"httpUrl": "http://localhost:8081/mcp"
}
}
}Note: Qurio uses a stateless, streamable HTTP transport at http://localhost:8081/mcp. Use a client that supports native HTTP MCP connections.
3. Query
Ask your AI agent a question. It will now have access to the documentation you indexed!
"How do I configure a healthcheck in Docker Compose?"
4. Available Tools
Once connected, your agent will have access to the following tools:
Tool | Description |
| Search your knowledge base. Supports hybrid search (keywords + vectors). Use this to find relevant documentation or code examples. |
| List all available data sources. Useful to see what documentation is currently indexed. |
| List pages within a source. Helpful for exploring the structure of a documentation site. |
| Read a full page. Retrieves the complete content of a specific document or web page found via search or listing. |
5. Roadmap
Rework crawler & embedder parallelization
Migrate to Streamable HTTP
Supports multiple different models beyond Gemini
Supports more granular i.e. section by section page retrieval
π License
This project is licensed under the MIT License - see the LICENSE file for details.
This server cannot be installed
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/irahardianto/qurio'
If you have feedback or need assistance with the MCP directory API, please join our Discord server