RecoSearch
OfficialProvides governed read-only SQL querying on DuckDB databases with per-row provenance and citation tracking.
Provides governed read-only access to MongoDB document stores with per-row provenance and citation tracking.
Provides governed read-only search and querying on OpenSearch text indices with per-row provenance and citation tracking.
Provides governed read-only SQL querying on Snowflake data warehouses with per-row provenance and citation tracking.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@RecoSearchWhich product category drove the most delivered order revenue?"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
RecoSearch
Let an LLM answer questions over your real data — and prove every number it returns.
RecoSearch is a deterministic, governed MCP server. An MCP client (Claude, Claude Code, any MCP-compatible app) plans a question; RecoSearch decides whether and how to run it, executes only contract-allowed, read-only queries, and returns either an answer carrying full provenance — every row cited back to its source and the exact contract version that produced it — or a typed refusal.
It is not an agent. There is no model-driven decision loop in the server. All reasoning lives in the external MCP client; RecoSearch only governs execution.
Two governed paths, one server
RecoSearch exposes two coherent capabilities under one MCP server, one package, and one cite-or-refuse philosophy:
Governed multi-source query (breadth). Read-only, contract-governed access across six source types — structured (DuckDB, Postgres, Snowflake), text (OpenSearch), vector (Qdrant), and document (MongoDB) — with RBAC, field masking, a read-only SQL guard, in-process federation across sources, and a per-row citation on every result.
Certified semantic layers (depth, DuckDB). A certified metric kernel (definition-hash, fan-out guard against double-counting, freshness SLA), an L2 context/trust layer, and L3 ontology claim validation. An experimental L4/L5 decision + calibration loop is available behind a flag (off by default).
Honest scope. The depth semantic layers run on a local DuckDB source. The breadth path is the one that spans all six source types. Both share the same contract, envelope, and citation model.
Related MCP server: Semantic BI MCP
Every row is citable — and the client enforces it
Every row a data tool returns carries a _citation (source, contract version, fields, query hash,
and whether it may back a final answer). RecoSearch also ships validate_cited_evidence_packet, a
closure-graph validator that refuses a packet whose claims don't resolve to evidence actually returned
this session from the source that defines the metric, pinned to the current contract.
The validator is a tool the client calls — the server provides the validator and the provenance; the MCP client invokes the gate before presenting an answer. The server does not silently auto-validate behind your back. See the worked example for the full question → tool calls → cited answer flow.
How you author it
Three files you author are the source of truth for the breadth path — no business logic in Python:
File | Role |
| Connection authority — where each source lives and how to reach it |
| Business meaning — metrics, rules, dimensions, measures, relations in plain language |
| Scenario identity and optional governance (RBAC, field masking, vocabulary) |
The depth semantic layers are authored as their own kernel YAML (see
docs/design and the bundled recosearch/semantic_layers/semantic/).
Quickstart — zero infrastructure, 2 minutes
The bundled NovaShop example is a single DuckDB file (products, customers, orders). No server, no credentials:
pip install -e ".[duckdb]"
# Build the sample database (deterministic), point the server at the example, compile + check.
python examples/novashop-duckdb/seed.py
export RECOSEARCH_SEMANTIC_DIR=examples/novashop-duckdb
recosearch --write-semantic-json
recosearch --validate
recosearch --health-check # → status "ok" with nothing else running
# Start the MCP server (connect it from Claude Desktop / Claude Code)
recosearchThen ask your MCP client "Which product category drove the most delivered order revenue?" and watch it call the governed tools and return a cited answer.
Install
pip install -e ".[duckdb]" # minimal: the zero-infra quickstart
pip install -e ".[all,dev]" # all adapters + ontology + testsExtras: duckdb, postgres, opensearch, qdrant, snowflake, mongodb, ontology,
observability. The depth ontology layer needs the ontology extra (rdflib, pyshacl).
CLI
Command | Description |
| Start the MCP server |
| Compile |
| Validate declared inputs; non-zero exit on errors |
| Probe all declared sources |
| Verify |
Environment variables
Variable | Default | Description |
|
| Directory holding the active scenario's input files |
|
|
|
| — | RBAC role applied to all requests ( |
|
| Enables the experimental L4/L5 decision/calibration tools |
|
| Export MCP tool spans to Phoenix (needs the |
Limitations (what v0.1 does and does not do)
The depth semantic layers run on DuckDB; the multi-source breadth path is the one spanning all six source types.
Federation is an in-process slice-join over pre-fetched rows, not atomic cross-source SQL.
RBAC/masking are opt-in and default to off (no
RECOSEARCH_ROLE→ no role gating). The read-only SQL guard and contract gating are always on.The experimental decision/calibration tools keep their ledger per-process (the audit trail is not persisted across restart). They are off by default behind
RECOSEARCH_EXPERIMENTAL.The read-only guard blocks mutating SQL, undeclared tables/columns, no-FROM/constant selects, and a denylist of server-side functions (file read/write, sleep, dblink). Run the DB connection under a least-privilege, read-only role as the backstop.
See docs/usage/limitations.md for the full list.
Documentation
Running tests
pip install -e ".[duckdb,postgres,mongodb,ontology,dev]"
recosearch --write-semantic-json # compile first
pytest -q tests/unit tests/smoke tests/integration -m "not live" # breadth gateway
pytest -q tests/semantic_layers -m "not live" # depth semantic layersLicense
Apache-2.0 — see LICENSE. Built by Recohut. Found a security issue? See SECURITY.md.
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/recohut/recosearch'
If you have feedback or need assistance with the MCP directory API, please join our Discord server