How do I use roshan-baaz-mcp?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@roshan-baaz-mcp semantic search for Persian documents about technology" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

roshan-baaz-mcp

by dwin-gharibi

Overview Schema Related Servers Score Discussions

Python

Remote

roshan-baaz-mcp

A self-hostable Model Context Protocol server for Roshan AI's Baaz (باز) Persian semantic-search service.

Built from the public API documentation at docs.roshan-ai.ir. Unofficial community integration.

What is this?

Baaz (باز) is Roshan AI's Persian-native semantic search and retrieval service. You give it documents; it chunks them, computes embeddings, and stores them across an Elasticsearch + Weaviate backend. You can then run semantic queries, find similar documents, check document status, fetch full documents, and manage indices — all tuned for Persian (فارسی) content.

roshan-baaz-mcp wraps that HTTP API as MCP tools so Claude — or any MCP-compatible client — can call Baaz as first-class tools.

Baaz is self-hostable, and a single deployment is rarely enough: teams run many independent Baaz instances (per region, per tenant, per environment). This server treats named instances as a core concept — one MCP process can route every tool call to the right Baaz deployment via an optional instance argument.

Highlights

100% endpoint coverage — every documented Baaz endpoint is a tool.
Multi-instance / self-hosting first — route by instance; tokens are never exposed by list_instances.
Guardrails — http(s) URL validation, ~10MB body cap, pagination clamps, and token redaction in every error path.
Offline docs tool — roshan_baaz_docs describes the service and tools without any network call.

Related MCP server: RAG In A Box MCP Server

Tool reference

Every tool accepts an optional instance: str | None selecting which configured Baaz deployment to call (defaults to default_instance).

Tool	Method & endpoint	Purpose
`baaz_index`	`POST /{index}/index`	Index/update a batch of documents (chunked + embedded).
`baaz_semantic_search`	`POST /{index}/document/semantic_query`	Semantic query → ranked documents with matching chunks.
`baaz_document_status`	`POST /{index}/document_status`	Check whether documents (by URL) already exist.
`baaz_delete_index`	`DELETE /{index}/delete_index`	Delete an entire index and all its documents.
`baaz_delete_documents`	`DELETE /{index}/delete_index` (body `{type}`)	Delete only documents of one `type`.
`baaz_stats`	`GET /{index}/stats`	Index statistics keyed by `{index}_{type}`.
`baaz_similar_documents`	`POST /{index}/similar/documents`	Find documents similar to a given `document_id`.
`baaz_view_document`	`GET /{index}/document/view?id=`	Fetch a full document by id.
`healthcheck`	`GET /healthcheck`	Check that an instance is up and ready.
`list_instances`	(local)	List configured instances — names + base URLs only, never tokens.
`roshan_baaz_docs`	(local)	Offline docs about Baaz and these tools.

See the Baaz API guide for full request/response field semantics.

Install

Requires Python 3.10+ (developed and tested on 3.11).

git clone https://github.com/roshan-research/roshan-baaz-mcp.git
cd roshan-baaz-mcp
python -m venv .venv && source .venv/bin/activate
pip install -e .            # add ".[dev]" for the test/lint toolchain

Run it (stdio transport by default):

python -m roshan_baaz_mcp            # or: roshan-baaz-mcp
python -m roshan_baaz_mcp --help     # transports, host/port, log level

Configuration (multi-instance)

Configuration is read from environment variables via pydantic-settings.

Shorthand (single instance)

The fastest way to point at one Baaz deployment. This synthesizes an instance named default:

Variable	Default	Description
`ROSHAN_BAAZ_BASE_URL`	`https://baaz.roshan-ai.ir`	Base URL of the Baaz deployment.
`ROSHAN_BAAZ_TOKEN`	(none)	Token sent as `Authorization: Token <token>`.

export ROSHAN_BAAZ_BASE_URL=https://baaz.roshan-ai.ir
export ROSHAN_BAAZ_TOKEN=your-secret-token

Nested (one or many named instances)

Prefix ROSHAN_BAAZ__, nested delimiter __. Each instance under INSTANCES__<NAME>__:

Variable	Default	Description
`ROSHAN_BAAZ__INSTANCES__<NAME>__BASE_URL`	`https://baaz.roshan-ai.ir`	Base URL of that instance.
`ROSHAN_BAAZ__INSTANCES__<NAME>__TOKEN`	(none)	Auth token for that instance.
`ROSHAN_BAAZ__INSTANCES__<NAME>__VERIFY_SSL`	`true`	Verify the TLS certificate.
`ROSHAN_BAAZ__INSTANCES__<NAME>__TIMEOUT`	`60`	Per-request timeout (seconds).
`ROSHAN_BAAZ__DEFAULT_INSTANCE`	`default`	Instance used when a tool omits `instance`.
`ROSHAN_BAAZ__LOG_LEVEL`	`INFO`	Logging level.

# Two self-hosted regions behind one MCP server
export ROSHAN_BAAZ__INSTANCES__TEHRAN__BASE_URL=https://baaz.tehran.example.ir
export ROSHAN_BAAZ__INSTANCES__TEHRAN__TOKEN=tehran-token
export ROSHAN_BAAZ__INSTANCES__MASHHAD__BASE_URL=https://baaz.mashhad.example.ir
export ROSHAN_BAAZ__INSTANCES__MASHHAD__TOKEN=mashhad-token
export ROSHAN_BAAZ__DEFAULT_INSTANCE=tehran

Tools then target a deployment with instance="mashhad"; omit it to use the default. Call list_instances to discover configured names (tokens are never returned).

Use with an MCP client

Add the server to your MCP client (e.g. Claude Desktop) config:

{
  "mcpServers": {
    "roshan-baaz": {
      "command": "python",
      "args": ["-m", "roshan_baaz_mcp"],
      "env": {
        "ROSHAN_BAAZ_BASE_URL": "https://baaz.roshan-ai.ir",
        "ROSHAN_BAAZ_TOKEN": "your-secret-token"
      }
    }
  }
}

Architecture

The MCP client talks to roshan-baaz-mcp over a transport (stdio by default). The server resolves the requested instance, applies guardrails, and calls the Baaz HTTP API.

Architecture

Request flow

A typical retrieval session: index documents, run a semantic query, read the ranked results, then optionally expand with similar documents.

Request flow

Self-hosting & scaling

Because Baaz is self-hosted, one MCP process can front many Baaz deployments, routing each tool call by its instance argument (region / tenant / environment).

Self-hosting multi-instance

Scaling notes:

Stateless — the server holds no per-request state; run as many replicas as you need behind your scheduler. Configuration comes entirely from the environment.
One process, many instances — add instances by setting more ROSHAN_BAAZ__INSTANCES__<NAME>__* variables; no code changes.
Rate limits — Baaz limits to ~100 requests/minute per deployment; spread heavy load across instances and back off on 429.
Body cap — requests are capped at ~10MB (enforced client-side before the call); batch large indexing jobs accordingly.
Horizontal autoscaling — a Kubernetes HPA manifest and Helm chart ship in deploy/.

Deployment

Ready-to-use manifests live in deploy/:

Docker — Dockerfile and a two-instance docker-compose.yml.
Kubernetes — raw manifests in deploy/kubernetes/ (namespace, configmap, secret, deployment, service, ingress, hpa, kustomization).
Helm — chart in deploy/helm/roshan-baaz-mcp/.
Terraform — example in deploy/terraform/.

See deploy/README.md for details.

Testing

pip install -e ".[dev]"
python -m pytest -q          # all HTTP is mocked with respx; no network
ruff check src tests

Optional live tests run against a real Baaz deployment when you opt in:

export ROSHAN_BAAZ_LIVE=1
export ROSHAN_BAAZ_BASE_URL=https://baaz.roshan-ai.ir
export ROSHAN_BAAZ_TOKEN=your-secret-token
python -m pytest tests/live -q

There is also a no-network smoke test that asserts every tool has a non-empty description and an instance parameter:

python scripts/smoke_test.py

Examples

examples/inspect_server.py prints the registered tools and their schemas without any network call. See examples/README.md.

License

MIT. Baaz (باز) and Roshan AI are products/trademarks of Roshan AI; this is an unofficial, community-maintained integration built from public docs.

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/dwin-gharibi/roshan-baaz-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server