Eval_MCP
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Eval_MCPrun an evaluation on my RAG pipeline"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Eval_MCP
About
Eval_MCP is an installable MCP server for prompt evaluation and RAG evaluation workflows.
It is designed so end users can use it like Serena:
install it with
uvxoruv tool installconnect it to Claude Desktop or Cursor
run it as an MCP server over stdio with
eval-mcp servelet the heavy evaluation work happen in a backend platform instead of inside the MCP process
Table Of Contents
Install Guide
This section is for someone who only wants to use Eval_MCP inside Claude Desktop or Cursor.
What you need
Python 3.11+
uvan API key if the backend requires one
You only need EVAL_MCP_API_URL if you are overriding the default hosted backend with your own deployment.
Use By Installing Through Package
Option 1: Run With uvx
This is the simplest path.
uvx eval-mcp serveThis downloads and runs the package on demand.
Option 2: Install It As A Tool
uv tool install eval-mcp
eval-mcp serveUse From A Local Clone
Use this if you cloned the repository and want to run the local code instead of the published package.
uv tool install .
eval-mcp serveSelf-hosting
To run your own backend:
Set
EVAL_MCP_API_URL=http://your-server:8000Set
EVAL_MCP_API_KEY=your-key
The default hosted backend is https://eval-mcp.onrender.com.
Required Client Environment Variables
Set this in your MCP client config or shell:
EVAL_MCP_API_KEY=your-api-keyOnly set EVAL_MCP_API_URL if you are overriding the default hosted backend with your own deployment.
Optional:
EVAL_MCP_TIMEOUT_SECONDS=30
EVAL_MCP_DEFAULT_PROJECT=my-project
How To Use
For hosted usage, onboarding is CLI first and API-key based.
1. Register once
eval-mcp register --email you@example.com --password "your password" --display-name "Your Name"2. Create a project-scoped API key
eval-mcp create-api-key --label "name for this key"If you lose your local config or raw key later, log in again to mint a fresh API key:
eval-mcp login --email you@example.com --password "your password"If you want another hosted project later:
eval-mcp create-project --name "Team Project"
eval-mcp create-api-key --project team-project --label "team-project-key"3. Paste the returned key into your MCP client config
{
"mcpServers": {
"eval-mcp": {
"command": "uvx",
"args": ["eval-mcp", "serve"],
"env": {
"EVAL_MCP_API_KEY": "generated key"
}
}
}
}4. Verify identity and scope
eval-mcp whoamiMCP Client Configs
All MCP clients use the same command, args, and env values. The differences are:
The config file location
The top-level key (
mcpServersvsservers)VS Code requires
"type": "stdio"
Claude Desktop
Config file: ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows)
{
"mcpServers": {
"eval-mcp": {
"command": "uvx",
"args": ["eval-mcp", "serve"],
"env": {
"EVAL_MCP_API_KEY": "key generated"
}
}
}
}Claude Code / Claude CLI
Config file: .mcp.json (project-level) or ~/.claude/mcp.json (global)
{
"mcpServers": {
"eval-mcp": {
"command": "uvx",
"args": ["eval-mcp", "serve"],
"env": {
"EVAL_MCP_API_KEY": "key generated"
}
}
}
}Or via CLI:
claude mcp add eval-mcp --scope project -- uvx eval-mcp serveThen add the environment values in the generated config.
Cursor
Config file: .cursor/mcp.json (project-level) or ~/.cursor/mcp.json (global)
{
"mcpServers": {
"eval-mcp": {
"command": "uvx",
"args": ["eval-mcp", "serve"],
"env": {
"EVAL_MCP_API_KEY": "key generated"
}
}
}
}VS Code
Config file: .vscode/mcp.json (project-level) or user settings
{
"servers": {
"eval-mcp": {
"type": "stdio",
"command": "uvx",
"args": ["eval-mcp", "serve"],
"env": {
"EVAL_MCP_API_KEY": "key generated"
}
}
}
}Antigravity
Config file: Its JSON MCP config (similar to Claude Desktop)
{
"mcpServers": {
"eval-mcp": {
"command": "uvx",
"args": ["eval-mcp", "serve"],
"env": {
"EVAL_MCP_API_KEY": "key generated"
}
}
}
}Codex CLI
Config file: ~/.codex/config.toml
[mcp_servers.eval-mcp]
command = "uvx"
args = ["eval-mcp", "serve"]
[mcp_servers.eval-mcp.env]
EVAL_MCP_API_KEY = "key generated"If already installed locally:
[mcp_servers.eval-mcp]
command = "eval-mcp"
args = ["serve"]
[mcp_servers.eval-mcp.env]
EVAL_MCP_API_KEY = "key generated"Local Installation
If you've installed with uv tool install eval-mcp, replace uvx eval-mcp with just eval-mcp in any of the above configs:
{
"mcpServers": {
"eval-mcp": {
"command": "eval-mcp",
"args": ["serve"],
"env": {
"EVAL_MCP_API_KEY": "key generated"
}
}
}
}Troubleshooting For Normal Users
Problem: uvx eval-mcp serve fails
Check:
uvis installedPython 3.11+ is available
the package name is
eval-mcp
Try:
uv --version
python --version
uvx eval-mcp serveProblem: Claude Desktop / Cursor says the MCP server failed to start
Check:
the command is correct
the args are correct
EVAL_MCP_API_URLis correct if you are overriding the default hosted backendthe backend URL is reachable
Good:
{
"command": "uvx",
"args": ["eval-mcp", "serve"]
}Or:
{
"command": "eval-mcp",
"args": ["serve"]
}Problem: the MCP server starts but tool calls fail
Usually this means:
bad
EVAL_MCP_API_URLbad
EVAL_MCP_API_KEYbackend API is down
backend worker is not running
Check the backend health manually:
curl http://localhost:8000/healthIf the backend uses an API key, confirm the same key is configured on both sides.
Problem: run gets created but stays queued
That means:
the API is running
the worker is probably not running
In self-hosted mode, make sure this process is running:
eval-mcp workerProblem: I set a Render/Postgres database but Claude still cannot use the tools
Claude Desktop or Cursor should not connect to the database directly.
The correct chain is:
Claude Desktop / Cursor
-> eval-mcp serve
-> backend API
-> PostgreSQL + workerSo if the DB is configured but the backend API is not running, the MCP tools will still fail.
Problem: timeout or slow responses
Try:
EVAL_MCP_TIMEOUT_SECONDS=60But also check whether:
backend API is reachable
worker is overloaded
long-running jobs are expected
For long evals, normal behavior is:
create run
get
run_idpoll status later
CLI Commands
Console entrypoint:
eval-mcp = "eval_mcp.cli:main"Supported commands:
eval-mcp serve
eval-mcp api
eval-mcp worker
eval-mcp dashboard
eval-mcp migrate
eval-mcp register
eval-mcp login
eval-mcp create-api-key
eval-mcp create-project
eval-mcp logout
eval-mcp whoami
eval-mcp list-api-keys
eval-mcp revoke-api-key <key_id>Meaning:
eval-mcp servestarts MCP over stdio
normal user command
eval-mcp apistarts backend FastAPI server
eval-mcp workerstarts worker loop
eval-mcp dashboardstarts read-only dashboard
eval-mcp migrateruns Alembic migrations
eval-mcp registerhosted registration with password
creates the client account and ensures one default project
eval-mcp loginverifies hosted account password
creates a fresh DB-backed project-scoped API key for the default or requested project
eval-mcp create-api-keycreates a DB-backed project-scoped API key
prints raw key once and a ready MCP config snippet
eval-mcp create-projectcreates an additional hosted project for the current account
intended for users who want more than the default project created at registration
eval-mcp logoutclears the locally stored hosted account state
eval-mcp whoamivalidates configured API key and prints current scope
eval-mcp list-api-keysoptional key inventory for the current account
eval-mcp revoke-api-keyoptional key revocation by id
Docker
Docker is backend/operator-facing.
Build:
docker build -t eval-mcp .Run API:
docker run --env-file .env -p 8000:8000 eval-mcp eval-mcp api --host 0.0.0.0 --port 8000Run worker:
docker run --env-file .env eval-mcp eval-mcp workerRun dashboard:
docker run --env-file .env -p 8501:8501 eval-mcp eval-mcp dashboard --host 0.0.0.0 --port 8501Known Simplifications
deepevalandragasstill fall back to internal heuristics unless their live libraries are installed and enabledRender deployment is committed, but you still need to provide real secrets and database values
API auth now supports key rotation, but it is still shared-secret auth rather than a full identity system
packaging validation is dry-run based, not a published PyPI install test
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/AyushSaha184/Eval_MCP'
If you have feedback or need assistance with the MCP directory API, please join our Discord server