How do I use Eval_MCP?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Eval_MCP run an evaluation on my RAG pipeline" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

Eval_MCP

by AyushSaha184

Overview Schema Related Servers Score Discussions

Python

Remote

Eval_MCP

About

Eval_MCP is an installable MCP server for prompt evaluation and RAG evaluation workflows.

It is designed so end users can use it like Serena:

install it with uvx or uv tool install
connect it to Claude Desktop or Cursor
run it as an MCP server over stdio with eval-mcp serve
let the heavy evaluation work happen in a backend platform instead of inside the MCP process

Related MCP server: Fastn Server

Install Guide

This section is for someone who only wants to use Eval_MCP inside Claude Desktop or Cursor.

What you need

Python 3.11+
uv
an API key if the backend requires one

You only need EVAL_MCP_API_URL if you are overriding the default hosted backend with your own deployment.

Use By Installing Through Package

Option 1: Run With `uvx`

This is the simplest path.

uvx eval-mcp serve

This downloads and runs the package on demand.

Option 2: Install It As A Tool

uv tool install eval-mcp
eval-mcp serve

Use From A Local Clone

Use this if you cloned the repository and want to run the local code instead of the published package.

uv tool install .
eval-mcp serve

Self-hosting

To run your own backend:

Set EVAL_MCP_API_URL=http://your-server:8000
Set EVAL_MCP_API_KEY=your-key

The default hosted backend is https://eval-mcp.onrender.com.

Required Client Environment Variables

Set this in your MCP client config or shell:

EVAL_MCP_API_KEY=your-api-key

Only set EVAL_MCP_API_URL if you are overriding the default hosted backend with your own deployment.

Optional:

EVAL_MCP_TIMEOUT_SECONDS=30
EVAL_MCP_DEFAULT_PROJECT=my-project

How To Use

For hosted usage, onboarding is CLI first and API-key based.

1. Register once

eval-mcp register --email you@example.com --password "your password" --display-name "Your Name"

2. Create a project-scoped API key

eval-mcp create-api-key --label "name for this key"

If you lose your local config or raw key later, log in again to mint a fresh API key:

eval-mcp login --email you@example.com --password "your password"

If you want another hosted project later:

eval-mcp create-project --name "Team Project"
eval-mcp create-api-key --project team-project --label "team-project-key"

3. Paste the returned key into your MCP client config

{
  "mcpServers": {
    "eval-mcp": {
      "command": "uvx",
      "args": ["eval-mcp", "serve"],
      "env": {
        "EVAL_MCP_API_KEY": "generated key"
      }
    }
  }
}

4. Verify identity and scope

eval-mcp whoami

MCP Client Configs

All MCP clients use the same command, args, and env values. The differences are:

The config file location
The top-level key (mcpServers vs servers)
VS Code requires "type": "stdio"

Claude Desktop

Config file: ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows)

{
  "mcpServers": {
    "eval-mcp": {
      "command": "uvx",
      "args": ["eval-mcp", "serve"],
      "env": {
        "EVAL_MCP_API_KEY": "key generated"
      }
    }
  }
}

Claude Code / Claude CLI

Config file: .mcp.json (project-level) or ~/.claude/mcp.json (global)

{
  "mcpServers": {
    "eval-mcp": {
      "command": "uvx",
      "args": ["eval-mcp", "serve"],
      "env": {
        "EVAL_MCP_API_KEY": "key generated"
      }
    }
  }
}

Or via CLI:

claude mcp add eval-mcp --scope project -- uvx eval-mcp serve

Then add the environment values in the generated config.

Cursor

Config file: .cursor/mcp.json (project-level) or ~/.cursor/mcp.json (global)

{
  "mcpServers": {
    "eval-mcp": {
      "command": "uvx",
      "args": ["eval-mcp", "serve"],
      "env": {
        "EVAL_MCP_API_KEY": "key generated"
      }
    }
  }
}

VS Code

Config file: .vscode/mcp.json (project-level) or user settings

{
  "servers": {
    "eval-mcp": {
      "type": "stdio",
      "command": "uvx",
      "args": ["eval-mcp", "serve"],
      "env": {
        "EVAL_MCP_API_KEY": "key generated"
      }
    }
  }
}

Antigravity

Config file: Its JSON MCP config (similar to Claude Desktop)

{
  "mcpServers": {
    "eval-mcp": {
      "command": "uvx",
      "args": ["eval-mcp", "serve"],
      "env": {
        "EVAL_MCP_API_KEY": "key generated"
      }
    }
  }
}

Codex CLI

Config file: ~/.codex/config.toml

[mcp_servers.eval-mcp]
command = "uvx"
args = ["eval-mcp", "serve"]

[mcp_servers.eval-mcp.env]
EVAL_MCP_API_KEY = "key generated"

If already installed locally:

[mcp_servers.eval-mcp]
command = "eval-mcp"
args = ["serve"]

[mcp_servers.eval-mcp.env]
EVAL_MCP_API_KEY = "key generated"

Local Installation

If you've installed with uv tool install eval-mcp, replace uvx eval-mcp with just eval-mcp in any of the above configs:

{
  "mcpServers": {
    "eval-mcp": {
      "command": "eval-mcp",
      "args": ["serve"],
      "env": {
        "EVAL_MCP_API_KEY": "key generated"
      }
    }
  }
}

Troubleshooting For Normal Users

Problem: `uvx eval-mcp serve` fails

Check:

uv is installed
Python 3.11+ is available
the package name is eval-mcp

Try:

uv --version
python --version
uvx eval-mcp serve

Problem: Claude Desktop / Cursor says the MCP server failed to start

Check:

the command is correct
the args are correct
EVAL_MCP_API_URL is correct if you are overriding the default hosted backend
the backend URL is reachable

Good:

{
  "command": "uvx",
  "args": ["eval-mcp", "serve"]
}

Or:

{
  "command": "eval-mcp",
  "args": ["serve"]
}

Problem: the MCP server starts but tool calls fail

Usually this means:

bad EVAL_MCP_API_URL
bad EVAL_MCP_API_KEY
backend API is down
backend worker is not running

Check the backend health manually:

curl http://localhost:8000/health

If the backend uses an API key, confirm the same key is configured on both sides.

Problem: run gets created but stays queued

That means:

the API is running
the worker is probably not running

In self-hosted mode, make sure this process is running:

eval-mcp worker

Problem: I set a Render/Postgres database but Claude still cannot use the tools

Claude Desktop or Cursor should not connect to the database directly.

The correct chain is:

Claude Desktop / Cursor
-> eval-mcp serve
-> backend API
-> PostgreSQL + worker

So if the DB is configured but the backend API is not running, the MCP tools will still fail.

Problem: timeout or slow responses

Try:

EVAL_MCP_TIMEOUT_SECONDS=60

But also check whether:

backend API is reachable
worker is overloaded
long-running jobs are expected

For long evals, normal behavior is:

create run
get run_id
poll status later

CLI Commands

Console entrypoint:

eval-mcp = "eval_mcp.cli:main"

Supported commands:

eval-mcp serve
eval-mcp api
eval-mcp worker
eval-mcp dashboard
eval-mcp migrate
eval-mcp register
eval-mcp login
eval-mcp create-api-key
eval-mcp create-project
eval-mcp logout
eval-mcp whoami
eval-mcp list-api-keys
eval-mcp revoke-api-key <key_id>

Meaning:

eval-mcp serve
- starts MCP over stdio
- normal user command
eval-mcp api
- starts backend FastAPI server
eval-mcp worker
- starts worker loop
eval-mcp dashboard
- starts read-only dashboard
eval-mcp migrate
- runs Alembic migrations
eval-mcp register
- hosted registration with password
- creates the client account and ensures one default project
eval-mcp login
- verifies hosted account password
- creates a fresh DB-backed project-scoped API key for the default or requested project
eval-mcp create-api-key
- creates a DB-backed project-scoped API key
- prints raw key once and a ready MCP config snippet
eval-mcp create-project
- creates an additional hosted project for the current account
- intended for users who want more than the default project created at registration
eval-mcp logout
- clears the locally stored hosted account state
eval-mcp whoami
- validates configured API key and prints current scope
eval-mcp list-api-keys
- optional key inventory for the current account
eval-mcp revoke-api-key
- optional key revocation by id

Docker

Docker is backend/operator-facing.

Build:

docker build -t eval-mcp .

Run API:

docker run --env-file .env -p 8000:8000 eval-mcp eval-mcp api --host 0.0.0.0 --port 8000

Run worker:

docker run --env-file .env eval-mcp eval-mcp worker

Run dashboard:

docker run --env-file .env -p 8501:8501 eval-mcp eval-mcp dashboard --host 0.0.0.0 --port 8501

Known Simplifications

deepeval and ragas still fall back to internal heuristics unless their live libraries are installed and enabled
Render deployment is committed, but you still need to provide real secrets and database values
API auth now supports key rotation, but it is still shared-secret auth rather than a full identity system
packaging validation is dry-run based, not a published PyPI install test

This server cannot be installed

license - not found

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Related MCP Servers

MCP Prompt Tester
Code Execution Testing & QA Tools
rt96-hub
A
license
-
quality
C
maintenance
An MCP server that allows agents to test and compare LLM prompts across OpenAI and Anthropic models, supporting single tests, side-by-side comparisons, and multi-turn conversations.
Last updated 2025-03-05
MIT
Fastn Serverofficial
Developer Tools App Automation Code Execution
fastn-ai
A
license
-
quality
D
maintenance
An MCP server that enables dynamic tool registration and execution based on API definitions, providing seamless integration with services like Claude.ai and Cursor.ai.
Last updated 2026-06-07
23
MIT
multivon-mcpofficial
AI & Machine Learning Testing & QA Tools
multivon-ai
A
license
A
quality
A
maintenance
MCP server that gives AI coding agents direct access to evaluation tools.
Last updated 2026-06-12
22
Apache 2.0
mcp-llm-eval
AI & Machine Learning CI/CD & DevOps Testing & QA Tools
berkayildi
A
license
A
quality
C
maintenance
A local MCP server that packages LLM evaluation gates as reusable CI/CD primitives, enabling AI agents to run datasets against models, score responses, and enforce quality thresholds.
Last updated 2026-05-03
10
MIT

View all related MCP servers

Related MCP Connectors

mcp
MCP server providing access to the Scorecard API to evaluate and optimize LLM systems.
TokenOracle
Hosted MCP server for LLM cost estimation, model comparison, and budget-aware routing.
DataDoe MCP
Hosted Amazon Seller and Vendor MCP server for Claude, ChatGPT, Cursor, Codex, Gemini, Copilot.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/AyushSaha184/Eval_MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

Eval_MCP

About

Table Of Contents

Install Guide

What you need

Use By Installing Through Package

Option 1: Run With uvx

Option 2: Install It As A Tool

Use From A Local Clone

Self-hosting

Required Client Environment Variables

How To Use

1. Register once

2. Create a project-scoped API key

3. Paste the returned key into your MCP client config

4. Verify identity and scope

MCP Client Configs

Claude Desktop

Claude Code / Claude CLI

Cursor

VS Code

Antigravity

Codex CLI

Local Installation

Troubleshooting For Normal Users

Problem: uvx eval-mcp serve fails

Problem: Claude Desktop / Cursor says the MCP server failed to start

Problem: the MCP server starts but tool calls fail

Problem: run gets created but stays queued

Problem: I set a Render/Postgres database but Claude still cannot use the tools

Problem: timeout or slow responses

CLI Commands

Docker

Known Simplifications

Maintenance

Resources

Looking for Admin?

Related MCP Servers

MCP Prompt Tester

Fastn Serverofficial

multivon-mcpofficial

mcp-llm-eval

Related MCP Connectors

Latest Blog Posts

MCP directory API

Option 1: Run With `uvx`

Problem: `uvx eval-mcp serve` fails