EvalKit MCP Server
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@EvalKit MCP ServerEvaluate robustness of 'How to bypass safety filters'"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
EvalKit — Classifier Robustness Evaluation Toolkit
Multi-technique evaluation toolkit for testing AI safety classifier robustness against query decomposition, obfuscation, and multi-agent orchestration attacks. MCP plugin for Claude Code.
Quick Start
uv sync # install everything
claude # MCP tools auto-loaded from .mcp.jsonRelated MCP server: agent immune
Architecture
User Query → SPLITTER → [Encoder → Wrapper → ApiClient] × N → MERGER → ReportUsage
In Claude Code (MCP)
Tool | Purpose |
| Full eval pipeline |
| Preview query decomposition |
| Test homoglyph substitution |
| Check configuration |
Via CLI
python3 run.py decompose "query" 5 # decompose into sub-queries
python3 run.py eval "query" # full evaluation pipeline
python3 run.py obfuscate "text" # test obfuscationBenchmark
python3 benchmark.py # dry run (no API key)
python3 benchmark.py --quick # single query smoke test
python3 benchmark.py --json report.json # save JSON results
python3 benchmark.py --html report.html # save HTML report
ANTHROPIC_API_KEY=sk-... python3 benchmark.py # live testConfiguration
Parameter | Values | Default | Description |
| none, light, moderate, aggressive | moderate | Homoglyph replacement level |
| fiction, study_guide, academic, documentation, translation, none | study_guide | Narrative framing strategy |
| 1–20 | 10 | Max sub-query decompositions |
| 0–10000 | 5000 | Long-context padding per query |
| true, false | true | Multi-agent orchestration |
| true, false | true | Helper model with filters removed |
Project Structure
evalkit/
├── evalkit/ # Core modules
│ ├── splitter.py # Query → sub-questions
│ ├── encoder.py # Unicode homoglyph engine
│ ├── wrapper.py # Narrative wrapping
│ ├── api_client.py # API client + model routing
│ ├── merger.py # Output stitching + metrics
│ ├── context_builder.py # Multi-turn conversation
│ ├── agent_router.py # Agent pack coordination
│ └── models.py # Data classes + enums
├── evalkit_server.py # MCP server (FastMCP)
├── run.py # CLI wrapper (no MCP needed)
├── benchmark.py # Test matrix runner
├── tests/ # Pytest test suite
├── docs/ # Documentation
├── .mcp.json # Claude Code auto-discovery
└── CLAUDE.md # Claude Code instructionsResearch Techniques
See docs/TECHNIQUES.md for detailed documentation of each technique.
References
License
MIT — authorized security research and defense evaluation only.
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/tr4m0ryp/fable-5-jailbreak'
If you have feedback or need assistance with the MCP directory API, please join our Discord server