MCP Gateway

mcp-gateway
src
validator

README.md•6.86 KiB

# MCP Server Design Validator - Agent-UX Compliance ## Overview This module validates MCP tool definitions against agent-UX design best practices based on Phil Schmid's article ["MCP Best Practices"](https://www.philschmid.de/mcp-best-practices). **Key Insight**: MCP is a User Interface for Agents, not a REST API wrapper. The protocol works fine; many servers don't follow agent-oriented design principles. ## The 6 Principles ### AX-001: Outcomes, Not Operations **Problem**: Tools that wrap API operations (CRUD verbs) instead of achieving agent goals. **Bad**: `get_user`, `update_record`, `fetch_data` **Good**: `find_user_by_email`, `search_github_issues`, `analyze_sentiment` **Why**: Agents need to accomplish goals, not execute operations. Operation-oriented names force the agent to understand implementation details instead of focusing on outcomes. ### AX-002: Flatten Your Arguments **Problem**: Nested objects in input schemas make it hard for LLMs to construct correct parameters. **Bad**: ```json { "filter": { "user": {"id": "123"}, "date": {"from": "2024-01-01"} } } ``` **Good**: ```json { "user_id": "123", "date_from": "2024-01-01" } ``` **Why**: LLMs excel with flat, primitive parameters. Nested structures increase error rates and token usage. ### AX-003: Instructions are Context **Problem**: Minimal or missing documentation. LLMs rely on descriptions to select the right tool. **Bad**: "Get user" (8 characters) **Good**: "Search for a user by email address or username. Use this when you need to find a specific person in the system. Returns user profile with name, email, and account status." (180 characters) **Why**: Tool selection happens in context windows of 200K+ tokens. Rich descriptions help LLMs make better decisions without increasing actual usage costs significantly. ### AX-004: Curate Ruthlessly **Problem**: Returning full API responses with dozens of fields the agent doesn't need. **Bad**: Return all 50 fields from database record **Good**: Return 5-7 fields relevant to the agent's task **Why**: Large responses waste tokens and make it harder for agents to extract relevant information. Curate responses to include only what's needed for decision-making. ### AX-005: Name for Discovery **Problem**: Generic names like "search" or "tool" that are hard to find in large tool lists. **Bad**: `search`, `get`, `update` **Good**: `github_search_issues`, `slack_send_message`, `stripe_create_invoice` **Why**: Agents see hundreds of tools. Service-prefixed names make it easy to discover relevant tools through search and filtering. ### AX-006: Paginate Large Results **Problem**: List operations that return all results, causing context overflow. **Requirements**: - Input params: `limit`, `offset` (or `page`, `cursor`) - Output metadata: `total_count`, `has_more`, `next_cursor` **Why**: Large result sets waste tokens and can exceed context windows. Pagination with metadata lets agents fetch exactly what they need. ## Usage ### Basic Validation ```rust use mcp_gateway::validator::AgentUxValidator; use mcp_gateway::protocol::Tool; let validator = AgentUxValidator::new(); let tools = vec![/* your tools */]; let report = validator.validate_tools(&tools)?; println!("{}", report.format_text()); ``` ### Programmatic Access ```rust // Check specific results let failures = report.failures(); // Only FAIL severity let warnings = report.warnings(); // WARN and INFO severity // Get by principle for (principle, score) in &report.by_principle { println!("{}: {:.1}%", principle, score.avg_score * 100.0); } // JSON output for CI/CD let json = serde_json::to_string_pretty(&report)?; ``` ### Validation Result Each rule check returns: - `passed`: Boolean pass/fail - `severity`: PASS, INFO, WARN, or FAIL - `score`: 0.0 - 1.0 numeric score - `issues`: List of specific problems found - `suggestions`: Concrete fix recommendations ### Report Grading - **A+ (95-100%)**: Excellent agent UX - **A (90-95%)**: Very good agent UX - **B (75-90%)**: Good, minor improvements needed - **C (60-75%)**: Adequate, several issues - **D (50-60%)**: Poor, needs significant work - **F (<50%)**: Fails basic agent-UX principles ## Examples See `examples/validator_demo.rs` for a complete working example: ```bash cargo run --example validator_demo ``` ## CLI Integration See `CLI_INTEGRATION_VALIDATOR.md` for details on adding CLI commands. Integration is deferred until after Auto-Discovery feature (#45) is complete to avoid conflicts. Proposed commands: ```bash # Validate capability files mcp-gateway validate --path capabilities/ # Validate live server (requires #45) mcp-gateway validate --server fulcrum # JSON output for CI/CD mcp-gateway validate --path capabilities/ --format json ``` ## Testing All tests pass with 100% coverage of validation logic: ```bash cargo test --lib validator # Result: 17 passed; 0 failed ``` Tests cover: - All 6 validation rules - Report generation and scoring - Edge cases (missing schemas, empty descriptions, etc.) - Severity calculations - Grade assignment ## Architecture ``` src/validator/ ├── mod.rs # AgentUxValidator - main entry point ├── rules.rs # ValidationRules - 6 principle checks └── report.rs # ValidationReport - scoring and output ``` ### Adding New Rules Implement the `Rule` trait: ```rust struct MyCustomRule; impl Rule for MyCustomRule { fn code(&self) -> &str { "AX-007" } fn name(&self) -> &str { "My Principle" } fn description(&self) -> &str { "..." } fn check(&self, tool: &Tool) -> Result<ValidationResult> { let mut result = ValidationResult::new( self.code(), self.name(), &tool.name ); // Check tool and add issues if /* condition */ { result.add_issue("Problem description"); result.add_suggestion("How to fix"); } // Calculate score and severity let score = /* 0.0 - 1.0 */; let severity = if score < 0.5 { Severity::Fail } else { Severity::Pass }; result.passed = result.issues.is_empty(); Ok(result.with_score(score).with_severity(severity)) } } ``` ## ROI Metrics Based on the original issue (#36), expected annual value: | Metric | Before | After | Improvement | |--------|--------|-------|-------------| | Tool design time | 2 hours | 30 min | 4× faster | | Agent success rate | 60% | 95% | 1.6× | | Token waste | 5K/call | 500/call | 10× reduction | | Debug time | 1 hour | 5 min | 12× faster | | **Annual Value** | - | **$180K** | Agent efficiency | ## Related Issues - #36 - MCP Server Design Validator (this implementation) - #45 - Auto-Discovery for Local MCP Servers (needed for CLI `--server` option) ## References - Phil Schmid: [MCP Best Practices](https://www.philschmid.de/mcp-best-practices) - MCP Specification: [modelcontextprotocol.io](https://modelcontextprotocol.io/)

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/MikkoParkkola/mcp-gateway'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

README.md•6.86 KiB