review-mcp

by io.github.HectorGuerilla

Server Details

Multi-model code review: a panel of models + detectors return a pass/fail verdict. Paid via x402.

Status: Healthy
Last Tested: 2026-07-25 12:40
Transport: Streamable HTTP
URL

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client

Glama

MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A4.6/5.0

Tool DescriptionsA

Average 4.8/5 across 1 of 1 tools scored.

Server CoherenceA

Disambiguation5/5

Only one tool exists, so there is no possibility of ambiguity between tools. The tool's purpose is clearly defined.

Naming Consistency5/5

With a single tool, naming is inherently consistent. The name 'review_code' follows a clear verb_noun pattern.

Tool Count3/5

The server has only one tool, which is borderline for a service that appears to offer a complex, multi-model code review. However, it's plausible that all functionality is encapsulated in one tool.

Completeness4/5

The single tool covers multiple review scenarios (diff, module, spec+implementation) and returns structured results. Minor gaps could include lack of separate configuration tools, but core review functionality appears complete.

Available Tools

1 tool

review_codeAInspect

Adversarial multi-model code review. Submit a diff, a module, or a spec+implementation and get back a structured pass/fail verdict with each issue's type, severity, location, explanation, and suggested fix.

Why call this instead of reviewing your own output: a single model shares its blind spots with itself. This routes your code through a panel of different models plus a set of deterministic detectors, catching what self-review misses — path/contract violations, module incoherence (dangling imports, broken cross-references), syntax and call-arity regressions in a diff's post-image, and 'prose instead of tool calls' (output that describes an action rather than emitting it). The panel adds semantic judgment on top and never overrides a deterministic finding.

Call it before shipping or merging, as a second opinion on a risky change, or as a gate in an autonomous build loop. Choose depth='fast' (one model, low latency) or 'deep' (full panel, higher recall). Deep review audits files of any size in milestone chunks so every panel model contributes; the price (shown in the 402) and the payment window scale with file size. Per-call limit ~1,600 lines — larger inputs return 413, so split by file/module and call once per file. Paid per call via x402 (USDC on Base); the price is announced in the 402 response before any charge.

ParametersJSON Schema

Name	Required	Description
`depth`	No	"fast" = single-model, low latency; "deep" = full panel, higher recall (default "deep").
`context`	No	optional spec/intent, related interfaces, constraints, or what the change should do.
`payload`	Yes	a unified diff, a complete module/file, or a spec plus its implementation (include enough context lines for diffs).
`language`	Yes	primary language of the payload (e.g. "typescript", "python").

Tool Definition Quality

A4.8/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description fully discloses behavioral traits: uses multiple models and deterministic detectors, never overrides deterministic findings, depth options affect recall, line limit, payment via x402, and what types of issues it catches (path/contract violations, etc.).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured, starting with a concise summary, followed by rationale for use, then usage details. It is somewhat lengthy but every sentence adds value. It is front-loaded with the most critical information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema and no annotations, the description covers input formats, output structure (pass/fail verdict with issue details), limitations (line limit, split advice), depth options, and pricing. This is sufficient for an agent to use the tool correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%. The description adds valuable context beyond the schema: explains 'payload' can be diff, module, or spec+implementation; 'depth' options are elaborated; 'context' is optional spec/intent. This enhances understanding beyond the schema descriptions alone.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Adversarial multi-model code review.' It specifies inputs (diff, module, spec+implementation) and outputs (structured pass/fail verdict with issue details). It also distinguishes from self-review by explaining why multiple models catch more issues.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly tells when to use: 'Call it before shipping or merging, as a second opinion on a risky change, or as a gate in an autonomous build loop.' It also provides limitations like the ~1,600 line limit and suggests splitting inputs. Depth options and pricing are also covered.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Resources

Need Help?