What can you do with this server?

The ThoughtProof MCP server verifies AI-generated reasoning and claims using adversarial multi-model consensus (Grok, Gemini, DeepSeek, Sonnet), helping you decide whether to trust and act on AI outputs. * Verify claims and reasoning: Submit any decision or reasoning claim via verify_claim / verify_reasoning to receive a verdict (ALLOW, HOLD, UNCERTAIN, or DISSENT), a confidence score, and up to 3 key objections explaining why a claim may be challenged * Domain-specific verification: Tailor analysis to financial, medical, legal, code, or general contexts for more accurate assessments * Risk-adjusted analysis: Set a stake level (low, medium, high, critical) to adjust confidence thresholds based on decision consequence * Adjust verification depth: Choose between fast (2 models), standard (4 models), or deep (5+ models) to balance speed and cost ($0.008–$0.08 per verification) * Check agent trust scores: Use check_agent_score to look up composite trust scores for specific agents, optionally filtered by domain * Guard against hallucinations: Use verdicts and objections to validate AI outputs before acting on them

Which integrations are available for this server?

Allows looking up agent trust scores on the ERC-8004 registry, an Ethereum-based autonomous agent registry.

thoughtproof-mcp

Official

by ThoughtProof

Overview Schema Related Servers Score Discussions

JavaScript

Remote

thoughtproof-mcp

npm version License: MIT

MCP server for ThoughtProof — verify AI reasoning with adversarial multi-model consensus.

3–4 LLMs (Grok, Gemini, DeepSeek, Sonnet) independently evaluate every claim. A dedicated red-team model critiques their verdicts. A synthesizer (Sonnet) weighs everything and returns ALLOW, BLOCK, or UNCERTAIN with confidence score and objections.

Quick Start

{
  "mcpServers": {
    "thoughtproof": {
      "command": "npx",
      "args": ["-y", "thoughtproof-mcp"],
      "env": {
        "THOUGHTPROOF_API_KEY": "tp_op_your_key_here"
      }
    }
  }
}

Works with Claude Desktop, Cursor, Windsurf, Cline, and any MCP-compatible client.

Related MCP server: happy-thoughts

Tools

`verify_claim`

Verify any claim or AI-generated reasoning before acting on it.

Parameter	Type	Default	Description
`claim`	string	(required)	The text to verify
`stakeLevel`	`low` / `medium` / `high` / `critical`	`medium`	Risk level — higher stakes trigger deeper verification
`domain`	`financial` / `medical` / `legal` / `code` / `general`	`general`	Domain context for specialized verification
`speed`	`fast` / `standard` / `deep`	`standard`	Verification depth

`check_agent_score`

Look up an agent's composite trust score on the ERC-8004 registry.

Parameter	Type	Description
`agentId`	string	Agent ID to look up
`domain`	string	Optional domain filter

Example

In Claude Desktop or Cursor, just ask:

"Verify the claim: GPT-5 achieves 95% accuracy on MMLU-Pro"

The tool returns:

⚠️ UNCERTAIN (42% confidence)

Claim: "GPT-5 achieves 95% accuracy on MMLU-Pro"

Objections:
- Insufficient public benchmark data to confirm
- Historical accuracy claims have been overstated
- MMLU-Pro methodology has known ceiling effects

⚡ 3.2s | Adversarial Multi-Model Consensus

How It Works

Your AI Agent
    │
    ▼
┌──────────────────┐
│  thoughtproof-mcp │  ← MCP Server (this package)
└──────────────────┘
    │
    ▼
┌──────────────────┐
│  ThoughtProof API │  ← api.thoughtproof.ai (RV)
└──────────────────┘
    │
    ▼
┌───────────────────────────────────────────┐
│  Stage 1: Independent Evaluation       │
│  3–4 LLMs (Grok, Gemini, DeepSeek,     │
│  Sonnet) each examine the claim         │
│                                         │
│  Stage 2: Red-Team Critique             │
│  1 dedicated model challenges all       │
│  initial verdicts                        │
│                                         │
│  Stage 3: Synthesis                     │
│  Sonnet weighs verdicts + critique      │
│  → final decision                       │
└───────────────────────────────────────────┘
    │
    ▼
  ALLOW / BLOCK / UNCERTAIN
  + confidence % + objections

Pricing

Speed	Models	Cost per verification
fast	2	$0.008
standard	4	$0.02
deep	5+	$0.08

Payment: API key (operator account) or x402 micropayment (USDC on Base).

API Key

Get an operator API key at thoughtproof.ai. Without a key, verifications use x402 micropayments automatically.

Configuration

Environment Variable	Default	Description
`THOUGHTPROOF_API_KEY`	(none)	Operator API key
`THOUGHTPROOF_BASE_URL`	`https://api.thoughtproof.ai`	API base URL

Development

git clone https://github.com/ThoughtProof/thoughtproof-mcp.git
cd thoughtproof-mcp
npm install
npm run build
npm test
npm run dev          # Run with tsx (hot reload)
npm run inspect      # Test with MCP Inspector

ThoughtProof — Decision verification for AI agents
pot-cli — CLI for reasoning verification
ERC-8004 — Autonomous Agent Registry

License

MIT — ThoughtProof

Install Server

license - permissive license

quality

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

Need Help?

Related Servers

Tools

verify_reasoningB

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/ThoughtProof/thoughtproof-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server