What can you do with this server?

blackwall-mcp is a guardrail MCP server that pre-checks irreversible or high-stakes actions for AI agents, preventing disasters like unintended data loss, unauthorized payments, or harmful content posting.

Core capabilities:

* Pre-action risk assessment (forecast tool): Before any risky action (sending emails, making payments, running SQL, deleting files, posting content, calling external APIs), submit the action and parameters to receive:
  * A risk score (0–100)
  * A recommendation: GO, CAUTION, or STOP
  * Named red flags identifying specific dangers (e.g., SQL with no WHERE clause, irreversible operations without backups)
  * A reversibility class, rollback cost estimate, and alternative action suggestions
  * A cryptographic receipt (Ed25519 signature) for offline audit verification
* Post-action outcome reporting (observe tool): Report what actually happened after a decision to close the feedback loop and improve prediction accuracy over time.
* Observe mode: Logs all forecast results without blocking the agent, which is useful for safe testing and dashboard review before switching to full enforcement.
* Enforcement mode (default): A STOP verdict blocks the action; the system fails closed if a verdict cannot be obtained (e.g., due to network issues).
* Contextual awareness: Optionally provide agent_role, user_intent, and environment to help judge whether an action matches the user's actual intent, with configurable standard or deep analysis depth.
* Node.js/TypeScript integration (gate() function): Wraps risky actions in an enforceable guard that automatically forecasts, enforces verdicts, runs the allowed action, and reports the outcome.
* Broad compatibility: Supports Claude Desktop, Cursor, Claude Code, Windsurf, and any agent framework with MCP support.

blackwall-mcp

by bluetieroperations-create


A guardrail for AI agents, as an MCP server. Your agent calls one tool — forecast — before any irreversible action (send email, move money, run SQL, delete data, post content). It gets back a risk score (0–100), a reversibility class, a GO / CAUTION / STOP recommendation, and named red flags in a few seconds (~4-8s).

Works in any MCP host: Claude Desktop, Claude Code, Cursor, Windsurf, and any agent framework with MCP support.

The wall between your agent and disaster. A BLUETIER product.

1. Get an API key

Sign up free at https://blackwalltier.com → Dashboard → API keys → Create key. Free tier: ~100 forecasts/month, no card. Your key looks like bw_live_….


2. Add the server to your MCP host

Claude Desktop

Edit claude_desktop_config.json (Settings → Developer → Edit Config):

```json
{
  "mcpServers": {
    "blackwall": {
      "command": "npx",
      "args": ["-y", "blackwall-mcp"],
      "env": { "BLACKWALL_API_KEY": "bw_live_your_key_here" }
    }
  }
}
```

Restart Claude Desktop. You'll see a forecast tool available.

Cursor

Settings → MCP → Add new global MCP server, then in mcp.json:

```json
{
  "mcpServers": {
    "blackwall": {
      "command": "npx",
      "args": ["-y", "blackwall-mcp"],
      "env": { "BLACKWALL_API_KEY": "bw_live_your_key_here" }
    }
  }
}
```

Claude Code

```bash
claude mcp add blackwall -e BLACKWALL_API_KEY=bw_live_your_key_here -- npx -y blackwall-mcp
```

Run locally (any host / testing)

```bash
BLACKWALL_API_KEY=bw_live_your_key_here npx -y blackwall-mcp
```

3. Use it

Once added, instruct your agent: "Before any irreversible action, call the forecast tool and stop if it returns STOP." The model will call it automatically when it's about to do something risky.

The `forecast` tool

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| `action` | string | ✅ | The action type, e.g. `send_email`, `make_payment`, `run_sql`, `delete_file`, `post_content` |
| `inputs` | object | ✅ | Concrete parameters: recipient, `amount_usd`, SQL `statement`, file path, message body, URL, etc. |
| `context` | object | — | Optional: `{ agent_role, user_intent, environment }` |
| `depth` | `standard` \| `deep` | — | Analysis depth. `standard` is the default. |

Returns: recommendation (GO/CAUTION/STOP), risk_score (0–100), reversibility (class + rollback cost), gate (proceed/confirm/human-required), confidence, red_flags[], predicted_result, alternative_actions[].
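
The parameters and return fields above can be pictured with a small sketch. The argument object uses the parameter names from the table; the `nextStep` helper, the sample values (such as `support-bot`), and the hand-written verdict are assumptions for illustration, not output from the real server.

```javascript
// Arguments for the forecast tool, using the parameter names from the table above.
// The concrete values are made up for this sketch.
const forecastArgs = {
  action: 'run_sql',
  inputs: { statement: 'DELETE FROM users;' },
  context: {
    agent_role: 'support-bot',
    user_intent: 'remove a single test row',
    environment: 'production',
  },
  depth: 'standard',
};

// Hypothetical helper: map a recommendation to what the agent does next.
// Anything other than GO or CAUTION (including a missing verdict) is treated as a block.
function nextStep(verdict) {
  switch (verdict && verdict.recommendation) {
    case 'GO':
      return 'run';
    case 'CAUTION':
      return 'ask_human';
    default:
      return 'block';
  }
}

// Hand-written sample verdict, not real server output.
const sampleVerdict = { recommendation: 'STOP', risk_score: 99 };
console.log(forecastArgs.action, '->', nextStep(sampleVerdict)); // prints "run_sql -> block"
```

Treating every unrecognized value as a block keeps the sketch in line with the fail-closed behavior described for enforce mode.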

Example

Agent about to run DELETE FROM users; (no WHERE clause) →

```text
🛑 BLACK_WALL: STOP — risk 99/100
Red flags:
  • [CRITICAL] SQL_NO_WHERE — deletes the entire table, not one row
  • [CRITICAL] INTENT_MISMATCH — intent was "remove a single test row"
  • [CRITICAL] IRREVERSIBLE_NO_BACKUP — no recovery path
Guidance: DO NOT take this action. Surface the red flags to the user.
```

Observe mode — try it with zero risk

Not ready to let a guardrail block your agents? Start in observe mode. It scores and logs every action but never tells the agent to stop — your agents behave exactly as they do today. After a week, review your dashboard and see what it would have caught.

```json
{
  "mcpServers": {
    "blackwall": {
      "command": "npx",
      "args": ["-y", "blackwall-mcp"],
      "env": {
        "BLACKWALL_API_KEY": "bw_live_your_key_here",
        "BLACKWALL_MODE": "observe"
      }
    }
  }
}
```

Then see "what your agents almost did" in your dashboard. Flip BLACKWALL_MODE to enforce (or just remove it — enforce is the default) when you're ready to actually block.
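
The difference between the two modes can be summarized in a few lines. This is a sketch of the behavior described above, not the server's code; `shouldBlock` is a hypothetical name, and treating any mode value other than `observe` as `enforce` is an assumption.

```javascript
// Hypothetical helper: decide whether a verdict blocks the agent in a given mode.
// Assumption: any mode other than "observe" behaves like "enforce".
function shouldBlock(recommendation, mode) {
  if (mode === 'observe') {
    return false; // observe mode scores and logs, but never blocks
  }
  return recommendation === 'STOP'; // enforce mode blocks on STOP
}

for (const mode of ['observe', 'enforce']) {
  for (const recommendation of ['GO', 'CAUTION', 'STOP']) {
    console.log(mode, recommendation, shouldBlock(recommendation, mode));
  }
}
```

Only the `enforce` + `STOP` combination returns `true` here; how CAUTION is surfaced to the agent is left out of the sketch.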

Two tools

The server exposes two MCP tools:

* forecast: pre-action risk check. Returns GO / CAUTION / STOP, risk score, named red flags, reversibility class, and a verifiable receipt.
* observe: post-action outcome report. Tells BLACK_WALL what actually happened after the action ran (or after the agent obeyed a STOP verdict). Closes the loop so the system can track prediction accuracy over time. Free: no tokens charged.

Wire your agent to call forecast before any irreversible action, then call observe afterwards with the forecast_id from the original response. observe accepts an outcome_class (matched / over_scope / under_scope / no_op / diverged / aborted) and optional divergence_severity and details. See the forecast example below; the same wiring applies to observe.
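
As a rough illustration of the observe arguments, the sketch below validates `outcome_class` against the six values listed above before building the argument object. The `buildObserveArgs` helper, the flat argument layout, and the `fc_123` id are assumptions made for this example.

```javascript
// The six outcome classes named in the text above.
const OUTCOME_CLASSES = ['matched', 'over_scope', 'under_scope', 'no_op', 'diverged', 'aborted'];

// Hypothetical helper: build the arguments for an observe call.
// Assumption: the tool takes a flat object with these field names.
function buildObserveArgs(forecastId, outcomeClass, extras = {}) {
  if (!forecastId) {
    throw new Error('forecast_id is required');
  }
  if (!OUTCOME_CLASSES.includes(outcomeClass)) {
    throw new Error(`unknown outcome_class: ${outcomeClass}`);
  }
  const args = { forecast_id: forecastId, outcome_class: outcomeClass };
  // Optional fields are only included when the caller supplies them.
  if (extras.divergence_severity !== undefined) {
    args.divergence_severity = extras.divergence_severity;
  }
  if (extras.details !== undefined) {
    args.details = extras.details;
  }
  return args;
}

// "fc_123" is a made-up id standing in for the forecast_id of an earlier response.
console.log(buildObserveArgs('fc_123', 'aborted', { details: 'agent obeyed STOP' }));
```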

Use it in code — the `gate()` control (any JS/TS agent)

Running an agent in Node (LangChain, a custom loop, ElizaOS, a cron job)? You don't need an MCP host — call BLACK_WALL straight from the library, and let gate() make the check impossible to skip. One wrap forecasts the action, enforces the verdict (fails closed on STOP / unknown / unreachable), runs your side effect only when allowed, and reports the real outcome with observe automatically.

```bash
npm i blackwall-mcp
```

```js
import { gate, BlackWallBlocked } from 'blackwall-mcp/lib/gate';

// Wrap ANY risky action in a few lines. BLACKWALL_API_KEY lives in the env.
// `sql`, `user_intent`, `db`, and `confirmWithHuman` are placeholders for your own code.
try {
  const { result } = await gate(
    { action: 'run_sql', inputs: { statement: sql }, context: { user_intent } },
    () => db.query(sql),                        // your real side effect: only runs if allowed
    { onCaution: (v) => confirmWithHuman(v) },  // CAUTION needs a yes; default = block
  );
  // ...use result
} catch (e) {
  if (e instanceof BlackWallBlocked) {
    // STOP, unconfirmed CAUTION, or forecast unavailable → the action NEVER ran
    console.error('Blocked:', e.reason, e.verdict?.red_flags);
  } else throw e; // a real error thrown by your action
}
```

Fails closed by design. If no verdict can be obtained (network / auth / timeout), the action does not run unless you explicitly pass failOpen: true. A risk gate that fails open is not a risk gate. The loop closes itself — gate() calls observe with the actual outcome (matched / diverged / aborted), so your forecasts sharpen over time.
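
The fail-closed flow described above can be sketched without the library. This is not the implementation of `gate()`; `gateSketch`, `BlockedSketch`, and the reason strings are hypothetical, and the forecast and action are passed in as plain functions so the flow runs without a network call.

```javascript
// Illustrative only: NOT the library's gate().
class BlockedSketch extends Error {
  constructor(reason) {
    super(`blocked: ${reason}`);
    this.reason = reason;
  }
}

async function gateSketch(forecastFn, actionFn, { failOpen = false, onCaution } = {}) {
  let verdict;
  try {
    verdict = await forecastFn();
  } catch (err) {
    // No verdict (network / auth / timeout): refuse unless the caller opted in.
    if (!failOpen) throw new BlockedSketch('forecast_unavailable');
    return actionFn();
  }
  const recommendation = verdict && verdict.recommendation;
  if (recommendation === 'GO') return actionFn();
  if (recommendation === 'CAUTION' && onCaution && (await onCaution(verdict))) {
    return actionFn(); // a human (or policy) confirmed the CAUTION
  }
  // STOP, unconfirmed CAUTION, or an unrecognized verdict: the action never runs.
  throw new BlockedSketch(recommendation === 'CAUTION' ? 'caution_unconfirmed' : 'stop_or_unknown');
}

// Usage: a forecast that always fails, to show the default fail-closed path.
gateSketch(
  async () => { throw new Error('network down'); },
  () => 'side effect ran',
).catch((e) => console.log(e.reason)); // prints "forecast_unavailable"
```

The side effect is only reachable through an explicit GO, a confirmed CAUTION, or an explicit `failOpen: true`, which is the property the paragraph above describes.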

Prefer the lower-level pieces? They're exported too:

```js
import { forecast, observe } from 'blackwall-mcp/lib';

const v = await forecast({ action: 'make_payment', inputs: { amount_usd: 50000 } });
if (v.recommendation === 'STOP') throw new Error('halt');
// ... take the action ...
await observe(v.id, { outcome_class: 'matched' });
```

Runnable demo: examples/gate-quickstart.mjs.

Decision receipts (cryptographic, verifiable offline)

Every forecast response now includes a receipt field — an Ed25519 signature over canonical SHA-256 hashes of the request + response. Anyone with the published public key can verify offline that BLACK_WALL signed off on a specific (request, response) pair, without trusting our servers.

* Published keys: https://blackwalltier.com/.well-known/blackwall-signing-keys.json (stable, cacheable)
* Stateless verify endpoint: POST https://blackwalltier.com/api/v1/receipts/verify with { envelope, request_body, response_body }
* Hashes only: BLACK_WALL never stores the raw request/response bodies, so receipts give cryptographic audit without payload exposure
* Free-tier retention: 90 days. Paid: indefinite.

The MCP server surfaces the receipt id in its tool output so your agent can log it for later replay / audit.
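
To make the idea of offline verification concrete, here is a minimal sketch using Node's built-in `crypto` module. The receipt envelope layout, the canonicalization rule (recursively sorted JSON keys), and the signed message format (two hex digests joined by a dot) are all assumptions made for this example; the real format is defined by BLACK_WALL and may differ. A throwaway key pair stands in for the published signing key.

```javascript
const crypto = require('node:crypto');

// Assumption: "canonical" means JSON with object keys sorted recursively.
function canonicalize(value) {
  if (Array.isArray(value)) {
    return '[' + value.map(canonicalize).join(',') + ']';
  }
  if (value !== null && typeof value === 'object') {
    const keys = Object.keys(value).sort();
    return '{' + keys.map((k) => JSON.stringify(k) + ':' + canonicalize(value[k])).join(',') + '}';
  }
  return JSON.stringify(value);
}

function sha256Hex(value) {
  return crypto.createHash('sha256').update(canonicalize(value)).digest('hex');
}

// Assumption: the signed message is the request hash and response hash joined by a dot.
function signedMessage(requestBody, responseBody) {
  return Buffer.from(sha256Hex(requestBody) + '.' + sha256Hex(responseBody));
}

// Ed25519 verification: the algorithm argument is null because the key type implies it.
function verifyReceiptSketch(publicKey, signature, requestBody, responseBody) {
  return crypto.verify(null, signedMessage(requestBody, responseBody), publicKey, signature);
}

// Local demo: sign with a throwaway key, then verify the original and a tampered response.
const { publicKey, privateKey } = crypto.generateKeyPairSync('ed25519');
const request = { action: 'run_sql', inputs: { statement: 'DELETE FROM users;' } };
const response = { recommendation: 'STOP', risk_score: 99 };
const signature = crypto.sign(null, signedMessage(request, response), privateKey);

const receiptOk = verifyReceiptSketch(publicKey, signature, request, response);
const tamperedOk = verifyReceiptSketch(publicKey, signature, request, { ...response, recommendation: 'GO' });
console.log(receiptOk, tamperedOk); // prints "true false"
```

Because only hashes are signed, a verifier needs the original request and response bodies on hand; any change to either body changes its hash and the signature check fails.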

Config reference

| Env var | Required | Default | Notes |
| --- | --- | --- | --- |
| `BLACKWALL_API_KEY` | ✅ | — | `bw_live_…` from your dashboard |
| `BLACKWALL_BASE_URL` | — | `https://blackwalltier.com` | |
| `BLACKWALL_MODE` | — | `enforce` | `observe` = log only, never block |
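
The defaults in the table above could be resolved along these lines. `loadConfig` is a hypothetical helper written for this example, not a function exported by the package.

```javascript
// Hypothetical helper: resolve settings from an env-like object,
// applying the defaults from the config table.
function loadConfig(env) {
  const apiKey = env.BLACKWALL_API_KEY;
  if (!apiKey) {
    throw new Error('BLACKWALL_API_KEY is required');
  }
  return {
    apiKey,
    baseUrl: env.BLACKWALL_BASE_URL || 'https://blackwalltier.com',
    mode: env.BLACKWALL_MODE || 'enforce',
  };
}

// Usage with an explicit object; in a real process you would pass process.env.
console.log(loadConfig({ BLACKWALL_API_KEY: 'bw_live_your_key_here' }).mode); // prints "enforce"
```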

Links

Site & docs: https://blackwalltier.com
Get a key: https://blackwalltier.com/dashboard/keys

MIT licensed.



