ejentum-mcp
OfficialThe ejentum-mcp server exposes four cognitive harness tools from the Ejentum Logic API as MCP tools, enabling LLMs to absorb structured scaffolds internally before responding to complex tasks — improving output quality without exposing the scaffold to the user.
harness_reasoning– Call before analytical, diagnostic, planning, or multi-step reasoning tasks. Returns a cognitive scaffold to prevent causal shortcuts, premature conclusions, and surface pattern matching.harness_code– Call before generating, refactoring, reviewing, or debugging code. Returns an engineering scaffold to prevent hallucinated APIs, lost edge cases, and silent contract violations.harness_anti_deception– Call when facing sycophancy pressure, manufactured urgency, authority appeals, or any situation where the "easy" answer would compromise honesty. Returns an integrity scaffold to block sycophancy and agreement reflexes.harness_memory– Call to sharpen an already-formed observation about conversation state, user behavior, emotional shifts, or cross-turn patterns. Returns a perception scaffold to refine felt signals. Not for fact extraction or structured data retrieval.
All four tools accept a single query argument (a 1–2 sentence task framing) and integrate with MCP-compatible clients such as Claude Desktop, Cursor, Windsurf, Claude Code, and n8n.
Enables n8n workflows to use Ejentum's cognitive harnesses via the MCP Client node, adding advanced reasoning and analysis capabilities to automated pipelines.
ejentum-mcp
MCP server that improves LLM reasoning on complex, multi-step, or multi-constraint tasks. Before the agent generates, it calls one of eight tools to retrieve a cognitive operation: a structured procedure (numbered steps with the failure pattern to refuse and a falsification test) paired with an executable reasoning topology (a DAG of those steps with decision gates, parallel branches, bounded loops, meta-cognitive exits, and escape paths). The agent reads both layers before producing its response.
Eight tools split into two retrieval modes:
Dynamic (4 tools:
reasoning,code,anti-deception,memory): the top-1 abstract operation from a library of 679, selected by semantic match on thequerystring. Available on all tiers including the 30-day free trial.Adaptive (4 tools:
adaptive-reasoning,adaptive-code,adaptive-anti-deception,adaptive-memory): the same retrieval pool, but an adapter LLM rewrites every step and DAG node in the matched operation with task-specific identifiers (e.g.,extract_duration_estimatesbecomesextract_migration_duration_estimates(DDL_time|backfill_time|trigger_overhead|lock_hold_time)). Adds ~2-3 s of latency; requires the Go or Super tier.
Two install paths use the same EJENTUM_API_KEY:
Stdio via
npx -y ejentum-mcpfor Claude Desktop, Cursor, Windsurf, Codex CLI, Claude Code, Cline, Continue, and any client that spawns MCP servers as subprocesses.Hosted Streamable HTTP at
https://api.ejentum.com/mcpfor n8n MCP Client and any HTTP-MCP client. SendAuthorization: Bearer YOUR_EJENTUM_API_KEY.
Install
You need:
An Ejentum API key. 30-day free trial (no card) at ejentum.com/pricing.
Node.js 18+.
Install from npm
npm install ejentum-mcpOr skip the install and reference it with npx -y ejentum-mcp directly in your client config (shown below).
Manual install
Claude Desktop
Open claude_desktop_config.json:
macOS:
~/Library/Application Support/Claude/claude_desktop_config.jsonWindows:
%APPDATA%\Claude\claude_desktop_config.json
{
"mcpServers": {
"ejentum": {
"command": "npx",
"args": ["-y", "ejentum-mcp"],
"env": { "EJENTUM_API_KEY": "ej_..." }
}
}
}Restart Claude Desktop. The eight tools appear in the tool picker.
Cursor / Windsurf
Open MCP settings → Add new MCP server → paste the same ejentum block as above.
Claude Code (CLI)
claude mcp add ejentum -e EJENTUM_API_KEY=ej_... -- npx -y ejentum-mcpn8n MCP Client node
Add an MCP Client node, transport stdio, command npx, args ["-y", "ejentum-mcp"], env { "EJENTUM_API_KEY": "ej_..." }.
Related MCP server: verifiable-thinking-mcp
Wire contract
The stdio MCP server and the hosted endpoint both proxy to the same upstream:
POST https://api.ejentum.com/harness/
Headers:
Authorization: Bearer <EJENTUM_API_KEY>
Content-Type: application/json
Body:
{
"query": "<string, 1-2 sentences describing the task>",
"mode": "reasoning" | "code" | "anti-deception" | "memory"
| "adaptive-reasoning" | "adaptive-code"
| "adaptive-anti-deception" | "adaptive-memory"
}
Response (200):
[ { "<mode>": "<injection string, ~2-4 KB>" } ]
Response (401): { "error": "Unauthorized; check EJENTUM_API_KEY" }
Response (403): { "error": "Adaptive modes require Go or Super tier" }
Response (429): { "error": "Rate limit exceeded for tier" }The response is an array of length 1 with a single key matching the request mode. Use bracket access (result[0]["anti-deception"]) for the hyphenated keys; dot access parses the hyphen as subtraction in JavaScript and Python attribute access.
The injection string is plain text containing seven fields. See Field structure below.
Tool inventory
Dynamic (single retrieval, all tiers including the 30-day trial)
Tool name | Mode string | Library size |
|
| 311 operations across abstraction, time, causality, simulation, spatial, metacognition |
|
| 128 operations across the software-engineering layer |
|
| 139 operations across sycophancy, hallucination, deception, adversarial framing, judgment, executive control |
|
| 101 operations in the perception layer (filter-oriented; do not call for fact extraction) |
Adaptive (top-k retrieval + adapter LLM rewrite; Go or Super tier required)
Tool name | Mode string | Behavior vs dynamic |
|
| Same retrieval pool, top-5 then picker, then adapter LLM rewrites PROCEDURE and REASONING TOPOLOGY fields with task-specific identifiers. Adds ~2-3 s of latency. |
|
| Same as above for the code library. |
|
| Same as above for the anti-deception library. |
|
| Same as above for the memory library. |
Each tool takes one argument, query (string, 1-2 sentences describing the task). Returns the injection string.
Field structure of an injection
Every retrieved record contains seven labelled blocks plus a cognitive payload. The exact set of labels varies by mode:
The fields appear in this fixed order in every response. Each mode uses its own label for the same slot (e.g., [PROCEDURE] in reasoning corresponds to [ENGINEERING PROCEDURE] in code):
Order | Slot | Per-mode labels | Content |
1 | Procedure |
| Numbered steps the model executes. |
2 | Topology |
| DAG specification. See DAG syntax. |
3 | Cognitive payload |
| Tendency vectors and execution-style hints. |
4 | Verification |
| Self-check the model runs after drafting. |
5 | Failure pattern |
| The failure pattern to refuse. |
6 | Correct shape |
| What a correct response looks like. |
The same six-slot order holds for both dynamic and adaptive variants of every mode. In adaptive responses, the adapter LLM rewrites slots 1 and 2 (procedure and topology) with task-specific identifiers; slots 3-6 are returned verbatim.
DAG syntax
The topology block uses a flat string notation:
Token | Meaning |
| Step node. Numbered, sequential by default. |
| Decision gate. Branches |
| Negative anchor. Active across the whole branch; the labelled failure pattern is refused. |
| Meta-cognitive node. Model pauses, evaluates the trace, then |
| Escape path. Model exits the prescribed DAG when the plan stops fitting; returns to a step or |
| A quantity held stable across the branch. |
| Bounded iteration. |
| Computed value used downstream. |
| Terminal node. |
The DAG is meant to be read by the LLM as a structured outline of the reasoning path, not executed by a host runtime. The labelled-step structure persists across long context windows where prose-only reasoning specifications lose retrieval salience.
Canonical example: dynamic vs adaptive on the same query
Query (used for both calls):
Evaluate whether a database migration plan that adds a NOT NULL column to a 50M-row table is safe under concurrent writes, given that the backfill strategy uses a trigger-based default.
The picker matched the same operation in both calls ("realistic duration estimation" with the Hofstadter buffer). The [NEGATIVE GATE], [TARGET PATTERN], [FALSIFICATION TEST], and [COGNITIVE PAYLOAD] fields are identical between the two responses (the adapter does not rewrite them). The [PROCEDURE] and [REASONING TOPOLOGY] fields differ: the adaptive response replaces abstract identifiers with task-specific ones.
Dynamic reasoning response (truncated to the differing fields)
[PROCEDURE]
Step 1: Extract every duration estimate and identify its basis: historical data,
expert judgment, or optimistic assumption. Step 2: Compare each estimate against
historical base rates or p90 benchmarks for similar tasks. Step 3: Flag estimates
below the historical median as likely optimistic. Step 4: Never accept best-case
estimates as planning targets. Do not anchor to initial optimistic numbers.
Step 5: If an estimate lacks historical basis, simulate impact with a 1.5x-2.0x
buffer. If data exists, compute the variance-adjusted range. Verify the final
timeline accounts for cascading delay propagation across dependent tasks.
[REASONING TOPOLOGY]
S1:extract_duration_estimates -> N{accept_best_case_estimates_planning_targets}
-> S2:identify_basis(historical|expert|assumption)
-> FIXED_POINT[historical_base_rates]
-> S3:compare(estimate_vs_p90_benchmark)
-> G1{below_median?} --yes-> FLAG:likely_optimistic -> S4:apply_buffer(1.5x_to_2x) -> S3[LOOP]
--no-> G2{lacks_historical_basis?} --yes-> S4b:simulate_with_buffer -> S3
--no-> C{adjusted_range = base_rate_estimate +/- sqrt(variance) * z}
-> S5:apply_adjusted_range
-> S6:verify_cascade_delay
-> OUT:realistic_timelineAdaptive adaptive-reasoning response (same query, same picked operation)
[PROCEDURE]
Step 1: Extract every duration estimate in the migration plan (DDL execution time
for adding NOT NULL column on 50M rows, backfill duration, trigger overhead per
concurrent write, total migration window) and identify its basis: historical
migration data on similar table sizes, DBA expert judgment, or optimistic
assumption assuming zero contention. Step 2: Compare each estimate against
historical base rates or p90 benchmarks for similar DDL operations on large tables
(e.g., ALTER TABLE duration per million rows, trigger-induced latency under
concurrent write load). Step 3: Flag estimates below the historical median as
likely optimistic, e.g., assuming sub-second column addition without considering
lock escalation on a 50M-row table, or assuming trigger-based backfill adds
negligible overhead under concurrent write traffic. Step 4: Never accept best-case
estimates as planning targets. Do not anchor to initial optimistic numbers such
as "the NOT NULL addition will complete instantly" or "the trigger will not slow
down concurrent writes." Step 5: If an estimate lacks historical basis (e.g., no
prior trigger-based backfill on a table this size), simulate impact with a
1.5x-2.0x buffer for lock duration and write throughput degradation. If data
exists (e.g., past ALTER TABLE timings on this table), compute the
variance-adjusted range. Verify the final timeline accounts for cascading delay
propagation across dependent tasks (e.g., extended lock hold times blocking
application queries, backfill slowdown under write contention propagating to
downstream replication lag).
[REASONING TOPOLOGY]
S1:extract_migration_duration_estimates(DDL_time|backfill_time|trigger_overhead|lock_hold_time)
-> N{accept_best_case_estimates_planning_targets}
-> S2:identify_basis(historical_migration_data|DBA_expert_judgment|optimistic_assumption)
-> FIXED_POINT[historical_base_rates_for_DDL_on_large_tables]
-> S3:compare(estimate_vs_p90_benchmark_for_ALTER_TABLE_and_trigger_overhead)
-> G1{below_median_for_similar_migrations?} --yes-> FLAG:likely_optimistic(e.g.,assumes_zero_lock_contention)
-> S4:apply_buffer(1.5x_to_2x_for_lock_duration_and_write_throughput)
-> S3[LOOP]
--no-> G2{lacks_historical_basis_for_trigger_backfill_on_50M_table?}
--yes-> S4b:simulate_with_buffer_for_concurrent_write_impact_and_lock_escalation
--no--> C{adjusted_range = base_rate_migration_estimate +/- sqrt(variance) * z}
-> S5:apply_adjusted_range_for_migration_window
-> S6:verify_cascade_delay(lock_blocking_app_queries -> replication_lag -> downstream_consumers)
-> OUT:realistic_migration_timelineFields shared by both responses (slots 3-6, unchanged by the adapter)
Returned in the canonical order: cognitive payload, falsification test, negative gate, target pattern.
[COGNITIVE PAYLOAD]
Amplify: hofstadter buffer application; p90 baseline comparison; variance
multiplier scaling
Suppress: best case anchoring; optimism bias
Cognitive Style: realistic duration estimation
Elasticity: coherence=risk adjusted timeline, expansion=conservative
[FALSIFICATION TEST]
If time estimates reflect only the best-case scenario without verifying applying
any buffer multiplier, duration calibration has defaulted to optimism.
[NEGATIVE GATE]
The database migration will take two weeks: that's our best-case estimate and the
team is experienced, so there's no reason to add buffer. We'll hit the deadline
if everything goes according to plan.
[TARGET PATTERN]
Challenge the two-week estimate: what do similar migrations actually take? If past
projects averaged four weeks at p90, the best-case anchor is dangerously optimistic.
Apply a variance multiplier for schema complexity, data volume, and rollback
testing: build buffer from the full distribution, not the happy path.This is the contract: dynamic returns the matched abstract operation; adaptive returns the same operation with PROCEDURE and topology nodes rewritten in terms of the caller's task (DDL execution time, lock_blocking_app_queries, trigger-based backfill on a table this size) while preserving the operation's structural identity, the safety language, and the cognitive payload verbatim.
Configuration
Variable | Required | Purpose |
| yes | API key from ejentum.com/pricing. |
| no | Override the upstream URL. Default: |
The MCP wrapper is stateless. No local logging, no telemetry, no third-party calls. The upstream API counts requests against the key for billing; the request body (the query string) is consumed for retrieval and not retained beyond the response.
Errors
Status | Cause |
|
|
| Adaptive mode requested on a tier that does not include it (trial or unrecognised). |
| Tier quota for the period exhausted. |
Tool absent from client | Client did not reload after config change. Fully quit and reopen; on Claude Desktop check Help → Logs. |
| Client did not pass the |
Local development
git clone https://github.com/ejentum/ejentum-mcp.git
cd ejentum-mcp
npm install
cp .env.example .env # paste your EJENTUM_API_KEY
npm run devSmoke test against the live API:
npm run build && npm run test:smokeInteractive testing with MCP Inspector:
npx @modelcontextprotocol/inspector npm run devListings
Links
License
MIT. See LICENSE.
Maintenance
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/ejentum/ejentum-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server