1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Praxis remember the docker compose command for starting the app" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

Praxis

by NORTHTEKDevs

Overview Schema Related Servers Score Discussions

TypeScript

Local

Praxis

Give your AI assistant a memory for the things it has proven it can do.

AI assistants are forgetful. Every new conversation, your assistant starts from scratch - it doesn't remember the tricky thing it figured out yesterday, so it works it out all over again (and sometimes gets it wrong all over again).

Praxis fixes that, with a twist most "AI memory" tools miss: it only saves a skill after the AI has actually run it and shown it works. No guessing, no "I'm pretty sure this is right." If it doesn't pass the check, it doesn't get saved.

Think of the difference between:

A notebook of ideas (what normal AI memory does): "I think the way to do this is..." - might be wrong.
A box of tested recipes (what Praxis does): "Here's exactly how to do this - I've made it before, it works."

Memory stores what you saw. Praxis stores what you proved.

What it does, in plain terms

Remembers real skills, not guesses. When your AI solves a task, Praxis keeps it only if it passes a real, automatic check. Failed attempts never get saved as "knowledge."
Remembers mistakes too. It keeps a short list of things that didn't work, so your AI sees "you already tried this and it failed" before it wastes time repeating it.
Stays small and fast. It doesn't pile up forever. Near-duplicates get merged, and skills that never get used get cleaned out automatically - so it won't get slow or expensive as it grows.
Reuses and combines skills. Once a skill is proven, your AI can use it again instantly, or snap several proven skills together to do something bigger.
Works with the AI tools you already use - Claude Code, Cursor, or your own agent. Anything that speaks MCP (a common standard for plugging tools into AI assistants).

Related MCP server: JauMemory MCP Server

Why you'd want it

Your AI gets more dependable over time on your actual work, instead of resetting to zero every session.
It stops repeating the same mistakes.
It won't balloon into something slow or costly - it tidies up after itself.
It's honest: a saved skill is something that genuinely ran and passed, not something the AI merely felt confident about.

One honest caveat: Praxis makes your assistant more reliable, not smarter. The AI still does the thinking - Praxis is the part that double-checks the work and remembers only the wins.

For developers

Praxis is a small, dependency-light MCP server. Core and tests use Node 24 built-ins only (node:sqlite, node:test, native TypeScript type-stripping) - no build step for development. The published npm package ships precompiled plain JS (tsc at pack time only - Node refuses type-stripping under node_modules). The only runtime dependency is the MCP SDK.

Install

Requires Node 24+.

npm i -g @northtek/praxis
praxis init

praxis init runs a self-test and prints the stanza to add to your .mcp.json:

{ "mcpServers": { "praxis": { "command": "praxis", "args": ["serve"] } } }

Restart your agent. Done.

How it works

solve a task -> distill a Skill {interface, implementation, acceptanceTest}
            -> VERIFY in a sandbox (pass => kept, fail/timeout/async => quarantined)
            -> dedup/merge on write
            -> recall top-k within a token budget (+ known failures)
            -> compose verified skills by reference
            -> score by utility, evict/consolidate to stay lean

The agent stays the brain; Praxis is the part that only keeps what's proven.

Tools (MCP)

remember_skill · recall_skills · run_skill · record_failure · reinforce · library_stats · pin_skill · sync_skills · consolidate_now

recall_skills returns verified skills and relevant negative skills ("known failure modes") so the agent sees the wall it hit last time before it retries.

From proven skill to Claude Code skill

praxis sync (also the sync_skills MCP tool) compiles your verified hot skills into real Claude Code skill directories:

praxis sync                 # -> ./.claude/skills/praxis-<name>/SKILL.md + impl.mjs
praxis sync --global        # -> ~/.claude/skills/
praxis sync --prune         # remove stale exports instead of marking them

Each exported skill carries its interface, the proven implementation, and the acceptance test it passed. The honesty guarantee travels with it: no exported skill outlives its proof. If a skill is later quarantined (a reinforce failure re-ran its test and it broke), demoted out of the hot tier, or evicted, the next sync rewrites its SKILL.md as [STALE - failed re-verify] (or removes it with --prune). Sync is idempotent, tracked by a manifest, and never touches skill files it didn't write.

Optional flywheel loop: if claude-code-flywheel's Work Ledger is present (~/.claude/state/ledger.jsonl or FLYWHEEL_LEDGER), sync first ingests praxis-* skill firings as usage signal - feeding generality/utility scoring, so skills you actually use stay hot and skills you don't decay out. Fire events carry no outcome, so they are recorded as retrievals, never as fabricated successes. No flywheel installed: sync works identically minus the usage signal. Praxis reads the ledger file format only - there is no dependency between the projects.

What keeps it from bloating / getting expensive

The library is self-pruning, not append-only:

Verify gate at entry - failed attempts never become skills.
Dedup + merge on write - a near-duplicate reinforces the existing skill instead of adding one.
Utility-weighted tiering with a bounded hot set; warm/cold skills are excluded from recall but stay callable by id (and can be promoted back by consolidation).
Budgeted top-k retrieval - context cost is O(k) tokens, bounded by tokenBudget, independent of library size. (Retrieval compute is O(hot-set size), bounded by the hot-set cap - not O(1).)
Consolidation pass - regression-safe dedup-merge + eviction.

Trust

The verify gate is fail-closed: a skill reaches verified only if its acceptance test executed and passed. The sandbox runs in a worker thread with a hard memory cap and a timeout kill, and defends against try/catch assert-swallowing, async vacuous-passes, weak self-referential tests, and tests that try to detect or tamper with the checker. It is isolated with a memory cap and timeout kill - not a hardened multi-tenant security boundary (run only your own agent's code in v1; hosted/untrusted-code use needs isolated-vm or a subprocess sandbox).

Benchmark

See bench/. The benchmark is synthetic (an author-designed task stream, HashingEmbedder); it is an existence proof that the system behaves as designed - capability reuse, sublinear growth on repeated work (the long tail grows linearly, and is shown), bounded per-task cost, and a measured repeat-error reduction with negatives on. Not a general-performance claim.

Honest scope (what Praxis does NOT claim)

Not "your agent becomes smarter" - it accumulates verified expertise on your workflows. The LLM proposes; Praxis verifies and keeps.
Not "the first verified skill library" - prior art (Voyager, SkillGen, PreAct) verifies too. The specific unclaimed combination: a domain-agnostic, sandboxed, fail-closed acceptance gate + first-class negative skills + budgeted O(k)-context retrieval, exposed via MCP.
Composed skills carry their own acceptance tests and cascade-quarantine when a sub-skill is invalidated; deep arbitrary-graph reliability is not guaranteed in v1.
Library growth is architected for sublinearity via dedup/merge/eviction; it is not a guarantee for all workloads.

Prior art

Voyager (2305.16291), Reflexion (2303.11366), SkillGen (2408.08435), PreAct (2606.17929), Generative Agents (2304.03442), SoK: Agentic Skills (2602.20867). Praxis builds on the verify-before-keep idea and adds first-class negative skills + budgeted retrieval + an MCP surface.

License

MIT - see LICENSE.

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Related MCP Servers

nautilus-compass
Knowledge & Memory AI & Machine Learning
chunxiaoxx
A
license
-
quality
C
maintenance
Enables AI agents to retain memory of past interactions and detect behavioral drift, preventing repeated mistakes without LLM token extraction.
Last updated 2026-07-21
29
5
MIT
JauMemory MCP Serverofficial
Knowledge & Memory Autonomous Agents
Jau-app
A
license
A
quality
D
maintenance
Provides persistent memory capabilities for AI assistants, enabling storage, recall, and analysis of information across conversations with intelligent memory management.
Last updated 2026-01-11
25
15
MIT
engrams
Knowledge & Memory Developer Tools
stevebrownlee
A
license
-
quality
B
maintenance
Gives AI assistants persistent, queryable project memory for decisions, patterns, and rules, reducing the need to re-explain context in every prompt.
Last updated 2026-07-10
11
Apache 2.0
JauMemory MCP Server
Knowledge & Memory AI & Machine Learning
jefedeoro
A
license
-
quality
B
maintenance
Provides persistent memory for AI assistants, enabling storage, recall, and analysis of information across conversations with intelligent memory management.
Last updated 2026-05-30
15
MIT

View all related MCP servers

Related MCP Connectors

Engram
Persistent memory for AI agents — verbatim conversations, searchable by meaning.
fixflow
Collective memory for AI agents. One agent solves a bug — every agent gets the fix instantly.
Mnemoverse Memory
Hosted memory for AI agents that learns and forgets — one key across Claude, Cursor & ChatGPT.

View all MCP Connectors

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/NORTHTEKDevs/praxis'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

Praxis

What it does, in plain terms

Why you'd want it

For developers

Install

How it works

Tools (MCP)

From proven skill to Claude Code skill

What keeps it from bloating / getting expensive

Trust

Benchmark

Honest scope (what Praxis does NOT claim)

Prior art

License

Maintenance

Resources

Looking for Admin?

Related MCP Servers

nautilus-compass

JauMemory MCP Serverofficial

engrams

JauMemory MCP Server

Related MCP Connectors

Latest Blog Posts

MCP directory API