sumo-qa
Integrates with JetBrains IDEs to bring QA capabilities such as test planning, TDD, mutation testing, and code review to the development workflow.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@sumo-qaCreate a test plan for the user authentication module"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
sumo-qa MCP
An MCP server that brings senior-QA discipline to AI coding assistants โ test planning, TDD, mutation testing, code review.
sumo-qa is an advisor, not an oracle. Like any AI tool it can be wrong. Your judgment and your team's standards are the final word.
๐ New here? 5-minute demo โ
One install line, one prompt on your repo, the workflow runs on real code.
Why it exists
Ask a stock AI assistant to QA a change and you get the junior answer: "add unit tests, consider edge cases, maybe test performance." That's a checklist.
sumo-qa makes the agent work the way a senior QA does:
Names 3โ7 risks tied to specific files and lines, not categories
Picks one design technique per risk from an ISTQB-grounded catalogue (boundary-value, decision-table, property-based, mutation)
Runs your test suite fresh in the current turn before any "safe to merge" claim
Holds TDD's red phase before any production code is written
Keeps production code locked while strengthening tests against mutation survivors
Won't ship a plan without measurable entry and exit criteria
The discipline lives in a library of skill files, each followed literally โ every one has an Iron Law and a HARD-GATE callout the LLM can't talk past. Skills route automatically from natural-language prompts; you don't need to remember to invoke them.
Related MCP server: dev-loop-mcp
Install
pip install sumo-qa && sumo-qa-installsumo-qa-install with no flag configures every host it detects. Target a single host with --claude-code, --vscode --workspace <path-to-repo>, or --jetbrains.
On Windows PowerShell, use (&& isn't a valid separator in Windows PowerShell, and pip's script directory is often off PATH, so use the module form):
py -m pip install sumo-qa; if ($?) { py -m sumo_qa.installer }If sumo-qa-install isn't on your PATH (e.g. pip install --user without ~/.local/bin exported), use the PATH-proof module form: python -m pip install sumo-qa && python -m sumo_qa.installer.
Prefer one command that installs, configures, and verifies in a single shot? From a clone of this repo, run ./install.sh (macOS/Linux) or .\install.ps1 (Windows) โ thin wrappers that route to the same pip install + python -m sumo_qa.installer + sumo-qa-doctor steps, with --update, --doctor, per-host --host, and an ownership-aware --uninstall flag. CI-verified on Linux/macOS/Windows. Details: docs/INSTALL.md#one-command-wrapper-installsh--installps1.
Restart your host or open a fresh chat afterwards.
Plugin install from a local clone (Claude Code, session-scoped)
Prefer the plugin experience over pip? Clone the repo and pass --plugin-dir to claude on each invocation. This loads the .claude-plugin/plugin.json manifest directly โ no pip install needed, skills + hooks + MCP server come from this checkout.
Prerequisite: uv on PATH (Astral's package runner โ one-line install, no Python prerequisite). Skip if uv --version already resolves:
# macOS / Linux
curl -LsSf https://astral.sh/uv/install.sh | sh
# Windows PowerShell
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
# Homebrew
brew install uvThen clone and launch:
git clone https://github.com/sumithr/sumo-qa.git
claude --plugin-dir /path/to/sumo-qaSession-scoped: every claude invocation needs the flag โ plain claude (no flag) starts a session with no sumo-qa loaded. Use /reload-plugins inside the session to pick up edits without restarting.
Persistent marketplace install (one-time setup, no flag on every launch): add this repo as a Claude Code plugin marketplace and install the plugin โ
/plugin marketplace add sumithr/sumo-qa /plugin install sumo-qa@sumo-qaThe marketplace catalog (
.claude-plugin/marketplace.json) is generated from the canonical source and schema-validated in CI. This flow is wired but the livemarketplace addโinstallround-trip has not yet been verified end-to-end in a Claude Code session โ until that is confirmed, thepip install sumo-qa && sumo-qa-installpath above remains the recommended persistent install. Full architecture + dev-iteration detail: docs/INSTALL.md#plugin-format-install-claude-code--codex.
Something not working?
# pip-install path (after `pip install sumo-qa`)
sumo-qa-doctor # or `python -m sumo_qa.doctor` if not on PATH
# plugin-install path (no pip install required) โ inside a Claude Code session
!sumo-qa-doctor # the plugin ships bin/sumo-qa-doctor on PATH
# plugin-install path from outside Claude Code
uvx --from /path/to/plugin/source sumo-qa-doctorRead-only setup diagnostics โ checks Python + sumo-qa version, install mode, the MCP initialize + tools/list handshake, and every host config the installer touches (Claude Code, Claude Desktop, VS Code workspace, JetBrains detection, Codex plugin). Each failure prints the exact Fix: command. --json for machine output. Details: docs/INSTALL.md#diagnosing-setup-with-sumo-qa-doctor.
Per-host flags, schema differences, and troubleshooting: docs/INSTALL.md. Want to install from a local clone โ to try an unreleased branch or run with your team's standards / knowledge packs editable in place? See docs/INSTALL.md#install-from-a-local-clone. For the pip path use python scripts/dev_install.py; for the Claude Code plugin path use claude --plugin-dir /path/to/sumo-qa (Anthropic's documented local-dev mode).
Verify it's wired
In any host, ask:
load the QA classifications
You should get the canonical change-classification names back. If you do, you're wired.
Run it from the terminal
Beyond the host integration, sumo-qa ships memorable commands for the QA-native repo loop:
sumo-qa analyze # map the current repo into .sumo-qa/repo-map.json
sumo-qa status # is the map present, current, and fresh? what next?analyze [path] writes the schema-validated .sumo-qa/repo-map.json artifact and prints a concise summary; status [path] reports the artifact's presence, schema version, freshness against HEAD, and the next command to run. Both take --json for automation. (Bare sumo-qa launches the MCP server for hosts; sumo-qa-doctor runs setup diagnostics.)
Update
pip install --upgrade sumo-qa && sumo-qa-installRestart the host. The SessionStart hook re-injects the latest content; skills and knowledge refresh from the upgraded package.
What's included
%%{init: {'theme':'base', 'themeVariables': {
'fontFamily':'Charter, "Iowan Old Style", Georgia, serif',
'fontSize':'15px',
'primaryTextColor':'#1B1B1B',
'lineColor':'#1B1B1B'
}}}%%
flowchart LR
LLM{{"Host LLM"}}
subgraph Inputs ["sumo-qa content"]
direction TB
Knowledge[("Knowledge")]
Standards[("Standards")]
end
Skills["<b>Skills</b>"]
Output(["Output"])
Knowledge -- cited by --> Skills
Standards -- cited by --> Skills
LLM == follows ==> Skills
Skills == produces ==> Output
classDef host fill:#7A1F1F,stroke:#1B1B1B,stroke-width:2px,color:#FAF7F2
classDef skills fill:#FAF7F2,stroke:#1B1B1B,stroke-width:2.5px,color:#1B1B1B
classDef data fill:#F0EAE0,stroke:#8A7B5C,stroke-width:1.5px,color:#1B1B1B
classDef out fill:#E8EDDF,stroke:#3F4A2E,stroke-width:2px,color:#1B1B1B
classDef group fill:none,stroke:#8A7B5C,stroke-width:1px,color:#5C4D00,stroke-dasharray: 4 4
class LLM host
class Skills skills
class Knowledge,Standards data
class Output out
class Inputs group
linkStyle 0,1 stroke:#8A7B5C,stroke-width:1.2px,stroke-dasharray:5 4
linkStyle 2,3 stroke:#1B1B1B,stroke-width:2.5pxLayer | What |
Skills ( | Iron-Law procedures across the QA lifecycle: deciding approach, preparing for work, TDD scaffolding, diff review, strengthening tests, finding test data, answering testing questions, repo strategy โ plus the planning โ parallel subagent execution โ finishing chain. |
MCP entry points | A thin tool surface โ skill tools, knowledge loaders, a capabilities-discovery tool, repo-map tools, test-data tools, an ingestion tool, and external-skill lifecycle tools. Each is file IO or small deterministic logic; no inference. |
Progressive skill loading | A read-only loader ( |
Knowledge catalogues ( | Classifications, approaches, principles, techniques. The agent picks from these instead of recalling from training data. Editable as plain markdown. Specialty-tool picks are deliberately not catalogued โ the discipline is observe the risk surface, web-search current options for the user's stack, cite when naming a tool. |
Host support
Every host calls the same MCP server and reads the same SKILL.md files. What differs is how each host exposes them โ that's a host-API difference, not a sumo-qa choice.
These hosts are verified end-to-end with sumo-qa-install:
Host | Slash | Setup |
Claude Code |
|
|
VS Code + Copilot (Agent mode, Claude Sonnet 4.5 or equivalent) | Natural language |
|
JetBrains AI Assistant |
| One-time UI setup; |
JetBrains Junie | Natural language | Drop the JSON |
In Claude Code, type / then sumo-qa- to see the skills as hyphenated entries (symlinked into ~/.claude/skills/). The same skills are also registered through MCP with underscores (/sumo_qa_load_classifications, /sumo_qa_find_test_data); both routes call the same SKILL.md.
Natural language works everywhere. "Review my changes", "plan QA for this story", "load the QA classifications" โ the agent routes by tool description. Slash and natural-language paths produce the same result.
Other MCP hosts (Cursor, Codex, OpenCode, etc.): pip install sumo-qa ships a standard stdio MCP server, so it should work with anything that speaks MCP. Follow your host's MCP-server setup docs and point it at the absolute path of the sumo-qa script. Not verified end-to-end by us, so we don't ship instructions.
Host adapter folders
sumo-qa ships first-class plugin manifest folders for hosts that consume them directly. Both folders are generated from a single canonical source (pyproject.toml's [tool.sumo-qa.plugin] overlay) โ see docs/host-adapters.md for the architecture.
Host | Manifest | Install status today | Source-of-truth contract |
Claude Code |
|
| Both schema-validated against the published JSON Schemas in CI |
OpenAI Codex |
| Not verified end-to-end yet โ treat as TBD | MCP |
Adding a new host is one new template under plugin_packaging/templates/ plus the canonical-source line that describes it. The plugin-packaging CI workflow re-runs the generator on every PR and fails if any committed adapter file diverges from the canonical source.
See it in action
Ten transcripts showing the workflow on real code โ diff reviews refusing to call safe-to-merge from stale CI, TDD cycles with the red output surfaced verbatim, mutation survivors walked one at a time, formal test plans gated on entry/exit criteria, and the case where the right answer is "no tests needed, stop here":
tests/scenarios/worked-examples/ โ start with 02 โ review my changes for a representative end-to-end
tests/scenarios/SCENARIOS.md โ the scenario specs (prompt โ expected shape โ anti-patterns the skill prevents)
When sumo-qa doesn't fit
If your QA intent has no native fit (Playwright E2E, accessibility audits, k6 load testing, type checking), sumo-qa searches for an external skill through its MCP server, offers a [y/N] install gate, installs through the Skills CLI, then loads the installed SKILL.md back into the conversation.
%%{init: {'theme':'base', 'themeVariables': {
'fontFamily':'Charter, "Iowan Old Style", Georgia, serif',
'fontSize':'13px',
'primaryTextColor':'#1B1B1B',
'lineColor':'#1B1B1B'
}}}%%
flowchart LR
Intent(["QA intent<br/><i>no native fit</i>"])
Search["<b>search</b><br/><i>sumo_qa_search_external_skills</i>"]
Gate{"<b>[y/N]</b>"}
Install["<b>install</b><br/><i>sumo_qa_install_external_skill</i>"]
Locate["<b>locate & load</b><br/><i>check_installed ยท execute</i>"]
Out(["external SKILL.md<br/>in the conversation"])
Stop(["stop"])
Intent ==> Search ==> Gate
Gate -->|y| Install ==> Locate ==> Out
Gate -->|N| Stop
classDef io fill:#FAF7F2,stroke:#1B1B1B,stroke-width:2px,color:#1B1B1B
classDef step fill:#FAF7F2,stroke:#1B1B1B,stroke-width:2.5px,color:#1B1B1B
classDef gate fill:#7A1F1F,stroke:#1B1B1B,stroke-width:2px,color:#FAF7F2
classDef stop fill:#F0EAE0,stroke:#8A7B5C,stroke-width:1.5px,color:#1B1B1B
classDef done fill:#E8EDDF,stroke:#3F4A2E,stroke-width:2px,color:#1B1B1B
class Intent io
class Search,Install,Locate step
class Gate gate
class Stop stop
class Out doneThe host does not run
npxdirectly;sumo_qa_search_external_skills,sumo_qa_check_external_skill_installed,sumo_qa_install_external_skill, andsumo_qa_execute_external_skillown the lifecycle.Search returns the Skills CLI's text output verbatim (ANSI stripped); the host LLM reads it as the user would. No structured parser to drift out of date.
Node.js is required for the Skills CLI. If
npxis missing, the MCP tool returns an actionable error and stops. It doesn't elevate via sudo.The external skill suggests tool-specific setup, but sumo-qa's setup standard overrides it โ any machine-level / global install in the returned skill body is translated to a repo-pinned + CI-reproducible equivalent โ while sumo-qa keeps the confirmation gates, test evidence, and risk-to-test mapping.
Support
Filing a clear issue gets it fixed faster. Pick the template that matches the problem:
Symptom | Template |
| |
Install worked, but the host (Claude Code, VS Code + Copilot, JetBrains, Cursor, โฆ) does not surface tools or skills correctly | |
The wrong sumo-qa skill ran for a prompt (or none ran when one should have) | |
A skill ran, but its QA output was generic, wrong, or missed something | |
You want a new workflow, skill, or host integration | |
Reproducible defect that does not fit the above |
License
Apache 2.0. See NOTICE for attribution requirements that apply to forks and redistributors.
More docs
AGENTS.md โ AI-agent bootstrap and per-host setup
docs/ARCHITECTURE.md โ three layers, host delivery, knowledge authority
docs/SKILLS.md โ every skill with its Iron Law
docs/TOOLS.md โ every MCP entry point
docs/INSTALL.md โ per-host install detail and troubleshooting
docs/CONTENT-FORMATS.md โ schemas + worked examples for adding team standards, knowledge, change rules, and test data (incl. swapping ISTQB out)
docs/CONFIGURATION.md โ env vars
docs/DEVELOPMENT.md โ local dev
docs/TEST-DATA.md โ known-good test-data catalogue
docs/REPO-MAP.md โ QA-native repo-map artifact under
.sumo-qa/: schema, scanner, and the scan / diff-impact / query tools that consume it (issues #155, #156)docs/RISK-LEDGER.md โ risk-to-test traceability ledger: the structured appendix to the markdown-first verdict, its row schema and evidence-status vocabulary, and when not to use it (issue #144)
docs/EXPORT.md โ deterministic export of already-structured QA test cases to versioned JSON, a markdown table, or (flat-only) CSV: the case schema, the format set, the side-effect-free contract, and the import-mapping caveat (issue #148)
docs/PERSONA.md โ optional Sumo-sensei voice (off by default)
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/sumithr/sumo-qa'
If you have feedback or need assistance with the MCP directory API, please join our Discord server