mk-qa-master

Overview Schema Related Servers Score Discussions

run_tests

Execute the test suite under the active QA runner and produce a structured report. Supports filter to narrow test scope and auto-refreshes optimization plan.

Instructions

Execute the test suite under the active QA_RUNNER and produce a structured report. The single most-called tool — invoke whenever a user says 「跑/run/test/check/驗證/執行」, after generate_test (verify new test), or after a fix (confirm bug gone).

Behavior:

Invokes the runner's native CLI under QA_PROJECT_ROOT — pytest with --screenshot=on / --tracing=on / --video=retain-on-failure, or npx jest --json, npx cypress run --reporter json, go test -json, maestro test --format junit
Optional filter narrows the scope: pytest -k expr, jest -t pattern, cypress --spec glob, go -run regex, maestro flow-name substring
Writes report.json (pytest-json-report shape, runner-agnostic) + JUnit XML
Snapshots the run into history/ and auto-triggers optimizer.write_plan() → optimization-plan.md is refreshed
Maestro: auto-retries flows that failed on first attempt (MAESTRO_RETRY=true), surfaces flaky_in_run count Returns: {exit_code, raw_exit_code, stdout_tail, stderr_tail, retry_enabled, flaky_in_run, ...}

When to use:

After writing a new test → verify it actually passes
Smoke before a release
Whenever the user prompt contains a run/test verb

When NOT to use:

Inspecting last results without re-running → use get_test_report (cheaper)
Re-running only failed cases → use run_failed (way faster)
Enumerating which tests exist → use list_tests

Edge cases:

No tests match filter → exit_code != 0 with 「no tests ran」 in stderr_tail
QA_TIMEOUT_SECONDS exceeded → exit_code 124 + [TIMEOUT…] tag in stderr_tail
filter starting with - or containing .. → blocked by security guardrail, returns {error: …}

Input Schema

TableJSON Schema

Name	Required	Description	Default
`filter`	No	選填，測試名稱關鍵字。pytest 走 -k 表達式（支援 and/or/not）、Jest 走 -t、Cypress 走 --spec '*/<filter>*'、Go 走 -run regex、Maestro 在 flow 檔名作子字串比對。
`headed`	No	選填，僅對 pytest-playwright 有效。True 時瀏覽器有 UI 模式跑（適合 debug、看 flake 視覺現象）；預設 headless 跑、CI / 大量套件用這個。
`browser`	No	選填，僅對 pytest-playwright 有效，指定 Playwright 啟用的 browser engine。需事先 `playwright install <browser>` 過。	chromium

Tool Definition Quality

A4.8/5.0

Behavior5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description fully discloses behavior: invocation details per runner, filter mechanics, report writing, snapshotting, auto-triggering optimizer, Maestro retries, and return value structure. Edge cases (no match, timeout, security guardrail) are covered.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured with sections (behavior, when to use/not, edge cases). However, slightly verbose; filter description in behavior section partially duplicates param description. Still front-loaded with core purpose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Comprehensive given no output schema. Covers return fields, edge cases, and environment specifics. Lacks full exit_code interpretation beyond examples, but adequate for a complex tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%. The description adds context beyond schema, e.g., how filter works across different runners (pytest -k, jest -t, cypress --spec, go -run, Maestro substring), headed vs headless for debugging, and browser install prerequisite.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Execute the test suite under the active QA_RUNNER and produce a structured report.' It distinguishes from siblings like run_failed, get_test_report, and list_tests by providing explicit scope and differentiation.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit 'When to use' and 'When NOT to use' sections with specific scenarios and alternative tool names, e.g., 'Inspect last results without re-running → use get_test_report.' Also covers edge cases like no match and timeout.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Latest Blog Posts

Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
open source
OpenAI
Tool Definition Quality Score (TDQS)
By punkpeye on April 3, 2026.
mcp
The Hackers Who Tracked My Sleep Cycle
By punkpeye on March 26, 2026.
security

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/kao273183/mk-qa-master'

If you have feedback or need assistance with the MCP directory API, please join our Discord server