argus-qa
This server lets you run AI-powered automated QA audits on web pages:
argus_audit— Quick QA Pass: Fast audit checking for JS errors, network failures, accessibility issues, SEO problems, security vulnerabilities, CSS issues, and content quality. Optionally mark routes as critical to escalate console errors.argus_audit_full— Deep QA Pass: Comprehensive audit including everything in the quick pass, plus Lighthouse scoring (performance, accessibility, SEO, best practices), responsive layout checks across 4 viewports (375px, 768px, 1024px, 1440px), memory leak detection via V8 heap snapshots, hover-state bug detection, and accessibility tree snapshot analysis.argus_compare— Environment Diff: Compare two environments (e.g., dev vs. staging) side-by-side — captures screenshots, runs the full analyzer suite on both, and diffs findings to identify regressions or severity changes. Falls back to detailed CSS analysis if no staging URL is configured.argus_last_report— Retrieve Last Report: Fetch the most recently saved Argus JSON report from thereports/directory, useful for reviewing previous results without re-running an audit.
Sends rich audit reports to Slack channels, with issue severity classification and routing.
Argus — AI-Powered Automated QA
Argus catches the bugs your test suite misses — visual regressions, API loops, CSS drift, console noise, accessibility failures, and more — and delivers rich reports to Slack (or a local HTML dashboard).
Quick Start · Features · Setup · MCP Tools · CLI Commands · Troubleshooting · Full Reference
Quick Start
No install required.
npxauto-downloads Argus on first run.
Step 1 — Add to .mcp.json in your project root:
{
"mcpServers": {
"chrome-devtools": { "command": "npx", "args": ["-y", "chrome-devtools-mcp@latest"] },
"argus": { "command": "npx", "args": ["-y", "argusqa-os"] }
}
}Or via Claude Code CLI:
claude mcp add chrome-devtools -- npx -y chrome-devtools-mcp@latest
claude mcp add argus -- npx -y argusqa-osStep 2 — Start Chrome with remote debugging:
# macOS
open -a "Google Chrome" --args --remote-debugging-port=9222 --headless=new
# Windows (PowerShell)
& "C:\Program Files\Google\Chrome\Application\chrome.exe" --remote-debugging-port=9222 --headless=new --no-sandbox --disable-gpu --user-data-dir="$env:TEMP\chrome-argus"
# Linux
google-chrome --remote-debugging-port=9222 --headless=new --no-sandboxStep 3 — Run an audit:
Run argus_audit on http://localhost:3000Argus scans your app and either posts findings to Slack or opens a local report.html. That's it.
Related MCP server: AWT (AI Watch Tester)
What Argus Catches
32 analysis engines, 140 distinct issue types, zero test-file maintenance:
Category | What it detects |
JavaScript | Uncaught exceptions, unhandled promise rejections, |
Network & API | HTTP 5xx, 401/403 auth failures, duplicate API calls (infinite loops), 4xx errors, broken links |
Performance | LCP > 2500ms, CLS > 0.1, TTFB > 800ms, slow APIs > 1s/3s, payloads > 500KB/2MB, JS bundles > 500KB |
Accessibility | axe-core (80+ WCAG rules), color-blind simulation, missing ARIA, keyboard focus, heading hierarchy |
SEO | Missing meta description, OG tags, canonical, viewport, h1 |
Security | Auth tokens in localStorage/URL, |
CSS | Cascade overrides, component style leaks, unused rules, React inline style conflicts |
Content |
|
Responsive | Horizontal overflow at 375px/768px, touch targets < 44×44px |
Memory | Detached DOM nodes via V8 heap snapshot, heap growth across navigation |
Visual | Pixel-level screenshot regression via pixelmatch (≥0.1% warning, ≥5% critical) |
Figma | Design-to-implementation fidelity — 13 property types (color, spacing, typography, shadows, etc.) |
Forms | Missing |
Fonts | FOIT, FOUT, missing fallbacks, slow loads > 1s, suboptimal formats |
Motion |
|
Network baseline | New requests, missing requests, status-code regressions vs saved HAR baseline |
Environment diff | Dev vs staging — screenshot diff, DOM changes, console/network regressions |
And every finding is post-processed with:
Post-processor | What it adds |
Intelligent baseline filtering | Findings that flip-flop across runs are tagged |
Root cause linking | New findings are annotated with the recent git commits and files most likely to have caused them ( |
All findings are classified as
critical/warning/infoand routed to the right Slack channel — or surfaced in the local HTML report. For per-finding severity tables and detection methods, see REFERENCE.md.
MCP Tools
Ask Claude (or any MCP client) — no terminal required:
Tool | Description |
| Fast pass — JS, network, accessibility, SEO, security, CSS, content |
| Deep pass — adds Lighthouse, responsive checks, memory leak detection, hover-state bugs |
| Diff dev vs staging — screenshots, findings delta, environment regressions |
| Capture everything broken on the open tab for Claude to diagnose |
| Snapshot the open tab without navigating (preserves auth/form state) |
| Return last JSON report without re-running |
| Figma URL → 13 design-token finding types (color, spacing, typography, shadows, etc.) |
| Screenshot baseline comparison. Pass |
| Fetch GitHub PR diff → map changed files to affected routes → targeted audit → |
Example prompts:
Run argus_audit on http://localhost:3000/checkout
Run argus_audit_full on http://localhost:3000/dashboard
Run argus_compare
Run argus_get_contextFull Setup
Prerequisites
Requirement | Version |
Node.js | v20.19+ |
Chrome | Stable (desktop or headless) |
Claude Code | Latest ( |
Slack workspace | Optional — omit for local |
Option A — MCP Server (recommended for Claude Code users)
No local install needed. Use the Quick Start above, then add your target URL:
# .env in your project root
TARGET_DEV_URL=http://localhost:3000
TARGET_STAGING_URL=https://staging.example.com # optional — enables argus_compareOptional — Slack notifications:
api.slack.com/apps → Create New App → name it BugBot
OAuth & Permissions → Bot Token Scopes:
chat:write,files:write,files:readInstall to workspace → copy the
xoxb-...tokenCreate channels
#bugs-critical,#bugs-warnings,#bugs-digestand run/invite @BugBotin each
SLACK_BOT_TOKEN=xoxb-...
SLACK_CHANNEL_CRITICAL=C0000000000
SLACK_CHANNEL_WARNINGS=C0000000001
SLACK_CHANNEL_DIGEST=C0000000002Without Slack: Argus auto-generates
reports/report.htmland opens it in your browser — zero extra config.
Option B — npm Package (CI / dev dependency)
npm install --save-dev argusqa-os
npx argus init # interactive wizard — detects framework, discovers routes, writes .env
npm run crawl # run after Chrome is startedOption C — Clone the Repository (contributors / full source)
git clone https://github.com/ironclawdevs27/Argus.git
cd Argus
npm install
npm run init # interactive setup wizardManual setup (skip the wizard):
cp .env.example .env
# Fill in TARGET_DEV_URL and optional Slack tokensThen configure your routes in src/config/targets.js:
export const routes = [
{ path: '/', name: 'Home', critical: true, waitFor: 'main' },
{ path: '/login', name: 'Login', critical: true, waitFor: 'form' },
{ path: '/dashboard', name: 'Dashboard', critical: true, waitFor: '[data-testid="dashboard"]' },
{ path: '/settings', name: 'Settings', critical: false, waitFor: null },
];critical: true— errors on this route go to#bugs-criticalwaitFor— CSS selector Argus waits for before capturing (signals page-ready)
CLI Commands
npm run chrome # Launch Chrome with --remote-debugging-port=9222 (auto-detects binary)
npm run doctor # Pre-flight check: Chrome reachable, .mcp.json valid, .env has TARGET_DEV_URL
npm run crawl # Batch audit of all configured routes
npm run compare # Dev vs staging diff (CSS-only if no staging URL)
npm run watch # Passive monitor — polls open Chrome tab every 1s
npm run report:html # Generate reports/report.html from last JSON audit
npm run report:pdf # Export HTML report to A4 PDF (requires: npm install puppeteer)
npm run server # Start Slack slash-command server (port 3001)
npm run init # Interactive setup wizard
npm run test:unit # 61 unit tests — no Chrome required
npm run test:harness # 144-block correctness harness — requires Chrome
npm run test:harness:log # same, but tees full output to harness-results.txtWatch mode — live monitoring as you develop:
# Terminal 1: start your app
npm run dev
# Terminal 2: start Argus watcher
npm run watch
# Ctrl+C → stops monitor and writes reports/report.htmlSlack slash command (on-demand from any channel):
/argus-retest https://staging.example.com/checkoutTo expose the server via tunnel: cloudflared tunnel --url http://localhost:3001 (free, no account required). Set the resulting URL as the Request URL in Slack App → Slash Commands.
GitHub Actions CI
Add to your repo's secrets (Settings → Secrets → Actions):
Secret | Required | Value |
| Yes | Your staging base URL |
| No |
|
| No* | Channel ID (needed when Slack is configured) |
| No* | Channel ID |
| No* | Channel ID |
| No | Auto-injected by Actions for PR comments + Check Runs |
The included workflow runs on push to main, daily at 6 AM UTC, and on manual trigger. If critical issues are found, the pipeline fails.
Environment Variables
Variable | Default | Description |
| — | Required. Base URL of your dev environment |
| — | Staging URL — enables |
| — |
|
| — | For |
| — | Channel ID for critical bugs |
| — | Channel ID for warnings |
| — | Channel ID for info / daily digest |
|
| Slack slash-command server port |
|
| Where to write JSON reports |
|
| Parallel MCP clients for route crawling |
|
|
|
| — | Set |
|
| Max retries for |
|
| Watch mode poll interval (ms) |
|
| Watch mode web dashboard port |
| — | App source path — enables env-var / feature-flag / dead-route analysis |
| — | Path to app |
|
| Pixel diff % threshold for environment comparison |
| — | For PR comments + Check Runs |
| — |
|
| — | Auto-injected by Actions from PR context |
|
| New criticals before blocking merge (0 = never block) |
| — | Visual diff image URL to embed in PR comment |
| — | OTLP collector for Jaeger / Grafana Tempo |
| — | Required for |
|
| Slow web font load threshold (ms) |
|
| WCAG AA min contrast ratio for CVD simulation |
Troubleshooting
Chrome DevTools MCP not connecting
claude mcp add chrome-devtools -- npx chrome-devtools-mcp@latest
# Restart Claude Code after addingSlack messages not posting
Token must start with
xoxb-(notxoxp-,xoxe-, orxapp-)Run
/invite @BugBotin each channelRequired scopes:
chat:write,files:write,files:read
Screenshots are blank
Page hasn't settled — increase
pageSettleMsinsrc/config/targets.jsor add awaitForselector for the route
/argus-retest returns "dispatch_failed"
Tunnel URL changed — update the Request URL in Slack App → Slash Commands and reinstall
CSS analysis returns empty results
Page may be behind auth — ensure you're logged in on the Chrome instance Argus is controlling
CI pipeline fails immediately
Chrome may not start fast enough — increase
sleep 3tosleep 5in .github/workflows/argus.yml
How Argus Differs From Playwright / Cypress
Argus is a complementary layer, not a replacement for unit or E2E tests:
Playwright / Cypress | Argus | |
Purpose | Test your logic and API contracts | Catch what the user actually sees |
What it catches | Regressions in behavior | CSS drift, visual regressions, API loops, console noise, perf budgets |
When it runs | In your test suite | Continuously, on the live running app |
Setup | Write test files | Configure routes in |
Output | Pass / fail | Structured Slack reports with screenshots |
Known Limitations
All 738 harness assertions pass (738/738) — there are currently no known MCP- or Chrome-layer restrictions. Soft assertions (Lighthouse, performance traces) still require non-headless Chrome and are skipped in headless CI.
Project Structure
src/
argus.js — single-page audit entry point
mcp-server.js — 9 MCP tools exposed to Claude / any MCP client
orchestration/ — crawl loop, Slack/GitHub dispatch, env comparison, watch mode
utils/ — 32 analysis engines (accessibility, security, performance, PDF, recording, etc.)
adapters/browser.js — CdpBrowserAdapter — wraps all chrome-devtools-mcp calls
config/targets.js — routes, thresholds, auth steps
cli/
init.js — argus init interactive setup wizard
chrome-launcher.js — npm run chrome / argus-chrome — launches Chrome with correct flags
doctor.js — npm run doctor / argus-doctor — pre-flight checks
pr-validate.js — headless CI entry point for GitHub Actions
test-harness/ — 144-block correctness harness, 738 hard assertions, 60 fixture pages
test/unit/ — 61 Vitest unit tests (no Chrome required)
landing/ — Product landing page (React 19 + Vite + Tailwind)Full source map → CLAUDE.md · MCP/DSL reference → SKILL.md
Contributing
Fork the repo and create a branch
npm run test:unit— verify without Chrome (61 tests)npm run test:harness— full integration coverage (requires Chrome on port 9222)Open a PR — Argus audits itself via the CI workflow
License
MIT © ironclawdevs27
Argus Panoptes — the all-seeing giant of Greek mythology who never slept.
Maintenance
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/ironclawdevs27/Argus'
If you have feedback or need assistance with the MCP directory API, please join our Discord server