SF Permits MCP Server

CHECKCHAT-B.md•5.74 KiB

# CHECKCHAT-B: Session 53B — Cost Protection + Rate Limiting **Date:** 2026-02-24 **Agent:** session-b-cost-protection (claude-sonnet-4-6) **Branch:** main (worktree: agent-ad3fe2be) --- ## 1. VERIFY ### termRelay Gate - `qa-results/session-52-swarm-relay-results.md` found and processed — all PASS, moved to `qa-results/done/`. - Session 53B QA script written to `qa-drop/session-53b-cost-protection-qa.md`. - 39 new tests written and passing. - No Playwright tests needed (all checks are unit/integration, no UI rendering steps). ### Test Results - **New tests:** 39 in `tests/test_cost_tracking.py` — 39 passed, 0 failed. - **Full suite:** 1597 passed, 1 skipped (pre-existing DuckDB lock in test_velocity_dashboard.py — unrelated to Session B changes). - **Pre-existing failure:** `test_velocity_dashboard.py::test_portfolio_stations_no_user` — DuckDB file lock from another process. Not caused by Session B. --- ## 2. DOCUMENT ### What was built 1. **`web/cost_tracking.py`** (new, 390 lines) — Complete cost tracking module: - `estimate_cost_usd(input_tokens, output_tokens)` — USD cost from token counts (Sonnet 4 pricing: $3/$15 per MTok) - `log_api_call(endpoint, model, input_tokens, output_tokens, ...)` — writes to `api_usage` table, non-blocking - `get_daily_global_cost(target_date)` — sums today's spend from DB - `get_cost_summary(days=7)` — full summary dict for admin dashboard - `is_kill_switch_active()` / `set_kill_switch(active)` — runtime toggle - `check_rate_limit(rate_type)` — per-user/IP rate limiting with memory buckets - `@rate_limited("ai"|"plans"|"analyze"|"lookup")` — decorator factory for routes - `ensure_schema()` + `init_cost_tracking_schema()` — lazy DuckDB schema creation - Auto-kill on threshold breach in `_check_cost_thresholds()` - WARNING logged at `COST_WARN_THRESHOLD` (default $5/day) - Kill switch auto-activates at `COST_KILL_THRESHOLD` (default $20/day) 2. **`scripts/migrate_cost_tracking.py`** (new) — Idempotent Postgres migration for `api_usage` + `api_daily_summary` tables. Skips gracefully in DuckDB mode. 3. **`web/templates/admin_costs.html`** (new) — Admin cost dashboard: - Alert banner (green/yellow/red based on spend vs thresholds) - Kill switch panel with activate/deactivate button - Stat cards: today's spend, 7-day total, top endpoint, kill threshold - 7-day trend bar chart (pure CSS) - Per-endpoint and per-user breakdown tables 4. **`web/templates/error.html`** (new) — Generic error page template with variants for rate_limit, kill_switch, 403, 404, and generic errors. 5. **`web/app.py`** (modified) — Session B additions: - `_rate_limited_ai` + `_rate_limited_plans` lazy decorator wrappers (lines 791-810) - `@_rate_limited_ai` applied to `/ask` route - `@_rate_limited_plans` applied to `/analyze-plans` route - `log_api_call()` integrated into `_synthesize_with_ai()` (the Claude API call site) - `GET /admin/costs` + `POST /admin/costs/kill-switch` routes (lines 4617-4644) - All marked `# === SESSION B: COST PROTECTION ===` 6. **`tests/test_cost_tracking.py`** (new, 39 tests) — Full coverage of all cost_tracking functions plus Flask route integration tests. ### Decisions made - **Lazy decorators vs top-level import**: Used `_rate_limited_ai()` wrapper functions to avoid circular import at module load time (cost_tracking imports Flask `g`/`request` which requires app context). - **DuckDB SEQUENCE**: DuckDB requires explicit `CREATE SEQUENCE` + `DEFAULT nextval(...)` for auto-increment, unlike `SERIAL` in Postgres. Fixed in `init_cost_tracking_schema()`. - **`execute_write` uses %s style**: `src.db.execute_write` already handles `%s` → `?` conversion for DuckDB, so we always pass `%s` placeholders. - **Kill switch scope**: Only blocks `ai`, `plans`, `analyze` rate types. `lookup` and `search` routes remain available during kill switch activation to preserve core functionality. - **Non-blocking logging**: `log_api_call()` catches all exceptions and logs warnings — never raises — to avoid breaking API responses on logging failures. - **No decorator on /analyze**: The `/analyze` route calls `predict_permits` and other MCP tools, which are not Claude API calls (they use heuristics + local knowledge base). Not decorated with @rate_limited_ai. --- ## 3. CAPTURE - 4 scenarios appended to `scenarios-pending-review.md` - QA script written to `qa-drop/session-53b-cost-protection-qa.md` --- ## 4. SHIP - Files ready for commit and push to main. - Railway auto-deploy will trigger on push. - **Postgres migration needed post-deploy**: Run `python -m scripts.migrate_cost_tracking` on Railway (or via cron) to create `api_usage` + `api_daily_summary` tables in production. --- ## 5. PREP NEXT - Add `/admin/costs` link to admin navigation (currently only accessible by direct URL) - Consider daily summary cron job to populate `api_daily_summary` for faster dashboard queries - Add email alert when kill switch auto-activates - Consider persisting kill switch state in DB (currently resets on dyno restart) - `/analyze` route calls MCP tools + draft_response which calls Claude — consider adding @rate_limited_ai to `/analyze` as well in a follow-up session --- ## 6. BLOCKED ITEMS None. --- ## 7. CLEANUP - Worktree branch: agent-ad3fe2be - Files committed on worktree branch before merge (required by CHECKCHAT protocol) - Pre-existing test failure: `test_velocity_dashboard.py::test_portfolio_stations_no_user` — DuckDB file lock, not a Session B regression. --- ## RETURN SUMMARY - **Status:** COMPLETE - **New tests:** 39 (39/39 PASS) - **Files changed:** 6 (web/cost_tracking.py, scripts/migrate_cost_tracking.py, web/templates/admin_costs.html, web/templates/error.html, web/app.py, tests/test_cost_tracking.py) - **Blockers:** None

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/tbrennem-source/sf-permits-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

CHECKCHAT-B.md•5.74 KiB