What can you do with this server?

The Cloudwright MCP server lets you design, cost, validate, and export cloud architectures using natural language across AWS, GCP, Azure, and Databricks. Design & Modification * design_architecture — Generate a complete architecture spec from a plain-English description * modify_architecture — Evolve an existing spec with natural-language instructions (e.g. "add a Redis cache") * compare_providers — Translate an architecture across cloud providers to see equivalent service mappings Cost Estimation * estimate_cost — Per-component monthly cost breakdown (offline) * compare_provider_costs — Compare costs for the same architecture across providers (offline) Compliance & Security * validate_compliance — Check against HIPAA, PCI-DSS, SOC 2, FedRAMP, GDPR, and AWS Well-Architected * security_scan — Detect anti-patterns (unencrypted stores, public databases, missing WAF, etc.) * scan_terraform — Audit existing Terraform HCL for security misconfigurations * lint_architecture — Check for hygiene issues, anti-patterns, and best-practice violations Analysis * analyze_blast_radius — Identify failure domains, SPOFs, and transitive component impact * score_architecture — Overall quality score (0–100) across reliability, security, cost, compliance, and complexity * diff_architectures — Compare two architecture versions with cost delta and compliance impact Export * export_architecture — Export to Terraform, CloudFormation, Mermaid, D2, SBOM (CycloneDX), AI BOM, or compliance audit report Discovery * list_services — List supported cloud services by provider * catalog_search — Search instance catalog by vCPU, memory, price, or free-text for right-sizing Multi-turn Chat Sessions * chat_create_session / chat_send — Stateful, context-aware architecture design conversations with optional provider, compliance, and budget constraints * chat_list_sessions / chat_delete_session — Manage saved sessions

Which integrations are available for this server?

Provides Terraform exporter module for Databricks infrastructure as part of the multi-cloud infrastructure code generation capabilities. Supports containerized deployment of the web server with Dockerfile and docker-compose.yml for production-ready containerization. Supports Kubernetes service normalization across cloud providers (AWS EKS, GCP GKE, Azure AKS) in architecture specifications. Supports MongoDB service normalization across cloud providers (AWS DocumentDB, GCP Cloud Datastore, Azure Cosmos DB) in architecture specifications. Provides LLM capabilities for architecture design, modification, chat interactions, and ADR generation through OpenAI models like GPT-5, with provider-aware routing and API key configuration. Supports PostgreSQL service normalization across cloud providers (AWS RDS, GCP Cloud SQL, Azure Database for PostgreSQL) in architecture specifications. Supports Redis service normalization across cloud providers (AWS ElastiCache, GCP Memorystore, Azure Cache for Redis) in architecture specifications. Generates Terraform infrastructure as code from architecture specifications, with provider-specific modules for AWS, GCP, Azure, and Databricks.

Cloudwright

Describe a cloud architecture in English. Get Terraform, costs, and a compliance check.

PyPI License: MIT Python 3.12+ xmpuspus/cloudwright MCP server

pip install 'cloudwright-ai[cli]'
export ANTHROPIC_API_KEY=sk-ant-...
cloudwright design "HIPAA healthcare API on AWS with Postgres and Redis"

Cloudwright takes a one-line description of a cloud system and produces a structured architecture spec, a per-component cost breakdown, a compliance report, and ready-to-apply Terraform, Pulumi (TypeScript or Python), or CloudFormation. It works across AWS, GCP, Azure, and Databricks. The latest work adds compliance scanning that maps every finding to the framework control it violates (HIPAA / SOC 2 / FedRAMP / PCI-DSS / ISO 27001 / NIST), a cloudwright plan step that proves the exported infrastructure actually deploys, and live import for GCP and Azure alongside AWS.

Try it - What's new - Docs - MCP server

What you get

Architecture spec (typed YAML, version-controlled, the single source of truth)
Cost breakdown across AWS, GCP, Azure, Databricks (region-aware, per-component, four pricing tiers, optional carbon + FOCUS CSV export). Each line carries a confidence flag — high from the bundled price catalog (deepest on AWS), low from formula/fallback — so an estimate never silently passes off a guess as a quote.
Compliance report covering HIPAA, SOC 2, PCI-DSS, FedRAMP Moderate, GDPR, NIST 800-53, and Well-Architected, with OSCAL 1.1.2 export and control traceability
Terraform, OpenTofu, Pulumi (TypeScript or Python), and CloudFormation export with safe defaults (encryption, IMDSv2, locked-down S3, sensible RDS settings)
Diagrams in ASCII, Mermaid, D2, and a fully editable web canvas
MCP server for AI agents (Claude Desktop, Cursor, Cline, and any MCP-compatible client)

Related MCP server: Cloud Pilot MCP

Quickstart

cloudwright design "HIPAA healthcare API on AWS with Postgres and Redis"
cloudwright cost spec.yaml --workload-profile medium
cloudwright validate spec.yaml --compliance hipaa,soc2
cloudwright export spec.yaml --format terraform -o ./infra
cloudwright chat --web                                # browser canvas at http://localhost:8765

All commands except design, modify, chat, and adr work fully offline. Set ANTHROPIC_API_KEY (preferred) or OPENAI_API_KEY to enable the LLM-powered ones. Drop --json on any command for machine-readable output.

Smart Canvas + Module Catalog (v1.2)

The web diagram is a fully editable architecture canvas. Edits (add, drag, connect, edit fields, delete) are deterministic frontend mutations, so they are instant, free, and reproducible. They do not call the LLM.

A left-side Catalog drawer has three tabs:

Resources - the full catalog for the active provider, served by /api/catalog/services (case-insensitive ?provider=).
Modules - approved multi-resource patterns from /api/modules. Bundled: AWS Three-Tier Web, AWS Serverless API, AWS Data Lake, GCP Serverless API, Azure Three-Tier Web.
Standards - runs POST /api/canvas/validate and surfaces orphan connections, partial modules, unapproved modules, naming-prefix violations, and missing required tags.

When a module instance is intact, the Terraform exporter emits a single module "<id>" block with the catalog's pinned source and version. Modified modules fall back to per-component resource rendering. Mixed specs work: catalog modules render as modules, ad-hoc resources as resources, side by side.

cloudwright chat --web
# Open http://localhost:8765, use the Catalog drawer, then Export -> Terraform

MCP server (works with every MCP client)

Expose Cloudwright as Model Context Protocol tools so AI agents can design, cost, validate, review, check compliance, plan, and export architectures directly. Almost every popular coding agent is an MCP client (Claude Code, Cursor, Cline, Windsurf, GitHub Copilot, Zed, OpenAI Codex CLI, JetBrains Junie, Kiro, Antigravity), so one server puts cloudwright inside all of their agent loops. Tool groups: design, cost, validate, analyze, export, session, review, compliance, plan.

pip install cloudwright-ai-mcp
cloudwright mcp                              # all tools, stdio
cloudwright mcp --tools design,cost,review   # subset
cloudwright mcp --transport sse              # SSE for HTTP clients

Do not hand-write the config. cloudwright integrate --harness <name> prints the exact wiring for your agent in its own format, and --write merges it into the right file. For a manual setup, the base shape (Claude Code, Cursor, Cline, Windsurf, Copilot, Junie) is:

{
  "mcpServers": {
    "cloudwright": {
      "command": "cloudwright",
      "args": ["mcp"]
    }
  }
}

Zed uses context_servers, VS Code Copilot uses servers, and Codex CLI uses a [mcp_servers.cloudwright] TOML table. cloudwright integrate emits the right one for each, and Aider (not an MCP client) gets a CLI-pipe recipe. Full matrix in docs/integrations.md.

Analysis

cloudwright lint (10 anti-pattern checks), cloudwright score (5-dimension quality grade), cloudwright analyze (blast radius and SPOF), cloudwright drift <spec> <tfstate> (design vs deployed), cloudwright policy --rules policy.yaml (policy-as-code with 9 built-in checks), cloudwright security (security anti-patterns; also scans exported Terraform HCL), cloudwright compliance <spec> --frameworks hipaa,soc2,fedramp (every finding mapped to its HIPAA / SOC 2 / FedRAMP / PCI-DSS / ISO 27001 / NIST control ID, with optional Checkov deep scan), and cloudwright plan <spec> --target terraform (proves the exported artifact validates / plans). Every command supports --json. See docs/ and the examples/ directory for end-to-end samples.

Python API

from cloudwright import ArchSpec
from cloudwright.cost import CostEngine
from cloudwright.validator import Validator
from cloudwright.exporter import export_spec

spec = ArchSpec.from_file("spec.yaml")
priced = CostEngine().estimate(spec, workload_profile="medium")
results = Validator().validate(spec, compliance=["hipaa", "pci-dss"])
hcl = export_spec(spec, "terraform", output_dir="./infra")

What's new in v1.7.0

Cloudwright's design-time checks now reach every coding agent, not just its own CLI. Almost every popular AI coding harness speaks MCP, so one server puts review, compliance, and plan inside their agent loops.

cloudwright integrate wires cloudwright into 11 harnesses. It writes the MCP server config in each client's own format (Claude Code, Cursor, Cline, Windsurf, GitHub Copilot, Zed, OpenAI Codex CLI, JetBrains Junie, Kiro, Antigravity), and gives Aider the CLI-pipe path. cloudwright integrate --rules emits a gate block for AGENTS.md, CLAUDE.md, or GEMINI.md that tells any agent to run cloudwright review/cost/compliance before it writes infrastructure. See docs/integrations.md.
The MCP server exposes the differentiators. New review_architecture (offline critique, no key), scan_compliance_controls (control-mapped, with an oscal flag), and read-only plan_infrastructure tools. Every MCP client gets them.
Review and OSCAL on the web canvas. A Review tab (score, grade, findings) and an OSCAL JSON download on the Compliance panel.
Cost stops overclaiming. Estimates report prices_as_of (catalog vintage) separately from estimated_on (today), show a "17/20 catalog-backed" ratio, carry per-alternative confidence in compare, and use a real regional price row when the catalog has one. docs/provider-coverage.md states per-provider coverage plainly.
Web hardening. Diagram rendering no longer blocks the event loop, spec payloads are size- and component-capped, the container refuses to serve unauthenticated by accident, and malformed input returns a clean error instead of a traceback.

cloudwright integrate --harness claude-code      # MCP config for your agent
cloudwright integrate --rules --agent-file claude # gate block for CLAUDE.md

What's new in v1.6.0

The design engine now reviews and repairs its own output, compliance binds at design time with OSCAL output, and the cost estimate stops guessing silently.

The architect self-corrects. Every cloudwright design runs the built-in critics (scorer, linter, validator) against the generated spec and, when blocking findings remain, repairs it in one bounded pass before you ever see it — recorded in spec.metadata.critique. The same engine is a free, offline command: cloudwright review spec.yaml gives a severity-ranked architecture review with no API key.
OSCAL + control traceability. cloudwright compliance spec.yaml --frameworks fedramp --oscal emits an OSCAL 1.1.2 component-definition — control mapping a CSPM or evidence tool cannot produce before deploy. --traceability prints the chain design intent -> component -> Terraform resource -> control ID -> status.
Cost you can defend. Region-aware pricing (every region used to be priced as us-east-1), data-transfer/egress estimation, a per-line pricing confidence (high = catalog, low = fallback), design-time carbon (cloudwright cost --carbon), and FOCUS-spec CSV export (--focus).
Drift -> remediation and OpenTofu. cloudwright drift ... --remediate turns drift into a cost + compliance + plan preview (read-only). cloudwright export --format opentofu and a tofu-aware plan.
Hardening. Terraform exporter injection hardening, cloudwright plan no longer carries the LLM key into the IaC subprocess, the WAF export is deployable, and the "compliance overrides workload profile" guarantee is now actually enforced for sandbox specs.

cloudwright review spec.yaml                                   # offline, no API key
cloudwright compliance spec.yaml --frameworks fedramp --oscal  # OSCAL component-definition
cloudwright cost spec.yaml --carbon --focus                    # region-aware + carbon + FOCUS CSV
cloudwright export spec.yaml --format opentofu -o ./infra

See docs/ for getting-started, CLI, MCP, and troubleshooting guides.

What's new in v1.5.0

Terminal — cloudwright compliance maps every finding to its framework control ID, then cloudwright plan proves the Terraform validates:

Web canvas — the same checks as Compliance and Plan tabs:

Compliance scanner with framework control-ID mapping. cloudwright compliance spec.yaml --frameworks hipaa,soc2,fedramp maps every design-stage finding to the exact control it violates — HIPAA 164.312(a)(2)(iv), SOC 2 CC6.1, FedRAMP SC-28, plus PCI-DSS, GDPR, ISO 27001, NIST 800-53 — before any infrastructure exists. No competitor maps findings to control IDs at design time. The mapping runs on the built-in scanner with zero external tooling; when the Checkov binary is present it is run against the exported Terraform and its CKV_* findings fold into the same control-mapped report. Per-framework posture table, audit-ready markdown report (-o report.md), POST /api/compliance, and a Compliance tab in the canvas. pip install 'cloudwright-ai[compliance]' for the Checkov deep scan; the control mapping works without it.
cloudwright plan — prove it deploys. cloudwright plan spec.yaml --target terraform runs terraform validate (and terraform plan when credentials are present) against the generated artifact; --target pulumi-python|pulumi-ts runs pulumi preview. Read-only — nothing is applied. validate needs no credentials and is the offline proof of deployability; plan adds a real +add ~change -destroy diff when credentials resolve. DEPLOYABLE / NOT DEPLOYABLE verdict in the CLI, POST /api/plan, and a Plan tab in the canvas.
Live GCP and Azure import. cloudwright import-live --provider gcp --project PROJECT (Compute Engine, Cloud Storage, Cloud SQL) and --provider azure --subscription SUB_ID (Virtual Machines, Storage Accounts, Azure SQL, AKS) join the existing AWS importer — same lazy-SDK, fast-fail-on-credentials, non-fatal-per-service-permission-guard pattern, with security posture captured per resource. pip install 'cloudwright-ai[live-import]'.

The demos above are reproducible: vhs scripts/controls_demo.tape (terminal) and python scripts/record_controls_demo.py against a local web server (template-matched prompt, no API key required).

What's new in v1.4.0

Pulumi exporter (TypeScript + Python). cloudwright export spec.yaml --format pulumi-ts -o ./infra writes a complete Pulumi TypeScript project (index.ts, Pulumi.yaml, package.json, tsconfig.json). --format pulumi-python writes the Python equivalent. AWS, GCP, and Azure coverage matches the Terraform exporter, with the same safe-by-default posture (S3 public-access block + AES256 + versioning, RDS encryption + 7-day backups + deletion protection, EC2 IMDSv2, DynamoDB SSE + PITR, CloudFront TLSv1.2_2021, CloudTrail log-file validation). Aliases pulumi-typescript and pulumi-py also work.
Live AWS import. cloudwright import-live --provider aws --region us-east-1 [--profile NAME] [--services ec2,rds,s3] [-o spec.yaml] walks boto3 describe-* calls (EC2, VPC + subnets + security groups, RDS, S3, Lambda, ECS, EKS, DynamoDB, ALB / NLB, CloudFront, SQS, API Gateway, CloudTrail) and produces an ArchSpec from running infrastructure. Captures security posture (S3 encryption + versioning + public-access-block, RDS multi-AZ + backup retention, EC2 IMDSv2, SG ingress 0.0.0.0/0). Best-effort connection inference: ALB to EC2 via target groups, CloudFront to S3 via origin domains. Per-service permission denials are non-fatal. Optional dep: pip install 'cloudwright-ai[live-import]'.
Two-stage prompting plus boundary-aware spec. Architect.design() now runs Stage 1 (free-text architectural reasoning via Sonnet) followed by Stage 2 (strict JSON projection via Haiku). Stage 2 is told the canonical service keys, allowed connection kinds (sync_request | async_event | stream | replication | batch), and boundary kinds (VPC / subnet / security_group / availability_zone / region / account), so it projects faithfully without redesigning. VPCs, subnets, and SGs are now first-class in the LLM contract. Per-stage usage (stage1, stage2, total_cost_usd, two_stage: true) is exposed on /api/design, /api/modify, and their streaming variants. Single-shot path retained as fallback (Architect(two_stage=False)).
Workload-aware safe defaults. Pre-v1.4, _post_validate forced encryption=true, multi_az=true, backup=true, auto_scaling=true, and count=2 onto every spec, masking Stage 1 reasoning. v1.4 makes these conditional on spec.metadata.workload_profile: sandbox, dev, test, demo, poc keep the LLM's chosen values; production, medium, large, enterprise get safe defaults forced. Compliance frameworks (HIPAA, PCI-DSS, SOC 2, GDPR, FedRAMP, HITRUST, ISO 27001) always force encryption + HA regardless of profile.
GitHub Action for PR previews. Drop-in workflow posts an idempotent comment with architecture diff (added / removed / changed components), monthly cost delta (head vs. base, with annual rollup), and per-framework compliance changes whenever a PR touches *.tf, *.tfstate, cloudwright.yaml, or spec.yaml. Reusable composite action at .github/actions/cloudwright-pr-comment/. See docs/github-action.md.
Refreshed Smart Canvas demo GIF (examples/cloudwright-smart-canvas-demo.gif) showing prompt to diagram to catalog drawer to add resource to side-panel edit to cost recomputation against the current UI. Reproducible via python scripts/record_smart_canvas.py.
Cancel-safe streaming via AsyncAnthropic and AsyncOpenAI. chat/stream and design/stream now use native async clients instead of a threading.Thread + asyncio.Queue bridge. When a client disconnects mid-stream, CancelledError propagates into the SDK's async with block and closes the upstream HTTPX connection, so the LLM call stops billing tokens at disconnect rather than at completion. Eliminates orphan threads and queue-full data loss. Sync send_stream / generate_stream paths are preserved for the CLI; only the web routers switched.

What's new in v1.3.0

Safe-by-default Terraform. S3 public-access blocks, RDS storage_encrypted and deletion_protection, EC2 IMDSv2, security-group ingress restricted to listed CIDRs.
HCL injection-safe escaping. All Terraform exporters (AWS, GCP, Azure, Databricks) escape user-supplied config values before interpolation.
Per-model LLM pricing. Haiku (fast classifier) and Sonnet (designer) billed at their respective rates instead of a single hardcoded rate.
Anthropic prompt caching. Roughly 70-80% input-token savings on chat follow-ups, with measurable latency improvement.
Atomic SessionStore writes. No more session corruption on SIGKILL or disk full mid-write.
Constant-time API key comparison. Web API auth is now timing-attack hardened.
FedRAMP region allowlist. us-east-1 is no longer flagged as FedRAMP-authorized; the validator now matches the actual GovCloud / FedRAMP-authorized region set.
Health and version endpoints. /health returns version, configured model, and catalog status. New /api/version.
Request correlation IDs. Every web request gets an X-Request-Id, plumbed through router logs.
--debug flag works. Used to be a silent no-op; now prints prompts, timing, and token counts.
cloudwright chat --web pinned to port 8765 to match docs and MCP/Slack integrations.
Cost in design responses. /api/design and /api/modify now return cost in the response payload.
Swagger UI gated. /docs is off in production by default; set CLOUDWRIGHT_DOCS_ENABLED=true to expose it.

Compatibility

Python 3.12+
LLM providers: Anthropic (Claude Sonnet, Haiku) and OpenAI (GPT-5+ family). Auto-detected from env.
Clouds: AWS, GCP, Azure, Databricks. 112 service keys total.
Install variants: cloudwright-ai[cli], cloudwright-ai[web], cloudwright-ai-mcp.

Contributing, license, changelog

Contributing guide: CONTRIBUTING.md
License: MIT - see LICENSE
Full release history: CHANGELOG.md

Install Server

A

license - permissive license

A

quality

C

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

1Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Tools

View all tools

cloudwright-mcp