# CodeGuard AI - Architecture Overview v2.0
## π― Vision
CodeGuard AI Γ© um sistema de compliance automatizado de classe enterprise que combina **anΓ‘lise estΓ‘tica local** com **agentes de IA otimizados para custo** para auditoria profunda de conformidade.
---
## ποΈ Arquitetura Antigravity
```
βββββββββββββββββββββββββββββββββββββββ
β Antigravity Orchestrator β
β (Core Intelligence) β
βββββββββββββββββββ¬ββββββββββββββββββββ
β
βββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββββββ
β β β
βΌ βΌ βΌ
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
β Repo Intelligence β β Coding Memory β β Patch Engine β
β Layer β β (RAG per file) β β (Diff Engine) β
βββββββββββββββββββββββ€ βββββββββββββββββββββββ€ βββββββββββββββββββββββ€
β β’ IndexaΓ§Γ£o repo β β β’ Embeddings via β β β’ GeraΓ§Γ£o via β
β β’ Grafo dependΓͺnciasβ β SiliconFlow β β GPT-4o-mini β
β β’ Arquivos sensΓveisβ β β’ Chunking auto β β β’ Backup automΓ‘tico β
β β’ Fluxos de dados β β β’ Busca semΓ’ntica β β β’ DetecΓ§Γ£o conflitosβ
β β’ DetecΓ§Γ£o ciclos β β β’ Cache local β β β’ OrdenaΓ§Γ£o deps β
βββββββββββββββββββββββ βββββββββββββββββββββββ βββββββββββββββββββββββ
β β β
ββββββββββββββββββββββ¬βββββββββββββ΄ββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββ
β LLM Router β
β (Cost Optimizer) β
βββββββββββββββββββββββ€
β Scan: Kimi K2.5 ββββ $0.15/M input, 262k context
β Patch: GPT-4o-mini ββββ $0.60/M output (85% savings)
β Embed: SiliconFlow ββββ $0.01/M (10x cheaper)
β Fallback: OpenRouterββββ +5.5% markup only
βββββββββββββββββββββββ
```
---
## π¦ Components Detail
### 1. Repo Intelligence Layer (`ril.ts`)
**Responsabilidades:**
- Indexar estrutura do repositΓ³rio
- Construir grafo de dependΓͺncias entre arquivos
- Identificar arquivos sensΓveis (auth, payment, PII)
- Analisar fluxos de dados cross-file
- Detectar ciclos de dependΓͺncia
```typescript
const ril = new RepoIntelligence();
const context = await ril.indexRepository('/project');
// context.graph.sensitiveFiles = ['auth.ts', 'payment.ts']
// context.graph.cycles = [[file1, file2, file1]]
```
### 2. Coding Memory (`memory.ts`)
**Responsabilidades:**
- Gerar embeddings via SiliconFlow (10x mais barato que OpenAI)
- Chunking inteligente com overlap para arquivos grandes
- Busca semΓ’ntica por similaridade de coseno
- Cache local + persistΓͺncia em Supabase
```typescript
const memory = new CodingMemory();
await memory.initialize(context);
const result = await memory.query({
query: 'authentication LGPD compliance',
threshold: 0.7
});
// result.contextString β cΓ³digo relevante
```
### 3. Patch Engine (`patch.ts`)
**Responsabilidades:**
- Gerar correΓ§Γ΅es via GPT-4o-mini (85% mais barato que Haiku)
- Backup automΓ‘tico antes de aplicar
- DetecΓ§Γ£o de conflitos entre patches
- OrdenaΓ§Γ£o por dependΓͺncia (bottom-up)
- Rollback em caso de falha
```typescript
const patcher = new PatchEngine();
const patch = await patcher.generatePatch(violation, ragContext);
// patch.confidence = 0.95
const result = await patcher.apply(patch);
// result.backupPath = '.codeguard/backups/file.ts.2026-01-29.bak'
```
### 4. LLM Router (`llm-router.ts`)
**Responsabilidades:**
- Roteamento inteligente por tipo de task
- Fallback automΓ‘tico se provider falhar
- Tracking de custos em tempo real
- Alertas de limite mensal
```typescript
const router = getLLMRouter();
const decision = router.route('scan');
// decision.provider = 'kimi'
// decision.estimatedCost = 0.0003
```
---
## π° Cost Strategy
| Task | Provider | Cost | Reason |
|------|----------|------|--------|
| **Scan** | Kimi K2.5 | $0.15/M in | 262k context window |
| **Patch** | GPT-4o-mini | $0.60/M out | 85% cheaper than Haiku |
| **Embeddings** | SiliconFlow | ~$0.01/M | 10x cheaper than OpenAI |
| **Explain** | Kimi K2.5 | $0.15/M in | Long context for analysis |
| **Fallback** | OpenRouter | +5.5% | Universal gateway |
**Monthly Projection (100k scans):** ~$284 (63% savings vs traditional)
> π **Full documentation:** [`docs/LLM_API_STRATEGY.md`](docs/LLM_API_STRATEGY.md)
---
## π Data Flow: Deep Compliance Audit
```
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 1. User triggers "Run Deep Compliance Audit" β
βββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 2. RepoIntelligence.indexRepository() β
β β Scans all files, builds dependency graph β
β β Identifies sensitive files (auth, payment, PII) β
βββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 3. CodingMemory.initialize() β
β β Chunks files, generates embeddings via SiliconFlow β
β β Stores in cache + Supabase β
βββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 4. LLMRouter.route('scan') β
β β Selects Kimi K2.5 (262k context) β
β β Estimates cost: ~$0.0003/batch β
βββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 5. ComplianceOrchestrator.runAudit() β
β β Sends batches to LLM via AIClient β
β β Compares against GDPR/LGPD/OWASP rules β
β β Tracks tokens + cost in LLMRouter β
βββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 6. PatchEngine.generatePatch() [for fixable violations] β
β β Uses GPT-4o-mini (85% cheaper than Haiku) β
β β Creates backup before applying β
βββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 7. Report Dashboard β
β β Aggregates results, shows cost summary β
β β Exports to HTML/JSON/PDF β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
---
## π Credit System (Pay-Per-Use)
1. **Check**: `get_credits` RPC before AI operation
2. **Authorize**: Block if balance <= 0
3. **Execute**: AI operation via LLMRouter
4. **Deduct**: `use_credits` RPC (server-side authoritative)
---
## π Project Structure
```
src/
βββ core/ # LLM routing and config
β βββ llm-config.ts # Provider pricing + routing rules
β βββ llm-router.ts # Intelligent router with fallback
βββ intelligence/ # Antigravity core
β βββ ril.ts # Repo Intelligence Layer
β βββ memory.ts # Coding Memory (RAG)
β βββ patch.ts # Patch Engine
β βββ orchestrator.ts # Main orchestration
β βββ ai_client.ts # AI provider client
β βββ batcher.ts # Context batching
β βββ frameworks.ts # Compliance frameworks
βββ dashboard/ # Cost management (NEW)
β βββ cost-analytics.ts # Real-time metrics
βββ alerts/ # Budget monitoring (NEW)
β βββ budget-alerts.ts # Multi-channel notifications
βββ optimization/ # Smart routing (NEW)
β βββ smart-router.ts # Circuit breaker + failover
βββ scanner/ # Regex-based static analysis
β βββ lgpd.ts
β βββ gdpr.ts
β βββ pci.ts
β βββ owasp.ts
βββ credits/ # Credit system
βββ report/ # Report generation
βββ supabase/ # Database client
βββ ui/ # Webviews
βββ extension.ts # VS Code entry point
docs/
βββ LLM_API_STRATEGY.md # π LLM cost strategy (REFERENCE)
βββ openapi.yaml # API specification
starter/ # Platform starter templates
βββ templates/
β βββ lovable/
β βββ vercel/
β βββ bolt/
βββ sdk/ # Universal SDK
βββ packages/ # CLI tools
```
---
## π° Cost Management Stack
```
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Cost Management System β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ€
β β
β βββββββββββββββββββ βββββββββββββββββββ βββββββββββββββββββ β
β β CostAnalytics β β BudgetAlerts β β SmartRouter β β
β β (Dashboard) β β (Notifications)β β (Failover) β β
β βββββββββββββββββββ€ βββββββββββββββββββ€ βββββββββββββββββββ€ β
β β β’ Real-time β β β’ 50/80/95% β β β’ Circuit β β
β β metrics β β thresholds β β breaker β β
β β β’ Per-provider β β β’ Slack/Email β β β’ Auto retry β β
β β breakdown β β β’ Spike detect β β β’ Priority β β
β β β’ Savings calc β β β’ Forecasting β β scoring β β
β β β’ CSV export β β β’ Cooldown β β β’ Health check β β
β βββββββββββββββββββ βββββββββββββββββββ βββββββββββββββββββ β
β β β
β βΌ β
β βββββββββββββββββββββββ β
β β LLMRouter β β
β β (llm-router.ts) β β
β βββββββββββββββββββββββ β
β β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
```
---
## π Environment Variables
```bash
# Required for LLM
KIMI_API_KEY=sk-xxx # Scan + Explain
OPENAI_API_KEY=sk-xxx # Patch
SILICONFLOW_API_KEY=sf-xxx # Embeddings
# Optional
OPENROUTER_API_KEY=sk-or-xxx # Fallback only
# Supabase
SUPABASE_URL=https://xxx.supabase.co
SUPABASE_KEY=eyJ...
```
---
## π Metrics
The system tracks:
- **Cost per operation**: Automatically via LLMRouter
- **Monthly spend**: With 80% limit alerts
- **Provider distribution**: For optimization
- **Token usage**: By task type
```typescript
const stats = orchestrator.getUsageStats();
// {
// monthlySpend: 284.50,
// byProvider: { kimi: 0.45, openai: 0.12, siliconflow: 0.05 },
// savings: 1.55 (vs OpenAI-only)
// }
```
---
**Author:** CodeGuard AI Team
**Version:** 2.0
**Last Updated:** January 2026