Devpipe MCP Server

ROADMAP.md•4.81 KiB

# devpipe-mcp Roadmap ## v0.2.2 (Current) ✅ - 28 resources (complete documentation + intelligence) - 18 tools (full devpipe lifecycle) - 6 prompts (common workflows) - Intelligence features: flakiness, performance regressions, change correlation - **NEW:** Pipeline health scoring ## v0.3.0 - Advanced Analytics ### High Priority #### 1. `compare_runs` Tool ✅ DONE (v0.2.2) **Purpose:** Compare two pipeline runs to identify changes **Use case:** ``` compare_runs --run1 latest --run2 previous Output: - New failures: go-lint, security-scan - Performance regressions: unit-tests (+50%) - Fixed tasks: integration-tests - Performance improvements: build (-20%) ``` **Implementation:** ✅ Complete - Diff two run.json files - Compare task results, durations, metrics - Identify regressions and improvements - Support "latest" and "previous" shortcuts **Effort:** Low (2-3 hours) - COMPLETED #### 2. `predict_impact` Tool ✅ DONE (v0.2.2) **Purpose:** Predict which tasks will fail based on changed files **Use case:** ``` predict_impact Output: - Critical risk: integration-tests (score: 85) - 3 changed files match watchPaths - High correlation with past failures - 40% recent failure rate - Recommendation: Run high-risk tasks first: integration-tests,security-scan - Suggested command: devpipe --only integration-tests,security-scan ``` **Implementation:** ✅ Complete - Multi-factor risk scoring (watchPaths, correlation, failure rate) - Risk levels: critical (70+), high (50+), medium (30+), low - Actionable recommendations with suggested commands - Uses existing intelligence data **Effort:** Medium (4-5 hours) - COMPLETED ### Medium Priority #### 3. Enhanced `diagnose_failure` ⭐⭐⭐ **Purpose:** Deep failure analysis with pattern matching **Note:** LLMs already do basic diagnosis. This would add: - Known error pattern database - Historical fix suggestions - Automated log parsing **Use case:** ``` diagnose_failure --task go-build Output: - Error type: missing_import - Affected file: main.go:42 - Similar past failures: 2 (both fixed by adding imports) - Suggested fix: Add import "github.com/myproject/database" ``` **Implementation:** - Build error pattern library - Match current errors to patterns - Query historical fixes **Effort:** High (8-10 hours) #### 4. `prioritize_tasks` Tool ⭐⭐⭐ **Purpose:** Optimize task execution order for fast feedback **Use case:** ``` prioritize_tasks Output: 1. go-fmt (30s, 95% failure detection) 2. go-vet (45s, 90% failure detection) 3. unit-tests (2m, 85% failure detection) ... 10. e2e-tests (10m, 60% failure detection) Recommendation: Run tasks 1-5 first (3.5m, 92% coverage) ``` **Implementation:** - Risk scoring algorithm - Time vs value optimization - Dependency-aware ordering **Effort:** High (6-8 hours) ### Low Priority #### 5. `get_intelligent_context` ⭐⭐ **Purpose:** Synthesize all data for debugging **Note:** LLMs already do this naturally. Only add if we want pre-computed summaries. **Effort:** Medium (3-4 hours) ## v0.4.0 - Future Enhancements ### devpipe v0.2.0 Compatibility ✅ DONE (v0.2.2) **Completed changes:** - ✅ `metricsFormat` → `outputType` - ✅ `metricsPath` → `outputPath` - ✅ `artifacts/` → `outputs/` (run folder structure) - ✅ `repoRoot` → `projectRoot` **MCP updates completed:** 1. ✅ Updated `configure-metrics` prompt 2. ✅ Updated type definitions (DevpipeTask) 3. ✅ Updated all documentation 4. ✅ Minimum version now v0.2.0 **Effort:** Low (1-2 hours) - COMPLETED ## Future Considerations ### Machine Learning Features - Failure prediction models - Optimal test selection - Anomaly detection **Blocker:** Need more data and ML infrastructure ### Real-time Monitoring - WebSocket support for live updates - Streaming task output - Progress notifications **Blocker:** MCP protocol limitations ### Team Analytics - Multi-user failure patterns - Team velocity metrics - Collaboration insights **Blocker:** Requires team/org context ## Implementation Strategy ### Phase 1: Quick Wins (v0.3.0) 1. ✅ `get_pipeline_health` (DONE in v0.2.2) 2. `compare_runs` (2-3 hours) 3. `predict_impact` (4-5 hours) **Total:** 1 week ### Phase 2: Advanced Features (v0.3.1) 4. Enhanced `diagnose_failure` (8-10 hours) 5. `prioritize_tasks` (6-8 hours) **Total:** 2 weeks ### Phase 3: Maintenance (v0.4.0) - Devpipe field rename support - Documentation updates - Bug fixes **Total:** Ongoing ## Decision Criteria **Implement if:** - ✅ Uses existing data (no new data sources) - ✅ Provides value LLMs can't replicate - ✅ Solves common pain points - ✅ Low maintenance burden **Defer if:** - ❌ Duplicates LLM capabilities - ❌ Requires ML infrastructure - ❌ Niche use case - ❌ High complexity ## Community Input Want a feature prioritized? Open an issue with: - Use case description - Expected output format - Why LLMs can't do it already

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/drewkhoury/devpipe-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

ROADMAP.md•4.81 KiB