
Genkit MCP

Official

by firebase


roadmap.md•36.6 KiB

# Python Genkit Parity Roadmap

This document organizes the identified gaps into executable milestones with dependency relationships.

---

## Potential Gaps Not Yet Analyzed

> [!NOTE]
> These areas may require future analysis but are not yet covered in detail.

| Gap | Description | Priority |
|-----|-------------|----------|
| **Testing Infrastructure** | JS has `echoModel`, `ProgrammableModel`, `TestAction` for unit testing. Python equivalents need verification. | Medium |
| **CLI/Tooling Parity** | `genkit` CLI commands and their behavior with Python projects (especially DevUI integration) | Medium |
| **Error Types** | JS has `GenkitError`, `ModelError`, `UserFacingError`. Python error hierarchy needs parity check. | Low |
| **Context Caching** | Listed as missing plugin feature but no detailed implementation analysis | High |
| **Auth/Security Patterns** | How auth context flows through actions, middleware patterns for auth | Medium |
| **Performance Benchmarks** | Streaming latency, memory usage, concurrent request handling | Low |
| **Migration Guide** | Documentation for teams moving from JS to Python | Low |
| **Go Parity** | This analysis is JS↔Python only. Go is a third implementation with its own gaps. | Low |
| **Imagen/Veo Details** | Image/video generation model support specifics | Medium |
| **Live/Realtime API** | Google GenAI Live API for real-time streaming | High |

### Quick Notes on Key Gaps

**Testing Infrastructure:**
- JS: `import { echoModel } from '@genkit-ai/ai/testing'`
- Python: Needs `genkit.testing` module with equivalent utilities

**Context Caching:**
- JS: `ai.cacheContent()`, `cachedContent` option in generate
- Python: Missing entirely from google-genai plugin

**Live/Realtime API:**
- JS: `ai.generate({ model: 'googleai/gemini-live' })` with real-time streaming
- Python: Not implemented

---

## Dependency Graph

```mermaid
flowchart TD
    subgraph M0["M0: Foundation (Unblocks Everything)"]
        A1[DevUI config_schema fix]
        A2[Model spec compliance: latency_ms]
        A3[Model spec compliance: docs context]
    end

    subgraph M1["M1: Core APIs"]
        B1[checkOperation API]
        B2[run step tracing]
        B3[currentContext API]
        B4[dynamicTool API]
    end

    subgraph M2["M2: Stateful Conversations"]
        C1[Session Store Interface]
        C2[createSession/loadSession]
        C3[chat API]
    end

    subgraph M3["M3: Plugin Parity"]
        D1[Anthropic ThinkingConfig]
        D2[Anthropic tool_choice/metadata]
        D3[Google GenAI apiVersion/baseUrl]
        D4[plugin.model factory pattern]
    end

    subgraph M4["M4: Telemetry & Observability"]
        E1[RealtimeSpanProcessor]
        E2[flushTracing API]
        E3[AdjustingTraceExporter]
        E4[Logging instrumentation]
    end

    subgraph M5["M5: Advanced Features"]
        F1[defineBackgroundModel]
        F2[MCP Tool Host]
        F3[embedMany API]
        F4[defineSimpleRetriever]
    end

    subgraph M6["M6: Samples"]
        S1[Consolidated plugin demos]
        S2[Chatbot sample]
        S3[Multi-agent sample]
        S4[MCP sample]
    end

    subgraph M7["M7: Documentation 📝"]
        Doc1[Session/Chat docs]
        Doc2[Plugin config docs]
        Doc3[Telemetry docs]
        Doc4[MCP docs]
        Doc5[Sample docs]
    end

    %% Feature Dependencies
    A1 --> D4
    A3 --> C3
    B1 --> F1
    B2 --> E1
    C1 --> C2
    C2 --> C3
    D1 --> D2
    E1 --> E2
    E2 --> E3
    F2 --> S4
    C3 --> S2
    C3 --> S3

    %% Documentation Dependencies (trigger docs after features)
    C3 --> Doc1
    D4 --> Doc2
    E3 --> Doc3
    F2 --> Doc4
    S1 --> Doc5
```

### ASCII Dependency Graph

```
┌─────────────────────────────────────────────────────────────────────────────┐
│                               M0: FOUNDATION                                │
│  ┌──────────────────┐  ┌──────────────────┐  ┌──────────────────┐           │
│  │ A1: config_schema│  │ A2: latency_ms   │  │ A3: docs context │           │
│  └────────┬─────────┘  └──────────────────┘  └────────┬─────────┘           │
└───────────┼───────────────────────────────────────────┼─────────────────────┘
            │                                           │
            ▼                                           │
┌───────────────────────────────────────────┐           │
│                M3: PLUGINS                │           │
│  ┌──────────────────┐                     │           │
│  │ D4: model()      │◄────────────────────┼───────────┘
│  │    factory       │                     │
│  └──────────────────┘                     │
└───────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────────────────┐
│                                M1: CORE APIs                                │
│  ┌──────────────────┐  ┌──────────────────┐  ┌──────────────────┐           │
│  │ B1: checkOp()    │  │ B2: run()        │  │ B3: context()    │           │
│  └────────┬─────────┘  └────────┬─────────┘  └──────────────────┘           │
└───────────┼─────────────────────┼───────────────────────────────────────────┘
            │                     │
            ▼                     ▼
┌───────────────────────┐ ┌───────────────────────┐
│     M5: ADVANCED      │ │     M4: TELEMETRY     │
│ ┌───────────────────┐ │ │ ┌───────────────────┐ │
│ │ F1: background    │ │ │ │ E1: RealtimeSpan  │ │
│ │     Model         │ │ │ └─────────┬─────────┘ │
│ └─────────┬─────────┘ │ │           ▼           │
│           ▼           │ │ ┌───────────────────┐ │
│ ┌───────────────────┐ │ │ │ E2: flushTracing  │ │
│ │ F2: MCP Host      │ │ │ └─────────┬─────────┘ │
│ └───────────────────┘ │ │           ▼           │
└───────────────────────┘ │ ┌───────────────────┐ │
                          │ │ E3: Adjusting     │ │
                          │ └───────────────────┘ │
                          └───────────────────────┘

┌─────────────────────────────────────────────────────────────────────────────┐
│                                M2: SESSIONS                                 │
│  ┌──────────────────┐       ┌──────────────────┐       ┌──────────────────┐ │
│  │ C1: Store        │──────►│ C2: create/load  │──────►│ C3: chat()       │ │
│  └──────────────────┘       └──────────────────┘       └──────────────────┘ │
│                                                                 ▲           │
│                                                                 │           │
│                                              A3: docs ──────────┘           │
└─────────────────────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────────────────┐
│                    M7: DOCUMENTATION 📝 (After Features)                    │
│                                                                             │
│  C3: chat() ────────► Doc1: Session/Chat docs                               │
│  D4: model() ───────► Doc2: Plugin config docs                              │
│  E3: Adjusting ─────► Doc3: Telemetry docs                                  │
│  F2: MCP Host ──────► Doc4: MCP docs                                        │
│  S1: Samples ───────► Doc5: Sample docs                                     │
└─────────────────────────────────────────────────────────────────────────────┘
```

---

## Reverse Topological Execution Order

> [!IMPORTANT]
> Execute tasks starting from **leaves** (no outgoing dependencies) working backwards to **roots**.

### Phase 1: Independent Leaves (Start Here)

*These tasks have NO dependencies - can all start in parallel*

| ID | Task | Effort | Milestone |
|----|------|--------|-----------|
| A2 | latency_ms tracking | S | M0 |
| B3 | currentContext() | S | M1 |
| B4 | dynamicTool() | S | M1 |
| C1 | Session Store Interface | M | M2 |
| D1 | Anthropic ThinkingConfig | M | M3 |
| D3 | Google GenAI apiVersion/baseUrl | S | M3 |
| E4 | Logging instrumentation | S | M4 |
| F3 | embedMany() | S | M5 |
| F4 | defineSimpleRetriever() | S | M5 |
| **S1** | **Consolidated plugin demo structure** | **M** | **M6** |
| **S5** | **Multimodal input sample** | **S** | **M6** |

### Phase 2: First Dependencies

*Unblocked after Phase 1 completes*

| ID | Task | Depends On | Effort |
|----|------|------------|--------|
| A1 | DevUI config_schema | — | S |
| B2 | run() step tracing | — | M |
| C2 | createSession/loadSession | C1 | L |
| D2 | Anthropic tool_choice/metadata | D1 | S |

### Phase 3: Second Dependencies

*Unblocked after Phase 2 completes*

| ID | Task | Depends On | Effort |
|----|------|------------|--------|
| A3 | docs context handling | — | M |
| B1 | checkOperation() | — | M |
| D4 | plugin.model() factory | A1 | M |
| E1 | RealtimeSpanProcessor | B2 | M |
| **S6** | **DevUI gallery sample** | **A1** | **M** |
| **S7** | **Reranker sample** | **—** | **S** |
| **S8** | **Eval pipeline sample** | **—** | **M** |

### Phase 4: Third Dependencies

| ID | Task | Depends On | Effort |
|----|------|------------|--------|
| C3 | chat() API | C2, A3 | M |
| E2 | flushTracing() | E1 | S |
| F1 | defineBackgroundModel() | B1 | L |

### Phase 5: Final Tasks

| ID | Task | Depends On | Effort |
|----|------|------------|--------|
| E3 | AdjustingTraceExporter | E2 | M |
| F2 | MCP Tool Host | F1 | L |
| **S2** | **Chatbot sample** | **C3** | **L** |
| **S3** | **Multi-agent sample** | **C3** | **L** |
| **S4** | **MCP integration sample** | **F2** | **M** |

```mermaid
flowchart BT
    subgraph Phase1["Phase 1: Leaves"]
        A2[latency_ms]
        B3[currentContext]
        B4[dynamicTool]
        C1[Session Store]
        D1[ThinkingConfig]
        D3[apiVersion]
        E4[Logging]
        F3[embedMany]
        F4[simpleRetriever]
    end

    subgraph Phase2["Phase 2"]
        A1[config_schema]
        B2[run tracing]
        C2[create/loadSession]
        D2[tool_choice]
    end

    subgraph Phase3["Phase 3"]
        A3[docs context]
        B1[checkOperation]
        D4[model factory]
        E1[RealtimeSpan]
    end

    subgraph Phase4["Phase 4"]
        C3[chat API]
        E2[flushTracing]
        F1[backgroundModel]
    end

    subgraph Phase5["Phase 5"]
        E3[Adjusting]
        F2[MCP Host]
    end

    C1 --> C2
    D1 --> D2
    A1 --> D4
    B2 --> E1
    C2 --> C3
    A3 --> C3
    E1 --> E2
    B1 --> F1
    E2 --> E3
    F1 --> F2
```

### ASCII Execution Order

```
PHASE 1 (Leaves - Start Here)
═══════════════════════════════════════════════════════════════════════════════
│ A2: latency_ms │ B3: context │ B4: dynamicTool │ C1: Store │ D1: Thinking │
│ D3: apiVersion │ E4: Logging │ F3: embedMany │ F4: simpleRetriever │
═══════════════════════════════════════════════════════════════════════════════
        │               │               │               │               │
        ▼               ▼               ▼               ▼               ▼
PHASE 2
═══════════════════════════════════════════════════════════════════════════════
│ A1: config_schema │ B2: run() tracing │ C2: create/load │ D2: tool_choice │
═══════════════════════════════════════════════════════════════════════════════
        │                       │                       │
        ▼                       ▼                       ▼
PHASE 3
═══════════════════════════════════════════════════════════════════════════════
│ A3: docs context │ B1: checkOp() │ D4: model() factory │ E1: RealtimeSpan │
═══════════════════════════════════════════════════════════════════════════════
        │                       │                       │
        ▼                       ▼                       ▼
PHASE 4
═══════════════════════════════════════════════════════════════════════════════
│ C3: chat() API │ F1: backgroundModel │ E2: flushTracing │
═══════════════════════════════════════════════════════════════════════════════
        │                       │
        ▼                       ▼
PHASE 5 (Roots - End Here)
═══════════════════════════════════════════════════════════════════════════════
│ F2: MCP Host │ E3: AdjustingExporter │
═══════════════════════════════════════════════════════════════════════════════
```

---

## Milestone Breakdown

### M0: Foundation (Week 1-2)

> **Goal:** Fix core issues that block DevUI and model spec compliance

| Task | Effort | Unblocks | Files |
|------|--------|----------|-------|
| **A1: DevUI config_schema fix** | S | Plugin model() factories | `gemini.py`, `models.py` |
| **A2: latency_ms tracking** | S | Monitoring dashboards | All model plugins |
| **A3: docs context handling** | M | RAG + Chat API | `generate.py` |

**Definition of Done:**
- [ ] Model config shows in DevUI
- [ ] latency_ms populated in GenerateResponse
- [ ] `docs` field augments message history

---

### M1: Core APIs (Week 2-3)

> **Goal:** Add missing core Genkit API methods

| Task | Effort | Unblocks | Files |
|------|--------|----------|-------|
| **B1: checkOperation()** | M | Background models, Veo | `_aio.py` |
| **B2: run() step tracing** | M | Better flow debugging | `_registry.py` |
| **B3: currentContext()** | S | Auth in tools/flows | `_registry.py` |
| **B4: dynamicTool()** | S | Runtime tool creation | `_registry.py` |

**Definition of Done:**
- [ ] `await ai.check_operation(op)` returns updated Operation
- [ ] `await ai.run('step', fn)` creates traced sub-span
- [ ] `ai.current_context()` returns ActionContext
- [ ] `ai.dynamic_tool(config, fn)` returns unregistered ToolAction

---

### M2: Stateful Conversations (Week 3-5)

> **Goal:** Enable multi-turn conversations with history persistence

```mermaid
flowchart LR
    A[Session Store Interface] --> B[createSession]
    B --> C[loadSession]
    C --> D[chat API]
    D --> E[Multi-turn UX]
```

| Task | Effort | Dependencies | Files |
|------|--------|--------------|-------|
| **C1: Session Store Interface** | M | None | NEW: `session/store.py` |
| **C2: createSession/loadSession** | L | C1 | NEW: `session/session.py` |
| **C3: chat() API** | M | C2, A3 | `_aio.py` |

**Definition of Done:**
- [ ] `SessionStore` abstract base class with `get/save/delete`
- [ ] `session = await ai.create_session()` / `ai.load_session(id)`
- [ ] `response = await session.chat('message')` maintains history
- [ ] At least one store implementation (in-memory)

---

### M3: Plugin Parity (Week 4-6)

> **Goal:** Match JS plugin config schemas and APIs

| Task | Effort | Plugin | Files |
|------|--------|--------|-------|
| **D1: Anthropic ThinkingConfig** | M | anthropic | `models.py`, `plugin.py` |
| **D2: Anthropic tool_choice/metadata** | S | anthropic | `models.py` |
| **D3: Google GenAI apiVersion/baseUrl** | S | google-genai | `google.py` |
| **D4: plugin.model() factory** | M | All | All plugin `__init__.py` |

**Definition of Done:**
- [ ] `config={'thinking': {'enabled': True, 'budgetTokens': 10000}}` works
- [ ] `tool_choice={'type': 'tool', 'name': 'myTool'}` supported
- [ ] `GoogleGenAI(api_version='v1beta')` accepted
- [ ] `google_ai.model('gemini-2.5-flash')` returns typed reference

---

### M4: Telemetry & Observability (Week 5-7)

> **Goal:** Match JS realtime tracing and observability features

| Task | Effort | Impact | Files |
|------|--------|--------|-------|
| **E1: RealtimeSpanProcessor** | M | Live DevUI tracing | NEW: `realtime_processor.py` |
| **E2: flushTracing() API** | S | Clean shutdown | `tracing.py` |
| **E3: AdjustingTraceExporter** | M | PII redaction | `google_cloud/telemetry/` |
| **E4: Logging instrumentation** | S | Log correlation | `google_cloud/telemetry/` |

**Definition of Done:**
- [ ] Spans appear in DevUI as they START (not just on completion)
- [ ] `GENKIT_ENABLE_REALTIME_TELEMETRY=true` env var supported
- [ ] `await ai.flush_tracing()` available
- [ ] Model I/O redacted before Cloud Trace export

---

### M5: Advanced Features (Week 7+)

> **Goal:** Complete feature parity for advanced use cases

| Task | Effort | Use Case | Files |
|------|--------|----------|-------|
| **F1: defineBackgroundModel()** | L | Veo, Imagen | `_registry.py`, block |
| **F2: MCP Tool Host** | L | External tools | NEW: `mcp/host.py` |
| **F3: embedMany()** | S | Batch embedding | `_aio.py` |
| **F4: defineSimpleRetriever()** | S | Quick RAG setup | `_registry.py` |

---

## Timeline Overview

```mermaid
gantt
    title Python Genkit Parity Roadmap
    dateFormat YYYY-MM-DD

    section M0 Foundation
    DevUI config_schema      :a1, 2025-01-27, 3d
    latency_ms tracking      :a2, after a1, 2d
    docs context handling    :a3, after a1, 4d

    section M1 Core APIs
    checkOperation           :b1, 2025-02-03, 4d
    run step tracing         :b2, after b1, 3d
    currentContext           :b3, after b1, 2d
    dynamicTool              :b4, after b3, 2d

    section M2 Sessions
    Session Store            :c1, 2025-02-10, 4d
    create/loadSession       :c2, after c1, 5d
    chat API                 :c3, after c2, 4d

    section M3 Plugins
    Anthropic ThinkingConfig :d1, 2025-02-17, 3d
    Anthropic extras         :d2, after d1, 2d
    Google GenAI options     :d3, 2025-02-17, 2d
    plugin.model factory     :d4, after d3, 4d

    section M4 Telemetry
    RealtimeSpanProcessor    :e1, 2025-02-24, 4d
    flushTracing             :e2, after e1, 2d
    AdjustingExporter        :e3, after e2, 3d

    section M5 Advanced
    defineBackgroundModel    :f1, 2025-03-03, 5d
    MCP Tool Host            :f2, after f1, 7d
```

---

## Effort Legend

| Size | Days | Description |
|------|------|-------------|
| **S** | 1-2 | Simple addition, clear pattern |
| **M** | 3-5 | Moderate complexity, some design |
| **L** | 5-10 | Large feature, new subsystem |

---

## Quick Wins (Can Start Immediately)

These have no dependencies and provide immediate value:

1. **A1: DevUI config_schema** - Uncomment and fix existing code
2. **A2: latency_ms** - Add timing to model wrappers
3. **B3: currentContext()** - Thread-local context access
4. **D3: apiVersion/baseUrl** - Add to plugin options
5. **E2: flushTracing()** - Simple exporter flush

---

## Files Reference

| Area | Key Files |
|------|-----------|
| Core APIs | `py/packages/genkit/src/genkit/ai/_aio.py`, `_registry.py` |
| Sessions | NEW: `py/packages/genkit/src/genkit/session/` |
| Google GenAI | `py/plugins/google-genai/src/.../models/gemini.py` |
| Anthropic | `py/plugins/anthropic/src/.../models.py` |
| Telemetry | `py/packages/genkit/src/genkit/core/tracing.py` |
| GCP Plugin | `py/plugins/google-cloud/src/.../telemetry/` |

---

## M6: Sample Parity

> **Goal:** Match JS sample coverage and consolidate plugin demos

See [sample_parity_analysis.md](sample_parity_analysis.md) for full analysis.

### Sample Tasks

| ID | Task | Effort | Depends On | Phase |
|----|------|--------|------------|-------|
| S1 | Consolidated plugin demo structure | M | — | 1 |
| S2 | Chatbot sample (like `js-chatbot`) | L | C3 (chat API) | 5 |
| S3 | Multi-agent sample (like `js-schoolAgent`) | L | C3 (chat API) | 5 |
| S4 | MCP integration sample | M | F2 (MCP Host) | 5 |
| S5 | Multimodal input sample | S | — | 1 |
| S6 | DevUI gallery sample | M | A1 (config_schema) | 3 |
| S7 | Reranker sample | S | Plugin parity | 3 |
| S8 | Full eval pipeline sample | M | — | 3 |

### Consolidated Plugin Demo Structure

Each plugin should demonstrate the same core features:

```
py/samples/plugin-demos/{plugin}/
├── 01_basic_generate.py      # Simple text generation
├── 02_streaming.py           # Streaming response
├── 03_structured_output.py   # JSON schema output
├── 04_tool_calling.py        # Tool/function calling
├── 05_multimodal.py          # Image/audio input (if supported)
├── 06_multi_turn.py          # Conversation history
├── 07_system_prompt.py       # System instructions
├── 08_middleware.py          # Request/response middleware
├── prompts/demo.prompt       # Dotprompt example
└── main.py                   # Entry point
```

### Sample Dependency Graph

```mermaid
flowchart TD
    subgraph Samples["M6: Samples"]
        S1[Consolidated Structure]
        S2[Chatbot Sample]
        S3[Multi-Agent Sample]
        S4[MCP Sample]
        S5[Multimodal Sample]
        S6[DevUI Gallery]
        S7[Reranker Sample]
        S8[Eval Pipeline]
    end

    C3[chat API] --> S2
    C3 --> S3
    F2[MCP Host] --> S4
    A1[config_schema] --> S6
```

### ASCII Sample Dependencies

```
┌─────────────────────────────────────────────────────────────────────────────┐
│                                 M6: SAMPLES                                 │
│                                                                             │
│  Phase 1 (Independent):                                                     │
│  ┌───────────────────┐  ┌───────────────────┐                               │
│  │ S1: Consolidated  │  │ S5: Multimodal    │                               │
│  │     Structure     │  │     Sample        │                               │
│  └───────────────────┘  └───────────────────┘                               │
│                                                                             │
│  Phase 3 (After Foundation):                                                │
│  ┌───────────────────┐  ┌───────────────────┐  ┌───────────────────┐        │
│  │ S6: DevUI Gallery │  │ S7: Reranker      │  │ S8: Eval Pipeline │        │
│  └─────────▲─────────┘  └───────────────────┘  └───────────────────┘        │
│            │                                                                │
│     (depends on A1)                                                         │
│                                                                             │
│  Phase 5 (After Chat/MCP):                                                  │
│  ┌───────────────────┐  ┌───────────────────┐  ┌───────────────────┐        │
│  │ S2: Chatbot       │  │ S3: Multi-Agent   │  │ S4: MCP Sample    │        │
│  └─────────▲─────────┘  └─────────▲─────────┘  └─────────▲─────────┘        │
│            │                      │                      │                  │
│            └──────────────────────┼──────────────────────┘                  │
│                                   │                                         │
│                            C3: chat() API                                   │
│                                   │                                         │
│                           F2: MCP Tool Host                                 │
└─────────────────────────────────────────────────────────────────────────────┘
```

---

## M7: Documentation (Docsite Updates)

> **Goal:** Keep [genkit-ai/docsite](https://github.com/genkit-ai/docsite) updated with Python feature parity

> [!WARNING]
> After completing each milestone, update the docsite to reflect Python support.
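
One way to find the docsite pages that still need a Python update is to scan the `.mdx` files for a `supportedLanguages` attribute that omits `python`, such as the `supportedLanguages="js go"` value this roadmap reports for `chat.mdx`. The following is a minimal sketch, not part of the docsite tooling: the helper names `lacks_language` and `pages_to_update` are hypothetical, and it assumes the attribute is always written with double quotes and space-separated language codes.

```python
import re
from pathlib import Path

# Assumed attribute form, as quoted in this roadmap: supportedLanguages="js go"
SUPPORTED_LANGUAGES = re.compile(r'supportedLanguages="([^"]*)"')


def lacks_language(mdx_text: str, language: str = "python") -> bool:
    """Return True if any supportedLanguages attribute in the text omits `language`."""
    return any(
        language not in match.group(1).split()
        for match in SUPPORTED_LANGUAGES.finditer(mdx_text)
    )


def pages_to_update(docs_root: str, language: str = "python") -> list[Path]:
    """List .mdx files under docs_root whose language selector omits `language`."""
    return [
        path
        for path in sorted(Path(docs_root).rglob("*.mdx"))
        if lacks_language(path.read_text(encoding="utf-8"), language)
    ]
```

With a local checkout of the docsite, `pages_to_update("src/content/docs/docs")` would return the candidate files. Pages that have no language selector at all are not reported, so pages marked "?" in a status table still need a manual check.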
### Documentation Tasks

| ID | Task | After Milestone | Priority |
|----|------|-----------------|----------|
| D1 | Update Session/Chat docs for Python | M2 | P0 |
| D2 | Add Python examples to all feature docs | M0-M5 | P1 |
| D3 | Document Python plugin config options | M3 | P1 |
| D4 | Add Python telemetry setup guide | M4 | P2 |
| D5 | Document MCP Python support | M5 (F2) | P2 |
| D6 | Add Python sample links | M6 | P2 |
| D7 | Python API reference (if applicable) | M1 | P3 |

### Docsite Files Analysis

**Repository:** [genkit-ai/docsite](https://github.com/genkit-ai/docsite)

**Docs path:** `src/content/docs/docs/`

#### Core Feature Docs (Need Python Examples)

| File | Size | Python Status | Action |
|------|------|---------------|--------|
| `chat.mdx` | 18KB | ❌ **JS/Go only** (`supportedLanguages="js go"`) | Add Python after M2 |
| `models.mdx` | 78KB | Partial | Add Python config examples |
| `flows.mdx` | 39KB | Partial | Verify Python examples current |
| `dotprompt.mdx` | 40KB | Partial | Verify Python examples |
| `tool-calling.mdx` | 32KB | Partial | Add tool config examples |
| `rag.mdx` | 30KB | Partial | Add Python retriever examples |
| `evaluation.mdx` | 44KB | Partial | Add Python evaluator examples |
| `interrupts.mdx` | 26KB | ? | Check if Python supported |
| `agentic-patterns.mdx` | 31KB | ? | Add Python multi-agent examples |
| `multi-agent.mdx` | 6KB | ? | Add after M2/Sessions |
| `mcp-server.mdx` | 12KB | ? | Add after MCP support |
| `model-context-protocol.mdx` | 20KB | ❌ **JS only** | Add after F2 MCP Host |
| `context.mdx` | 7KB | ? | Add `currentContext()` after B3 |
| `durable-streaming.mdx` | 12KB | ? | Check if applicable to Python |

#### Integration/Plugin Docs (Need Python)

| File | Python Plugin | Action |
|------|---------------|--------|
| `integrations/google-genai.mdx` | ✅ Exists | Add config_schema examples |
| `integrations/vertex-ai.mdx` | ✅ (in google-genai) | Update with Python |
| `integrations/anthropic.mdx` | ✅ Exists | Add ThinkingConfig after D1 |
| `integrations/ollama.mdx` | ✅ Exists | Verify examples |
| `integrations/openai-compatible.mdx` | ✅ compat-oai | Verify examples |
| `integrations/deepseek.mdx` | ✅ Exists | Add/verify Python |
| `integrations/xai.mdx` | ✅ Exists | Add/verify Python |
| `integrations/google-cloud.mdx` | ✅ Exists | Add telemetry examples after M4 |
| `integrations/dev-local-vectorstore.mdx` | ✅ Exists | Add Python examples |
| `integrations/cloud-firestore.mdx` | ✅ Exists | Add Python retriever examples |
| `integrations/vectorsearch-firestore.mdx` | ✅ Exists | Add Python examples |
| `integrations/vectorsearch-bigquery.mdx` | ✅ Exists | Add Python examples |
| `integrations/chroma.mdx` | ❌ Missing plugin | Skip until plugin exists |
| `integrations/pinecone.mdx` | ❌ Missing plugin | Skip until plugin exists |
| `integrations/pgvector.mdx` | ❌ Missing plugin | Skip until plugin exists |

#### Detailed Doc Tasks (by Dependency)

| Task ID | Docsite File | Change Required | Depends On |
|---------|--------------|-----------------|------------|
| **Doc-01** | `chat.mdx` | Add `supportedLanguages="js go python"`, add Python tab content | C3 chat API |
| **Doc-02** | `chat.mdx` | Python Session/SessionStore examples | C1, C2 |
| **Doc-03** | `model-context-protocol.mdx` | Add Python MCP client examples | F2 MCP Host |
| **Doc-04** | `mcp-server.mdx` | Add Python MCP server examples | Exists |
| **Doc-05** | `integrations/anthropic.mdx` | Add ThinkingConfig Python example | D1 |
| **Doc-06** | `integrations/google-genai.mdx` | Add apiVersion/baseUrl examples | D3 |
| **Doc-07** | `integrations/google-cloud.mdx` | Add Python telemetry examples | E3 |
| **Doc-08** | `context.mdx` | Add `ai.current_context()` Python examples | B3 |
| **Doc-09** | `multi-agent.mdx` | Add Python multi-agent examples | C3, S3 |
| **Doc-10** | `evaluation.mdx` | Update Python evaluator examples | Exists |
| **Doc-11** | `rag.mdx` | Add `define_simple_retriever()` examples | F4 |
| **Doc-12** | `models.mdx` | Add `define_background_model()` examples | F1 |

### Docsite Language Component

The docsite uses `<LanguageSelector>` and `<LanguageContent>` components:

```mdx
<LanguageSelector supportedLanguages="js go python" />

<LanguageContent lang="python">

</LanguageContent>
```

**Key change:** Files with `supportedLanguages="js go"` need `python` added.

### Post-Milestone Checklist

```
After completing each milestone:

1. ✅ Merge code to main
2. ✅ Update CHANGELOG
3. 📝 Identify affected docsite files from table above
4. 📝 Fork genkit-ai/docsite
5. 📝 Add `python` to LanguageSelector
6. 📝 Add <LanguageContent lang="python"> sections
7. 📝 Open PR on genkit-ai/docsite
8. 📝 Update any "JavaScript only" / "JS/Go only" warnings
```

---

## M8: Automated Testing (Future - Low Priority)

> **Goal:** Automate sample validation and DevUI E2E testing

> [!NOTE]
> This milestone is intentionally last. Complete all feature work first.

### Overview

Use Playwright (Python) to automate:

1. DevUI E2E tests - verify flows work through the UI
2. Sample validation - run each sample and verify output
3. Regression testing - catch breaking changes

### Testing Tasks

| ID | Task | Effort | Description |
|----|------|--------|-------------|
| T1 | Playwright test infrastructure | M | Set up pytest-playwright, fixtures |
| T2 | DevUI flow runner tests | M | Test running flows through DevUI |
| T3 | DevUI model config tests | S | Verify config_schema appears in UI |
| T4 | Sample smoke tests | L | Run each sample, verify no errors |
| T5 | CI integration | M | Add to GitHub Actions workflow |

### Example Test Structure

```python
# tests/e2e/test_devui.py
import pytest
import pytest_asyncio
from playwright.async_api import async_playwright

# Async tests and fixtures need an async pytest plugin; this sketch uses pytest-asyncio.
pytestmark = pytest.mark.asyncio


@pytest_asyncio.fixture
async def devui_page():
    """Open the DevUI in a fresh Chromium page and close the browser afterwards."""
    async with async_playwright() as p:
        browser = await p.chromium.launch()
        page = await browser.new_page()
        await page.goto("http://localhost:4000")
        yield page
        await browser.close()


async def test_flow_list_loads(devui_page):
    """Verify flow list appears in DevUI."""
    await devui_page.wait_for_selector('[data-testid="flow-list"]')
    flows = await devui_page.query_selector_all('[data-testid="flow-item"]')
    assert len(flows) > 0


async def test_run_menu_flow(devui_page):
    """Run menuSuggestionFlow and verify output."""
    await devui_page.click('text=menuSuggestionFlow')
    await devui_page.fill('[data-testid="input"]', '{"theme": "Italian"}')
    await devui_page.click('[data-testid="run-button"]')
    output = await devui_page.wait_for_selector('[data-testid="output"]')
    text = await output.text_content()
    assert text  # text_content() may return None, so check truthiness


async def test_model_config_visible(devui_page):
    """Verify model config schema appears."""
    await devui_page.click('[data-testid="models-tab"]')
    await devui_page.click('text=gemini-2.0-flash')
    config = await devui_page.wait_for_selector('[data-testid="config-schema"]')
    assert "temperature" in await config.text_content()
```

### CI Workflow Addition

```yaml
# .github/workflows/python-e2e.yml
name: Python E2E Tests

on:
  push:
    paths: ['py/**']

jobs:
  e2e:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: '3.12'
      - name: Install dependencies
        run: |
          pip install playwright pytest-playwright pytest-asyncio
          playwright install chromium
      - name: Start sample server
        run: |
          cd py/samples/menu
          uv run genkit start &
          sleep 10
      - name: Run E2E tests
        run: pytest tests/e2e/ -v
```

### Dependencies

```mermaid
flowchart LR
    All[All M0-M7 Complete] --> T1[Test Infrastructure]
    T1 --> T2[DevUI Tests]
    T1 --> T4[Sample Tests]
    T2 --> T5[CI Integration]
    T4 --> T5
```
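
The phase groupings used throughout this roadmap can be derived mechanically from dependency edges. The sketch below does this for the M8 edges shown above, using the standard library's `graphlib.TopologicalSorter`. It is an illustration, not the method used to produce the phase tables: the `execution_phases` helper and the `M0-M7` node label are assumptions, and T3 is left out because the graph above does not include it.

```python
from graphlib import TopologicalSorter

# Each task maps to the set of tasks it depends on (edges taken from the M8 graph).
# "M0-M7" is a stand-in label for "All M0-M7 Complete".
M8_DEPENDS_ON = {
    "T1": {"M0-M7"},
    "T2": {"T1"},
    "T4": {"T1"},
    "T5": {"T2", "T4"},
}


def execution_phases(depends_on):
    """Group tasks into phases; every task in a phase has all prerequisites done."""
    sorter = TopologicalSorter(depends_on)
    sorter.prepare()  # raises graphlib.CycleError if the graph contains a cycle
    phases = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # tasks whose prerequisites are complete
        phases.append(ready)
        sorter.done(*ready)  # mark the whole phase finished to unblock the next one
    return phases


for number, tasks in enumerate(execution_phases(M8_DEPENDS_ON), start=1):
    print(f"Phase {number}: {', '.join(tasks)}")
# Phase 1: M0-M7
# Phase 2: T1
# Phase 3: T2, T4
# Phase 4: T5
```

Feeding the feature edges from the earlier dependency graphs into the same helper would give a quick consistency check against the hand-written phase tables.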
