de en es ja ko ru zh

Genkit MCP

Official

by firebase

Overview Schema Related Servers Score Discussions

TypeScript

Hybrid

genkit
py
engdoc
parity-analysis

roadmap.md•51.7 KiB

# Python Genkit Parity Roadmap This document organizes the identified gaps into executable milestones with dependency relationships. --- ## Current Status (Updated 2026-01-30) > [!IMPORTANT] > **Overall Parity: ~99% Complete** - Nearly all milestones done! ### Completed Milestones ✅ | Milestone | Status | Notes | |-----------|--------|-------| | **M0: Foundation** | ✅ Complete | DevUI config, latency_ms, docs context | | **M1: Core APIs** | ✅ Complete | check_operation, run, current_context, dynamic_tool | | **M2: Sessions** | ✅ Complete | SessionStore, create/load_session, chat API | | **M3: Plugin Parity** | ✅ Complete | Anthropic ThinkingConfig, Google apiVersion/baseUrl | | **M4: Telemetry** | ✅ Complete | RealtimeSpanProcessor, flushTracing, AdjustingTraceExporter, GCP parity | | **M5: Advanced** | ✅ Complete | embed_many ✅, define_simple_retriever ✅, define_background_model ✅ | | **M6: Media Models** | ✅ Complete | Veo, Lyria, TTS, Gemini Image models | ### Remaining Work | Priority | Task | Effort | Status | |----------|------|--------|--------| | **P0** | Testing Infrastructure (`genkit.testing`) | S | ✅ Complete | | **P0** | Context Caching (google-genai) | M | ✅ Complete | | **P1** | `define_background_model()` | M | ✅ Complete | | **P1** | Veo support in google-genai plugin | M | ✅ Complete | | **P1** | TTS (Text-to-Speech) models | S | ✅ Complete | | **P1** | Gemini Image models | S | ✅ Complete | | **P1** | Lyria audio generation (Vertex AI) | S | ✅ Complete | | **P1** | Live/Realtime API | L | ❌ Not Started | | **P2** | Multi-agent sample | M | ❌ Not Started | | **P2** | MCP sample | M | ❌ Not Started | --- ## Remaining Gaps (Prioritized) > [!NOTE] > Most original gaps have been addressed. These are the remaining items. | Gap | Description | Priority | Status | |-----|-------------|----------|--------| | **Testing Infrastructure** | JS has `echoModel`, `ProgrammableModel`, `TestAction` for unit testing. | **P0** | ✅ Complete | | **Context Caching** | `ai.cacheContent()`, `cachedContent` option in generate | **P0** | ✅ Complete | | **define_background_model** | Core API for background models (Veo, etc.) | **P1** | ✅ Complete | | **Veo plugin support** | Add `veo.py` to google-genai plugin (JS has `veo.ts`) | **P1** | ✅ Complete | | **TTS models** | Text-to-speech Gemini models (gemini-*-tts) | **P1** | ✅ Complete | | **Gemini Image models** | Native image generation (gemini-*-image) | **P1** | ✅ Complete | | **Lyria audio generation** | Audio generation via Vertex AI (lyria-002) | **P1** | ✅ Complete | | **Live/Realtime API** | Google GenAI Live API for real-time streaming | **P1** | ❌ Not Started | | **CLI/Tooling Parity** | `genkit` CLI commands and Python project behavior | Medium | ⚠️ Mostly Working | | **Error Types** | Python error hierarchy parity check | Low | ⚠️ Needs Review | | **Auth/Security Patterns** | Auth context flow through actions | Medium | ⚠️ Needs Review | | **Performance Benchmarks** | Streaming latency, memory usage | Low | ❌ Not Started | | **Migration Guide** | Documentation for JS to Python migration | Low | ❌ Not Started | --- ## M8: Additional Model Provider Plugins (2026-02-01) > [!NOTE] > New model provider plugins to expand Genkit's ecosystem. ### Completed Plugins | Plugin | Description | Status | Notes | |--------|-------------|--------|-------| | **Mistral AI** | Native Mistral API integration | ✅ Complete | mistral-large, mistral-small, codestral, pixtral | | **Hugging Face** | HF Inference API integration | ✅ Complete | Access to 1M+ models, inference providers | ### Plugin Details #### Mistral AI Plugin (`genkit-plugin-mistral`) - **Environment Variable**: `MISTRAL_API_KEY` - **Supported Models**: mistral-large-latest, mistral-small-latest, codestral-latest, pixtral-large-latest, ministral-8b/3b - **Features**: Chat completion, streaming, code generation - **Sample**: `samples/mistral-hello` #### Hugging Face Plugin (`genkit-plugin-huggingface`) - **Environment Variable**: `HF_TOKEN` - **Supported Models**: Any model on huggingface.co (1M+ models) - **Popular Models**: meta-llama/Llama-3.3-70B-Instruct, Qwen/Qwen2.5-72B-Instruct, google/gemma-2-27b-it - **Features**: Chat completion, streaming, inference providers (Cerebras, Groq, Together) - **Sample**: `samples/huggingface-hello` ### Future Plugins (Planned) | Plugin | Priority | Rationale | |--------|----------|-----------| | **OpenRouter** | P3 | Multi-provider routing, already usable via `compat-oai` | | **Cohere** | P3 | Enterprise NLP, RAG-focused models | | **AI21** | P3 | Specialized language models | --- ## Background Actions/Operations Audit (2026-01-30) > [!NOTE] > Detailed comparison between JS (canonical) and Python implementations. ### Function Parity | Function | JS Location | Python Location | Status | |----------|-------------|-----------------|--------| | `checkOperation()` | `js/ai/src/check-operation.ts` | `py/.../blocks/background_model.py` | ✅ Complete | | `lookupBackgroundAction()` | `js/core/src/background-action.ts` | `py/.../blocks/background_model.py` | ✅ Complete | | `backgroundAction()` | `js/core/src/background-action.ts` | N/A (uses `define_background_model`) | ✅ Equivalent | | `defineBackgroundAction()` | `js/core/src/background-action.ts` | `define_background_model()` | ✅ Complete | | `registerBackgroundAction()` | `js/core/src/background-action.ts` | Internal in `define_background_model` | ✅ Complete | ### Type/Interface Parity | Type | JS Definition | Python Definition | Status | |------|---------------|-------------------|--------| | `Operation<T>` | `action?, id, done?, output?, error?, metadata?` | Same fields via Pydantic | ✅ Complete | | `BackgroundAction<I,O>` | `startAction, checkAction, cancelAction?, supportsCancel` | Same structure | ✅ Complete | | `BackgroundActionRunOptions` | `context?, telemetryLabels?` | Implicit via ActionRunContext | ✅ Complete | | `BackgroundActionParams` | Full config object | Split into function parameters | ✅ Complete | ### Error Message Parity | Scenario | JS Error | Python Error | Status | |----------|----------|--------------|--------| | Missing action field | `"Provided operation is missing original request information"` | Same | ✅ Exact match | | Action not found | `"Failed to resolve background action from original request: ${action}"` | Same | ✅ Exact match | | Cancel not supported | `"${name} does not support cancellation."` | Returns operation unchanged (JS-compatible) | ✅ Complete | ### Action Key Format Parity | Action Type | JS Key Format | Python Key Format | Status | |-------------|---------------|-------------------|--------| | Start | `/{actionType}/{name}` | Same | ✅ Match | | Check | `/check-operation/{name}/check` | Same | ✅ Match | | Cancel | `/cancel-operation/{name}/cancel` | Same | ✅ Match | ### Flow Comparison **JS `checkOperation` flow:** 1. Validate `operation.action` exists 2. Call `registry.lookupBackgroundAction(key)` 3. Throw if not found 4. Call `backgroundAction.check(operation)` 5. Return updated operation **Python `check_operation` flow:** 1. Validate `operation.action` exists ✅ 2. Call `lookup_background_action(registry, key)` ✅ 3. Raise if not found ✅ 4. Call `background_action.check(operation)` ✅ 5. Return updated operation ✅ ### Registry Integration | Feature | JS | Python | Status | |---------|-----|--------|--------| | Actions registered separately (start/check/cancel) | ✅ | ✅ | ✅ Match | | Lookup by action key | ✅ | ✅ | ✅ Match | | `supportsCancel` property | ✅ | ✅ | ✅ Match | | Action metadata preserved | ✅ | ✅ | ✅ Match | ### Media Models Implementation | Model | JS Support | Python Support | Status | |-------|------------|----------------|--------| | Veo (video) | `veo.ts` | `veo.py` | ✅ Complete | | Lyria (audio) | `lyria.ts` | `lyria.py` | ✅ Complete | | TTS (speech) | In `gemini.ts` | In `gemini.py` | ✅ Complete | | Gemini Image | In `gemini.ts` | In `gemini.py` | ✅ Complete | | Imagen | `imagen.ts` | `imagen.py` | ✅ Complete | ### Sample Parity | Sample | JS | Python | Status | |--------|-----|--------|--------| | Media models demo | Various | `media-models-demo/` | ✅ Complete | | Background model example | In plugin samples | Integrated in media demo | ✅ Complete | ### Remaining Minor Gaps | Gap | Priority | Status | |-----|----------|--------| | `plugin.model()` factory pattern | Low | ⚠️ Different pattern (direct import) | | Zod schema validation | N/A | Pydantic equivalent | ✅ Complete | --- ### Phase 1 Tasks ✅ COMPLETE (2026-01-26) **1. Testing Infrastructure (`genkit.testing` module)** ✅ - Location: `py/packages/genkit/src/genkit/testing.py` - Implemented: - `EchoModel` / `define_echo_model()` - Returns input as output for testing - `ProgrammableModel` / `define_programmable_model()` - Configurable responses - `StaticResponseModel` / `define_static_response_model()` - Fixed responses - Streaming support with countdown ("3", "2", "1") - Request tracking (`last_request`, `request_count`) **2. Context Caching (google-genai plugin)** ✅ - Location: `py/plugins/google-genai/src/.../models/context_caching/` - Already implemented: - `_retrieve_cached_content()` - Cache lookup/creation - `_build_messages()` - Extracts cache config from message metadata - Model integration with `cached_content` option - Supported models: gemini-1.5-flash, gemini-1.5-pro, gemini-2.0-flash, etc. **3. Background Models (`define_background_model`)** ✅ (2026-01-27) - Location: `py/packages/genkit/src/genkit/blocks/background_model.py` - Implemented: - `Operation` - Typed operation tracking with start/check/cancel - `BackgroundAction` - Wrapper for background model actions - `define_background_model()` - Registers background model with registry - `lookup_background_action()` - Find registered background models - `ai.check_operation()` - Check operation status - `ai.cancel_operation()` - Cancel in-progress operations - Sample: `py/samples/media-models-demo/` (comprehensive demo of all media models) - Use cases: Video generation (Veo), Image generation (Imagen) **4. Media Generation Models (google-genai plugin)** ✅ (2026-01-27) Implemented full parity with JS for all media generation models: | Model Type | Models | Location | Config Schema | |------------|--------|----------|---------------| | **Veo (Video)** | veo-2.0, veo-3.0, veo-3.0-fast, veo-3.1 | `models/veo.py` | `VeoConfig` | | **Lyria (Audio)** | lyria-002 | `models/lyria.py` | `LyriaConfig` | | **TTS (Speech)** | gemini-*-tts | `models/gemini.py` | `GeminiTtsConfigSchema` | | **Gemini Image** | gemini-*-image | `models/gemini.py` | `GeminiImageConfigSchema` | **Sample:** `py/samples/media-models-demo/` - Comprehensive demo with testing instructions. Helper functions added: - `is_veo_model()` - Detect Veo video models - `is_lyria_model()` - Detect Lyria audio models - `is_tts_model()` - Detect TTS speech models - `is_image_model()` - Detect Gemini image models Example usage: ```python from genkit import Genkit from genkit.plugins.google_genai import GoogleAI, VeoVersion ai = Genkit(plugins=[GoogleAI()]) # TTS (text-to-speech) response = await ai.generate( model='googleai/gemini-2.5-flash-preview-tts', prompt='Hello, welcome to Genkit!', config={'speech_config': {'voice_config': {'prebuilt_voice_config': {'voice_name': 'Kore'}}}} ) # Gemini Image generation response = await ai.generate( model='googleai/gemini-2.5-flash-image', prompt='A serene mountain landscape at sunset', ) # Veo (video) - uses background model pattern veo = ai.lookup_background_model(f'googleai/{VeoVersion.VEO_2_0}') operation = await veo.start(request) while not operation.done: operation = await veo.check(operation) # Lyria (audio) - Vertex AI only response = await ai.generate( model='vertexai/lyria-002', prompt='A peaceful piano melody', ) ``` --- ## Dependency Graph ```mermaid flowchart TD subgraph M0["M0: Foundation (Unblocks Everything)"] A1[DevUI config_schema fix] A2[Model spec compliance: latency_ms] A3[Model spec compliance: docs context] end subgraph M1["M1: Core APIs"] B1[checkOperation API] B2[run step tracing] B3[currentContext API] B4[dynamicTool API] end subgraph M2["M2: Stateful Conversations"] C1[Session Store Interface] C2[createSession/loadSession] C3[chat API] end subgraph M3["M3: Plugin Parity"] D1[Anthropic ThinkingConfig] D2[Anthropic tool_choice/metadata] D3[Google GenAI apiVersion/baseUrl] D4[plugin.model factory pattern] end subgraph M4["M4: Telemetry & Observability"] E1[RealtimeSpanProcessor] E2[flushTracing API] E3[AdjustingTraceExporter] E4[Logging instrumentation] end subgraph M5["M5: Advanced Features"] F1[defineBackgroundModel] F2[MCP Tool Host] F3[embedMany API] F4[defineSimpleRetriever] end subgraph M6["M6: Samples"] S1[Consolidated plugin demos] S2[Chatbot sample] S3[Multi-agent sample] S4[MCP sample] end subgraph M7["M7: Documentation 📝"] Doc1[Session/Chat docs] Doc2[Plugin config docs] Doc3[Telemetry docs] Doc4[MCP docs] Doc5[Sample docs] end %% Feature Dependencies A1 --> D4 A3 --> C3 B1 --> F1 B2 --> E1 C1 --> C2 C2 --> C3 D1 --> D2 E1 --> E2 E2 --> E3 F2 --> S4 C3 --> S2 C3 --> S3 %% Documentation Dependencies (trigger docs after features) C3 --> Doc1 D4 --> Doc2 E3 --> Doc3 F2 --> Doc4 S1 --> Doc5 ``` ### ASCII Dependency Graph ``` ┌─────────────────────────────────────────────────────────────────────────────┐ │ M0: FOUNDATION │ │ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────┐ │ │ │ A1: config_schema│ │ A2: latency_ms │ │ A3: docs context │ │ │ └────────┬─────────┘ └──────────────────┘ └────────┬─────────┘ │ └───────────┼──────────────────────────────────────────┼──────────────────────┘ │ │ ▼ │ ┌───────────────────────────────────────────┐ │ │ M3: PLUGINS │ │ │ ┌──────────────────┐ │ │ │ │ D4: model() │◄────────────────────┼──────────┘ │ │ factory │ │ │ │ └──────────────────┘ │ │ └───────────────────────────────────────────┘ │ │ ┌─────────────────────────────────────────────────────────────────────────────┐ │ M1: CORE APIs │ │ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────┐ │ │ │ B1: checkOp() │ │ B2: run() │ │ B3: context() │ │ │ └────────┬─────────┘ └────────┬─────────┘ └──────────────────┘ │ └───────────┼─────────────────────┼───────────────────────────────────────────┘ │ │ ▼ ▼ ┌───────────────────────┐ ┌───────────────────────┐ │ M5: ADVANCED │ │ M4: TELEMETRY │ │ ┌───────────────────┐ │ │ ┌───────────────────┐ │ │ │ F1: background │ │ │ │ E1: RealtimeSpan │ │ │ │ Model │ │ │ └─────────┬─────────┘ │ │ └─────────┬─────────┘ │ │ ▼ │ │ ▼ │ │ ┌───────────────────┐ │ │ ┌───────────────────┐ │ │ │ E2: flushTracing │ │ │ │ F2: MCP Host │ │ │ └─────────┬─────────┘ │ │ └───────────────────┘ │ │ ▼ │ └───────────────────────┘ │ ┌───────────────────┐ │ │ │ E3: Adjusting │ │ │ └───────────────────┘ │ └───────────────────────┘ ┌─────────────────────────────────────────────────────────────────────────────┐ │ M2: SESSIONS │ │ ┌──────────────────┐ ┌──────────────────┐ ┌──────────────────┐ │ │ │ C1: Store │──────►│ C2: create/load │──────►│ C3: chat() │ │ │ └──────────────────┘ └──────────────────┘ └──────────────────┘ │ │ ▲ │ │ │ │ │ A3: docs ──────────┘ │ └─────────────────────────────────────────────────────────────────────────────┘ ┌─────────────────────────────────────────────────────────────────────────────┐ │ M7: DOCUMENTATION 📝 (After Features) │ │ │ │ C3: chat() ────────► Doc1: Session/Chat docs │ │ D4: model() ───────► Doc2: Plugin config docs │ │ E3: Adjusting ─────► Doc3: Telemetry docs │ │ F2: MCP Host ──────► Doc4: MCP docs │ │ S1: Samples ───────► Doc5: Sample docs │ └─────────────────────────────────────────────────────────────────────────────┘ ``` --- ## Reverse Topological Execution Order > [!IMPORTANT] > Execute tasks starting from **leaves** (no outgoing dependencies) working backwards to **roots**. ### Phase 1: Independent Leaves (Start Here) *These tasks have NO dependencies - can all start in parallel* | ID | Task | Effort | Milestone | |----|------|--------|-----------| | A2 | latency_ms tracking | S | M0 | | B3 | currentContext() | S | M1 | | B4 | dynamicTool() | S | M1 | | C1 | Session Store Interface | M | M2 | | D1 | Anthropic ThinkingConfig | M | M3 | | D3 | Google GenAI apiVersion/baseUrl | S | M3 | | E4 | Logging instrumentation | S | M4 | | F3 | embedMany() | S | M5 | | F4 | defineSimpleRetriever() | S | M5 | | **S1** | **Consolidated plugin demo structure** | **M** | **M6** | | **S5** | **Multimodal input sample** | **S** | **M6** | ### Phase 2: First Dependencies *Unblocked after Phase 1 completes* | ID | Task | Depends On | Effort | |----|------|------------|--------| | A1 | DevUI config_schema | — | S | | B2 | run() step tracing | — | M | | C2 | createSession/loadSession | C1 | L | | D2 | Anthropic tool_choice/metadata | D1 | S | ### Phase 3: Second Dependencies *Unblocked after Phase 2 completes* | ID | Task | Depends On | Effort | |----|------|------------|--------| | A3 | docs context handling | — | M | | B1 | checkOperation() | — | M | | D4 | plugin.model() factory | A1 | M | | E1 | RealtimeSpanProcessor | B2 | M | | **S6** | **DevUI gallery sample** | **A1** | **M** | | **S7** | **Reranker sample** | **—** | **S** | | **S8** | **Eval pipeline sample** | **—** | **M** | ### Phase 4: Third Dependencies | ID | Task | Depends On | Effort | |----|------|------------|--------| | C3 | chat() API | C2, A3 | M | | E2 | flushTracing() | E1 | S | | F1 | defineBackgroundModel() | B1 | L | ### Phase 5: Final Tasks | ID | Task | Depends On | Effort | |----|------|------------|--------| | E3 | AdjustingTraceExporter | E2 | M | | F2 | MCP Tool Host | F1 | L | | **S2** | **Chatbot sample** | **C3** | **L** | | **S3** | **Multi-agent sample** | **C3** | **L** | | **S4** | **MCP integration sample** | **F2** | **M** | ```mermaid flowchart BT subgraph Phase1["Phase 1: Leaves"] A2[latency_ms] B3[currentContext] B4[dynamicTool] C1[Session Store] D1[ThinkingConfig] D3[apiVersion] E4[Logging] F3[embedMany] F4[simpleRetriever] end subgraph Phase2["Phase 2"] A1[config_schema] B2[run tracing] C2[create/loadSession] D2[tool_choice] end subgraph Phase3["Phase 3"] A3[docs context] B1[checkOperation] D4[model factory] E1[RealtimeSpan] end subgraph Phase4["Phase 4"] C3[chat API] E2[flushTracing] F1[backgroundModel] end subgraph Phase5["Phase 5"] E3[Adjusting] F2[MCP Host] end C1 --> C2 D1 --> D2 A1 --> D4 B2 --> E1 C2 --> C3 A3 --> C3 E1 --> E2 B1 --> F1 E2 --> E3 F1 --> F2 ``` ### ASCII Execution Order ``` PHASE 1 (Leaves - Start Here) ═══════════════════════════════════════════════════════════════════════════════ │ A2: latency_ms │ B3: context │ B4: dynamicTool │ C1: Store │ D1: Thinking │ │ D3: apiVersion │ E4: Logging │ F3: embedMany │ F4: simpleRetriever │ ═══════════════════════════════════════════════════════════════════════════════ │ │ │ │ │ ▼ ▼ ▼ ▼ ▼ PHASE 2 ═══════════════════════════════════════════════════════════════════════════════ │ A1: config_schema │ B2: run() tracing │ C2: create/load │ D2: tool_choice │ ═══════════════════════════════════════════════════════════════════════════════ │ │ │ ▼ ▼ ▼ PHASE 3 ═══════════════════════════════════════════════════════════════════════════════ │ A3: docs context │ B1: checkOp() │ D4: model() factory │ E1: RealtimeSpan │ ═══════════════════════════════════════════════════════════════════════════════ │ │ │ ▼ ▼ ▼ PHASE 4 ═══════════════════════════════════════════════════════════════════════════════ │ C3: chat() API │ F1: backgroundModel │ E2: flushTracing │ ═══════════════════════════════════════════════════════════════════════════════ │ │ ▼ ▼ PHASE 5 (Roots - End Here) ═══════════════════════════════════════════════════════════════════════════════ │ F2: MCP Host │ E3: AdjustingExporter │ ═══════════════════════════════════════════════════════════════════════════════ ``` --- ## Milestone Breakdown ### M0: Foundation (Week 1-2) > **Goal:** Fix core issues that block DevUI and model spec compliance | Task | Effort | Unblocks | Files | |------|--------|----------|-------| | **A1: DevUI config_schema fix** | S | Plugin model() factories | `gemini.py`, `models.py` | | **A2: latency_ms tracking** | S | Monitoring dashboards | All model plugins | | **A3: docs context handling** | M | RAG + Chat API | `generate.py` | **Definition of Done:** - [x] Model config shows in DevUI - [x] latency_ms populated in GenerateResponse - [x] `docs` field augments message history --- ### M1: Core APIs (Week 2-3) > **Goal:** Add missing core Genkit API methods | Task | Effort | Unblocks | Files | |------|--------|----------|-------| | **B1: checkOperation()** | M | Background models, Veo | `_aio.py` | | **B2: run() step tracing** | M | Better flow debugging | `_registry.py` | | **B3: currentContext()** | S | Auth in tools/flows | `_registry.py` | | **B4: dynamicTool()** | S | Runtime tool creation | `_registry.py` | **Definition of Done:** - [x] `await ai.check_operation(op)` returns updated Operation - [x] `await ai.run('step', fn)` creates traced sub-span - [x] `ai.current_context()` returns ActionContext - [x] `ai.dynamic_tool(config, fn)` returns unregistered ToolAction --- ### M2: Stateful Conversations (Week 3-5) > **Goal:** Enable multi-turn conversations with history persistence ```mermaid flowchart LR A[Session Store Interface] --> B[createSession] B --> C[loadSession] C --> D[chat API] D --> E[Multi-turn UX] ``` | Task | Effort | Dependencies | Files | |------|--------|--------------|-------| | **C1: Session Store Interface** | M | None | NEW: `session/store.py` | | **C2: createSession/loadSession** | L | C1 | NEW: `session/session.py` | | **C3: chat() API** | M | C2, A3 | `_aio.py` | **Definition of Done:** - [x] `SessionStore` abstract base class with `get/save/delete` - [x] `session = await ai.create_session()` / `ai.load_session(id)` - [x] `response = await session.chat('message')` maintains history - [x] At least one store implementation (in-memory) --- ### M3: Plugin Parity (Week 4-6) > **Goal:** Match JS plugin config schemas and APIs | Task | Effort | Plugin | Files | |------|--------|--------|-------| | **D1: Anthropic ThinkingConfig** | M | anthropic | `models.py`, `plugin.py` | | **D2: Anthropic tool_choice/metadata** | S | anthropic | `models.py` | | **D3: Google GenAI apiVersion/baseUrl** | S | google-genai | `google.py` | | **D4: plugin.model() factory** | M | All | All plugin `__init__.py` | **Definition of Done:** - [x] `config={'thinking': {'enabled': True, 'budgetTokens': 10000}}` works - [x] `tool_choice={'type': 'tool', 'name': 'myTool'}` supported - [x] `GoogleGenAI(api_version='v1beta')` accepted - [ ] `google_ai.model('gemini-2.5-flash')` returns typed reference --- ### M4: Telemetry & Observability (Week 5-7) > **Goal:** Match JS realtime tracing and observability features | Task | Effort | Impact | Files | |------|--------|--------|-------| | **E1: RealtimeSpanProcessor** | M | Live DevUI tracing | NEW: `realtime_processor.py` | | **E2: flushTracing() API** | S | Clean shutdown | `tracing.py` | | **E3: AdjustingTraceExporter** | M | PII redaction | `google_cloud/telemetry/` | | **E4: Logging instrumentation** | S | Log correlation | `google_cloud/telemetry/` | **Definition of Done:** - [x] Spans appear in DevUI as they START (not just on completion) - [x] `GENKIT_ENABLE_REALTIME_TELEMETRY=true` env var supported - [x] `await ai.flush_tracing()` available - [x] Model I/O redacted before Cloud Trace export (via AdjustingTraceExporter) - [x] Logging instrumentation enabled with trace correlation --- ### M5: Advanced Features (Week 7+) > **Goal:** Complete feature parity for advanced use cases | Task | Effort | Use Case | Files | |------|--------|----------|-------| | **F1: defineBackgroundModel()** | L | Veo, Imagen | `_registry.py`, block | | **F2: MCP Tool Host** | L | External tools | NEW: `mcp/host.py` | | **F3: embedMany()** | S | Batch embedding | `_aio.py` | | **F4: defineSimpleRetriever()** | S | Quick RAG setup | `_registry.py` | (Marking done for verified items) - [x] F1: defineBackgroundModel() API - [x] F3: embedMany() API - [x] F4: defineSimpleRetriever() - [x] S2: Chatbot sample (chat-demo) - [x] Background model sample (background-model-demo) --- --- ## Timeline Overview ```mermaid gantt title Python Genkit Parity Roadmap dateFormat YYYY-MM-DD section M0 Foundation DevUI config_schema :a1, 2025-01-27, 3d latency_ms tracking :a2, after a1, 2d docs context handling :a3, after a1, 4d section M1 Core APIs checkOperation :b1, 2025-02-03, 4d run step tracing :b2, after b1, 3d currentContext :b3, after b1, 2d dynamicTool :b4, after b3, 2d section M2 Sessions Session Store :c1, 2025-02-10, 4d create/loadSession :c2, after c1, 5d chat API :c3, after c2, 4d section M3 Plugins Anthropic ThinkingConfig :d1, 2025-02-17, 3d Anthropic extras :d2, after d1, 2d Google GenAI options :d3, 2025-02-17, 2d plugin.model factory :d4, after d3, 4d section M4 Telemetry RealtimeSpanProcessor :e1, 2025-02-24, 4d flushTracing :e2, after e1, 2d AdjustingExporter :e3, after e2, 3d section M5 Advanced defineBackgroundModel :f1, 2025-03-03, 5d MCP Tool Host :f2, after f1, 7d ``` --- ## Effort Legend | Size | Days | Description | |------|------|-------------| | **S** | 1-2 | Simple addition, clear pattern | | **M** | 3-5 | Moderate complexity, some design | | **L** | 5-10 | Large feature, new subsystem | --- ## Quick Wins (Can Start Immediately) These have no dependencies and provide immediate value: 1. ~~**A1: DevUI config_schema** - Uncomment and fix existing code~~ 2. ~~**A2: latency_ms** - Add timing to model wrappers~~ 3. ~~**B3: currentContext()** - Thread-local context access~~ 4. ~~**D3: apiVersion/baseUrl** - Add to plugin options~~ 5. ~~**E2: flushTracing()** - Simple exporter flush~~ --- ## Files Reference | Area | Key Files | |------|-----------| | Core APIs | `py/packages/genkit/src/genkit/ai/_aio.py`, `_registry.py` | | Sessions | NEW: `py/packages/genkit/src/genkit/session/` | | Google GenAI | `py/plugins/google-genai/src/.../models/gemini.py` | | Anthropic | `py/plugins/anthropic/src/.../models.py` | | Telemetry | `py/packages/genkit/src/genkit/core/tracing.py` | | GCP Plugin | `py/plugins/google-cloud/src/.../telemetry/` | --- ## M6: Sample Parity > **Goal:** Match JS sample coverage and consolidate plugin demos See [sample_parity_analysis.md](sample_parity_analysis.md) for full analysis. ### Sample Tasks | ID | Task | Effort | Depends On | Phase | |----|------|--------|------------|-------| | S1 | Consolidated plugin demo structure | M | — | 1 | | S2 | Chatbot sample (like `js-chatbot`) | L | C3 (chat API) | 5 | | S3 | Multi-agent sample (like `js-schoolAgent`) | L | C3 (chat API) | 5 | | S4 | MCP integration sample | M | F2 (MCP Host) | 5 | | S5 | Multimodal input sample | S | — | 1 | | S6 | DevUI gallery sample | M | A1 (config_schema) | 3 | | S7 | Reranker sample | S | Plugin parity | 3 | | S8 | Full eval pipeline sample | M | — | 3 | ### Consolidated Plugin Demo Structure Each plugin should demonstrate the same core features: ``` py/samples/plugin-demos/{plugin}/ ├── 01_basic_generate.py # Simple text generation ├── 02_streaming.py # Streaming response ├── 03_structured_output.py # JSON schema output ├── 04_tool_calling.py # Tool/function calling ├── 05_multimodal.py # Image/audio input (if supported) ├── 06_multi_turn.py # Conversation history ├── 07_system_prompt.py # System instructions ├── 08_middleware.py # Request/response middleware ├── prompts/demo.prompt # Dotprompt example └── main.py # Entry point ``` ### Sample Dependency Graph ```mermaid flowchart TD subgraph Samples["M6: Samples"] S1[Consolidated Structure] S2[Chatbot Sample] S3[Multi-Agent Sample] S4[MCP Sample] S5[Multimodal Sample] S6[DevUI Gallery] S7[Reranker Sample] S8[Eval Pipeline] end C3[chat API] --> S2 C3 --> S3 F2[MCP Host] --> S4 A1[config_schema] --> S6 ``` ### ASCII Sample Dependencies ``` ┌─────────────────────────────────────────────────────────────────────────────┐ │ M6: SAMPLES │ │ │ │ Phase 1 (Independent): │ │ ┌───────────────────┐ ┌───────────────────┐ │ │ │ S1: Consolidated │ │ S5: Multimodal │ │ │ │ Structure │ │ Sample │ │ │ └───────────────────┘ └───────────────────┘ │ │ │ │ Phase 3 (After Foundation): │ │ ┌───────────────────┐ ┌───────────────────┐ ┌───────────────────┐ │ │ │ S6: DevUI Gallery │ │ S7: Reranker │ │ S8: Eval Pipeline │ │ │ └─────────▲─────────┘ └───────────────────┘ └───────────────────┘ │ │ │ │ │ │ (depends on A1) │ │ │ │ Phase 5 (After Chat/MCP): │ │ ┌───────────────────┐ ┌───────────────────┐ ┌───────────────────┐ │ │ │ S2: Chatbot │ │ S3: Multi-Agent │ │ S4: MCP Sample │ │ │ └─────────▲─────────┘ └─────────▲─────────┘ └─────────▲─────────┘ │ │ │ │ │ │ │ └──────────────────────┼──────────────────────┘ │ │ │ │ │ C3: chat() API │ │ │ │ │ F2: MCP Tool Host │ └─────────────────────────────────────────────────────────────────────────────┘ ``` --- ## M7: Documentation (Docsite Updates) > **Goal:** Keep [genkit-ai/docsite](https://github.com/genkit-ai/docsite) updated with Python feature parity > [!WARNING] > After completing each milestone, update the docsite to reflect Python support. ### Documentation Tasks | ID | Task | After Milestone | Priority | |----|------|-----------------|----------| | D1 | Update Session/Chat docs for Python | M2 | P0 | | D2 | Add Python examples to all feature docs | M0-M5 | P1 | | D3 | Document Python plugin config options | M3 | P1 | | D4 | Add Python telemetry setup guide | M4 | P2 | | D5 | Document MCP Python support | M5 (F2) | P2 | | D6 | Add Python sample links | M6 | P2 | | D7 | Python API reference (if applicable) | M1 | P3 | ### Docsite Files Analysis **Repository:** [genkit-ai/docsite](https://github.com/genkit-ai/docsite) **Docs path:** `src/content/docs/docs/` #### Core Feature Docs (Need Python Examples) | File | Size | Python Status | Action | |------|------|---------------|--------| | `chat.mdx` | 18KB | ❌ **JS/Go only** (`supportedLanguages="js go"`) | Add Python after M2 | | `models.mdx` | 78KB | Partial | Add Python config examples | | `flows.mdx` | 39KB | Partial | Verify Python examples current | | `dotprompt.mdx` | 40KB | Partial | Verify Python examples | | `tool-calling.mdx` | 32KB | Partial | Add tool config examples | | `rag.mdx` | 30KB | Partial | Add Python retriever examples | | `evaluation.mdx` | 44KB | Partial | Add Python evaluator examples | | `interrupts.mdx` | 26KB | ? | Check if Python supported | | `agentic-patterns.mdx` | 31KB | ? | Add Python multi-agent examples | | `multi-agent.mdx` | 6KB | ? | Add after M2/Sessions | | `mcp-server.mdx` | 12KB | ? | Add after MCP support | | `model-context-protocol.mdx` | 20KB | ❌ **JS only** | Add after F2 MCP Host | | `context.mdx` | 7KB | ? | Add `currentContext()` after B3 | | `durable-streaming.mdx` | 12KB | ? | Check if applicable to Python | #### Integration/Plugin Docs (Need Python) | File | Python Plugin | Action | |------|---------------|--------| | `integrations/google-genai.mdx` | ✅ Exists | Add config_schema examples | | `integrations/vertex-ai.mdx` | ✅ (in google-genai) | Update with Python | | `integrations/anthropic.mdx` | ✅ Exists | Add ThinkingConfig after D1 | | `integrations/ollama.mdx` | ✅ Exists | Verify examples | | `integrations/openai-compatible.mdx` | ✅ compat-oai | Verify examples | | `integrations/deepseek.mdx` | ✅ Exists | Add/verify Python | | `integrations/xai.mdx` | ✅ Exists | Add/verify Python | | `integrations/google-cloud.mdx` | ✅ Exists | Add telemetry examples after M4 | | `integrations/dev-local-vectorstore.mdx` | ✅ Exists | Add Python examples | | `integrations/cloud-firestore.mdx` | ✅ Exists | Add Python retriever examples | | `integrations/vectorsearch-firestore.mdx` | ✅ Exists | Add Python examples | | `integrations/vectorsearch-bigquery.mdx` | ✅ Exists | Add Python examples | | `integrations/chroma.mdx` | ❌ Missing plugin | Skip until plugin exists | | `integrations/pinecone.mdx` | ❌ Missing plugin | Skip until plugin exists | | `integrations/pgvector.mdx` | ❌ Missing plugin | Skip until plugin exists | #### Detailed Doc Tasks (by Dependency) | Task ID | Docsite File | Change Required | Depends On | |---------|--------------|-----------------|------------| | **Doc-01** | `chat.mdx` | Add `supportedLanguages="js go python"`, add Python tab content | C3 chat API | | **Doc-02** | `chat.mdx` | Python Session/SessionStore examples | C1, C2 | | **Doc-03** | `model-context-protocol.mdx` | Add Python MCP client examples | F2 MCP Host | | **Doc-04** | `mcp-server.mdx` | Add Python MCP server examples | Exists | | **Doc-05** | `integrations/anthropic.mdx` | Add ThinkingConfig Python example | D1 | | **Doc-06** | `integrations/google-genai.mdx` | Add apiVersion/baseUrl examples | D3 | | **Doc-07** | `integrations/google-cloud.mdx` | Add Python telemetry examples | E3 | | **Doc-08** | `context.mdx` | Add `ai.current_context()` Python examples | B3 | | **Doc-09** | `multi-agent.mdx` | Add Python multi-agent examples | C3, S3 | | **Doc-10** | `evaluation.mdx` | Update Python evaluator examples | Exists | | **Doc-11** | `rag.mdx` | Add `define_simple_retriever()` examples | F4 | | **Doc-12** | `models.mdx` | Add `define_background_model()` examples | F1 | ### Docsite Language Component The docsite uses `<LanguageSelector>` and `<LanguageContent>` components: ```mdx <LanguageSelector supportedLanguages="js go python" /> <LanguageContent lang="python">  </LanguageContent> ``` **Key change:** Files with `supportedLanguages="js go"` need `python` added. ### Post-Milestone Checklist ``` After completing each milestone: 1. ✅ Merge code to main 2. ✅ Update CHANGELOG 3. 📝 Identify affected docsite files from table above 4. 📝 Fork genkit-ai/docsite 5. 📝 Add `python` to LanguageSelector 6. 📝 Add <LanguageContent lang="python"> sections 7. 📝 Open PR on genkit-ai/docsite 8. 📝 Update any "JavaScript only" / "JS/Go only" warnings ``` --- ## M8: Additional Model Provider Plugins (Future) > **Goal:** Expand model provider coverage with additional popular platforms ### Hugging Face Plugin Hugging Face offers multiple inference options that would benefit Genkit users: | Service | Description | Use Case | |---------|-------------|----------| | **Serverless Inference API** | Free tier with rate limits, access to 1000s of models | Quick prototyping, testing | | **Inference Providers** | 17+ AI infrastructure partners (Cerebras, Groq, Together, etc.) | Production with provider choice | | **Inference Endpoints** | Dedicated managed infrastructure | High-volume production | **Why a Hugging Face Plugin?** 1. **Massive Model Selection**: Access to 1,000,000+ models on Hugging Face Hub 2. **Provider Flexibility**: Single API to access Cerebras, Groq, Together AI, Replicate, etc. 3. **No Vendor Lock-in**: Switch between providers without code changes 4. **Cost Optimization**: Choose providers based on cost/performance tradeoffs 5. **Open Source Models**: Easy access to Llama, Mistral, Falcon, and other open models **Implementation Plan:** | Task | Effort | Priority | Description | |------|--------|----------|-------------| | HF-1: Core plugin structure | M | P2 | `genkit-plugin-huggingface` package | | HF-2: InferenceClient integration | M | P2 | Use `huggingface_hub.InferenceClient` | | HF-3: Text generation models | M | P2 | Chat completion, text generation | | HF-4: Embedding models | S | P2 | Feature extraction / embeddings | | HF-5: Image generation | S | P3 | Stable Diffusion, FLUX, etc. | | HF-6: Speech models | S | P3 | Whisper (STT), Bark (TTS) | | HF-7: Provider selection | M | P2 | Allow choosing inference provider | **Environment Variables:** | Variable | Required | Description | |----------|----------|-------------| | `HF_TOKEN` | Yes | Hugging Face API token | | `HF_INFERENCE_PROVIDER` | No | Preferred provider (e.g., `cerebras`, `groq`) | **Example Usage (Proposed):** ```python from genkit import Genkit from genkit.plugins.huggingface import HuggingFace ai = Genkit( plugins=[HuggingFace()], model='huggingface/meta-llama/Llama-3.3-70B-Instruct', ) # Use default provider (HF Inference) response = await ai.generate(prompt='Hello!') # Or specify a provider for better performance response = await ai.generate( model='huggingface/meta-llama/Llama-3.3-70B-Instruct', config={'provider': 'groq'}, # Use Groq for fast inference prompt='Hello!', ) ``` **Dependencies:** ```toml [project] dependencies = [ "huggingface_hub>=0.25.0", ] ``` ### OpenRouter Plugin OpenRouter is a **unified API gateway** providing access to 500+ models from 60+ providers (OpenAI, Anthropic, Google, Meta, xAI, DeepSeek, etc.) through a single API. **Current Status:** OpenRouter is **already usable** via the `compat-oai` plugin since it's OpenAI-compatible: ```python from genkit import Genkit from genkit.plugins.compat_oai import OpenAI ai = Genkit( plugins=[OpenAI( api_key=os.getenv('OPENROUTER_API_KEY'), base_url='https://openrouter.ai/api/v1', )], model='openai/anthropic/claude-3.5-sonnet', ) ``` **Why a Dedicated Plugin?** A native OpenRouter plugin would add: | Feature | `compat-oai` | Dedicated Plugin | |---------|--------------|------------------| | Basic chat | ✅ | ✅ | | Auto model discovery | ❌ | ✅ 500+ models | | Provider routing | ❌ Manual headers | ✅ Native config | | Cost optimization | ❌ | ✅ Built-in | | Usage analytics | ❌ | ✅ Exposed | **Implementation (P3 Priority):** | Task | Effort | Description | |------|--------|-------------| | OR-1: Core plugin | S | Use OpenRouter Python SDK | | OR-2: Model registry | M | Fetch models from `/api/v1/models` | | OR-3: Provider routing | S | Expose `provider` config option | | OR-4: Usage stats | S | Expose generation stats | **Environment Variables:** | Variable | Required | Description | |----------|----------|-------------| | `OPENROUTER_API_KEY` | Yes | OpenRouter API key | ### Other Potential Plugins | Plugin | Provider | Priority | Notes | |--------|----------|----------|-------| | **Mistral AI** | mistral.ai | P2 | Direct Mistral API access | | **Together AI** | together.ai | P3 | Fast open model inference | | **Groq** | groq.com | P3 | Ultra-fast LPU inference | | **Replicate** | replicate.com | P3 | Run any model via API | | **Fireworks** | fireworks.ai | P3 | Fast inference platform | | **Cohere** | cohere.com | P3 | Enterprise NLP models | > [!NOTE] > Many of these providers are accessible via: > - **OpenRouter** (unified gateway to 60+ providers) > - **Hugging Face Inference Providers** (17+ providers) > - **compat-oai** (any OpenAI-compatible API) > > Dedicated plugins are only needed when provider-specific features aren't available > through these unified interfaces. --- ## M9: Automated Testing (Future - Low Priority) > **Goal:** Automate sample validation and DevUI E2E testing > [!NOTE] > This milestone is intentionally last. Complete all feature work first. ### Overview Use Playwright (Python) to automate: 1. DevUI E2E tests - verify flows work through the UI 2. Sample validation - run each sample and verify output 3. Regression testing - catch breaking changes ### Testing Tasks | ID | Task | Effort | Description | |----|------|--------|-------------| | T1 | Playwright test infrastructure | M | Set up pytest-playwright, fixtures | | T2 | DevUI flow runner tests | M | Test running flows through DevUI | | T3 | DevUI model config tests | S | Verify config_schema appears in UI | | T4 | Sample smoke tests | L | Run each sample, verify no errors | | T5 | CI integration | M | Add to GitHub Actions workflow | ### Example Test Structure ```python # tests/e2e/test_devui.py import pytest from playwright.async_api import async_playwright @pytest.fixture async def devui_page(): async with async_playwright() as p: browser = await p.chromium.launch() page = await browser.new_page() await page.goto("http://localhost:4000") yield page await browser.close() async def test_flow_list_loads(devui_page): """Verify flow list appears in DevUI.""" await devui_page.wait_for_selector('[data-testid="flow-list"]') flows = await devui_page.query_selector_all('[data-testid="flow-item"]') assert len(flows) > 0 async def test_run_menu_flow(devui_page): """Run menuSuggestionFlow and verify output.""" await devui_page.click('text=menuSuggestionFlow') await devui_page.fill('[data-testid="input"]', '{"theme": "Italian"}') await devui_page.click('[data-testid="run-button"]') output = await devui_page.wait_for_selector('[data-testid="output"]') text = await output.text_content() assert len(text) > 0 async def test_model_config_visible(devui_page): """Verify model config schema appears.""" await devui_page.click('[data-testid="models-tab"]') await devui_page.click('text=gemini-2.0-flash') config = await devui_page.wait_for_selector('[data-testid="config-schema"]') assert "temperature" in await config.text_content() ``` ### CI Workflow Addition ```yaml # .github/workflows/python-e2e.yml name: Python E2E Tests on: push: paths: ['py/**'] jobs: e2e: runs-on: ubuntu-latest steps: - uses: actions/checkout@v4 - uses: actions/setup-python@v5 with: python-version: '3.12' - name: Install dependencies run: | pip install playwright pytest-playwright playwright install chromium - name: Start sample server run: | cd py/samples/menu uv run genkit start & sleep 10 - name: Run E2E tests run: pytest tests/e2e/ -v ``` ### Dependencies ```mermaid flowchart LR All[All M0-M7 Complete] --> T1[Test Infrastructure] T1 --> T2[DevUI Tests] T1 --> T4[Sample Tests] T2 --> T5[CI Integration] T4 --> T5 ```

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/firebase/genkit'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

roadmap.md•51.7 KiB