# How It Works
This page explains how a request flows through the system, from
HTTP/gRPC ingress to LLM response.
## Request lifecycle (REST)
```mermaid
sequenceDiagram
    participant C as Client
    participant MW as Middleware Stack
    participant FW as Framework (FastAPI)
    participant F as Genkit Flow
    participant CB as Circuit Breaker
    participant CA as Cache
    participant AI as Gemini API
    C->>MW: POST /tell-joke {"name": "Python"}
    MW->>MW: RequestId (assign X-Request-ID)
    MW->>MW: SecurityHeaders (OWASP headers)
    MW->>MW: MaxBodySize (check Content-Length)
    MW->>MW: RateLimit (token bucket check)
    MW->>FW: Forward to route handler
    FW->>F: call tell_joke(JokeInput)
    F->>CA: get_or_call("tell_joke", input)
    alt Cache hit
        CA-->>F: cached result
    else Cache miss
        CA->>CB: breaker.call(fn)
        alt Circuit closed
            CB->>AI: ai.generate(prompt=...)
            AI-->>CB: LLM response
            CB-->>CA: result
            CA->>CA: store in cache
        else Circuit open
            CB-->>F: CircuitOpenError (503)
        end
    end
    F-->>FW: JokeResponse
    FW-->>MW: HTTP 200 + JSON body
    MW-->>C: Response + security headers
```
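Each middleware step in the diagram is ordinary ASGI middleware. As a rough illustration of the first step, here is a minimal request-ID middleware in Starlette (FastAPI's ASGI base); the class name and details are a sketch, not the project's actual implementation:

```python
import uuid

from starlette.middleware.base import BaseHTTPMiddleware
from starlette.requests import Request

class RequestIdMiddleware(BaseHTTPMiddleware):
    """Sketch: reuse the client's X-Request-ID or assign a fresh one."""

    async def dispatch(self, request: Request, call_next):
        request_id = request.headers.get("X-Request-ID", str(uuid.uuid4()))
        response = await call_next(request)
        response.headers["X-Request-ID"] = request_id
        return response
```

Note that Starlette runs the most recently added middleware first, so to see requests in the order the diagram shows, this middleware would be registered last via `app.add_middleware(RequestIdMiddleware)`.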
## Request lifecycle (gRPC)
```mermaid
sequenceDiagram
    participant C as gRPC Client
    participant I as Interceptors
    participant S as GenkitServiceServicer
    participant F as Genkit Flow
    participant AI as Gemini API
    C->>I: TellJoke(JokeRequest)
    I->>I: GrpcLoggingInterceptor
    I->>I: GrpcRateLimitInterceptor
    I->>S: forward to servicer
    S->>F: call tell_joke(input)
    F->>AI: ai.generate(...)
    AI-->>F: response
    F-->>S: result
    S-->>C: JokeReply
```
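The interceptors are standard `grpc.aio` server interceptors. A minimal logging interceptor might look like this sketch (the project's `GrpcLoggingInterceptor` may differ in detail):

```python
import logging

import grpc

class GrpcLoggingInterceptor(grpc.aio.ServerInterceptor):
    """Sketch: log the RPC method name before handing off to the servicer."""

    async def intercept_service(self, continuation, handler_call_details):
        logging.info("gRPC call: %s", handler_call_details.method)
        return await continuation(handler_call_details)
```

Interceptors are passed at server construction, e.g. `grpc.aio.server(interceptors=[GrpcLoggingInterceptor()])`.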
## Startup sequence
When you run `python -m src`, the following happens:
1. **Parse CLI arguments** (`config.py`)
   - `--port`, `--server`, `--framework`, `--otel-endpoint`, etc.
2. **Load settings** (`config.py`)
   - Environment variables → `.env` files → defaults
3. **Initialize Genkit** (`app_init.py`)
   - Create the `ai = Genkit(...)` singleton
   - Auto-detect the cloud platform for telemetry
   - Load plugins (Google AI, Vertex AI, etc.)
4. **Register flows** (`flows.py`)
   - `@ai.flow()` decorators register all flows
5. **Create resilience singletons** (`main.py`)
   - `FlowCache` with configured TTL and max size
   - `CircuitBreaker` with configured thresholds
6. **Create REST app** (`main.py`)
   - Select the framework (FastAPI/Litestar/Quart)
   - Call the `create_app(ai)` factory
7. **Apply middleware** (`main.py`)
   - Security headers, CORS, body size, request ID, rate limiting
8. **Instrument with OpenTelemetry** (`telemetry.py`)
   - Only if `--otel-endpoint` is set
9. **Start servers** (`main.py`), as sketched below
   - `asyncio.gather(serve_rest(), serve_grpc())`
   - REST on `:8080`, gRPC on `:50051`
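Step 9 boils down to running both servers on a single event loop. A self-contained sketch of the pattern (the real `serve_rest` and `serve_grpc` in `main.py` start the framework's ASGI server and a `grpc.aio` server rather than these placeholders):

```python
import asyncio

async def serve_rest() -> None:
    """Placeholder for the REST server loop (an ASGI server in practice)."""
    await asyncio.Event().wait()  # serve until cancelled

async def serve_grpc() -> None:
    """Placeholder for the grpc.aio server loop."""
    await asyncio.Event().wait()

async def main() -> None:
    # Both servers share one event loop; gather surfaces the first failure.
    await asyncio.gather(serve_rest(), serve_grpc())

if __name__ == "__main__":
    asyncio.run(main())
```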
## Flow execution
Every Genkit flow follows this pattern:
```python
@ai.flow()
async def my_flow(ai: Genkit, input: MyInput) -> MyOutput:
    # 1. Optionally run sub-steps (each creates a trace span)
    cleaned = await ai.run("sanitize", lambda: sanitize(input.text))

    # 2. Call the LLM
    response = await ai.generate(
        model="googleai/gemini-2.0-flash",
        prompt=cleaned,
        output=Output(schema=MyOutput),
    )

    # 3. Return structured output
    return response.output
```
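Flow inputs and outputs are Pydantic models, which is what lets the `output` schema produce validated, structured JSON. For the `tell_joke` flow in the diagrams above, they might look like this (the field names are assumptions based on the example request, not the actual `flows.py` definitions):

```python
from pydantic import BaseModel, Field

class JokeInput(BaseModel):
    name: str = Field(description="Joke topic, e.g. 'Python'")

class JokeResponse(BaseModel):
    joke: str = Field(description="The generated joke text")
```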
The flow is wrapped by the resilience layer in `flows.py`:
1. **Cache check** → return cached result if available
2. **Circuit breaker** → reject if circuit is open
3. **Execute flow** → call the LLM
4. **Record result** → cache the response, update breaker stats
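Steps 2 and 4 describe a classic circuit-breaker state machine. A minimal, self-contained sketch of the idea (the project's `CircuitBreaker` has its own configured thresholds and may differ in detail):

```python
import time
from typing import Any, Awaitable, Callable

class CircuitOpenError(RuntimeError):
    """Raised while the circuit is open; mapped to HTTP 503 upstream."""

class CircuitBreaker:
    def __init__(self, failure_threshold: int = 5, reset_timeout: float = 30.0):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self.failures = 0
        self.opened_at: float | None = None

    async def call(self, fn: Callable[[], Awaitable[Any]]) -> Any:
        # While open, fail fast until the reset timeout elapses.
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.reset_timeout:
                raise CircuitOpenError("circuit open")
            self.opened_at = None  # half-open: allow one trial call
        try:
            result = await fn()
        except Exception:
            self.failures += 1
            if self.failures >= self.failure_threshold:
                self.opened_at = time.monotonic()  # trip the breaker
            raise
        self.failures = 0  # success closes the circuit again
        return result
```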
## Configuration precedence
Settings are resolved in this order (highest priority first):
```
CLI args > Environment vars > .<env>.env file > .env file > Defaults
```
This follows the [12-factor app](https://12factor.net/config)
methodology. Environment-specific files (`.staging.env`,
`.production.env`) layer on top of shared defaults (`.env`).
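As a concrete illustration of the layering (a sketch using `python-dotenv`; the actual `config.py` may resolve settings differently):

```python
import os

from dotenv import dotenv_values  # pip install python-dotenv

def resolve_settings(cli_args: dict, env_name: str = "production") -> dict:
    """Merge settings so that each |= layer overrides the ones before it."""
    settings: dict = {"PORT": "8080"}              # defaults (lowest priority)
    settings |= dotenv_values(".env")              # shared .env file
    settings |= dotenv_values(f".{env_name}.env")  # environment-specific file
    settings |= {k: os.environ[k] for k in settings if k in os.environ}
    settings |= {k: v for k, v in cli_args.items() if v is not None}
    return settings
```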