generate_image
Create images from text prompts using Google's Gemini API, with options for image editing, character consistency, multi-image blending, and factual accuracy.
Instructions
Generates an image from the specified prompt and any optional parameters.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
blendImages | No | Enable multi-image blending to combine multiple visual elements naturally. Use when the prompt mentions multiple subjects or composite scenes. | |
fileName | No | File name for the generated image. If not specified, an auto-named file is generated in IMAGE_OUTPUT_DIR. | |
inputImagePath | No | Absolute path to a source image for image-to-image generation. Use when generating variations, style transfers, or similar images based on an existing image. | |
maintainCharacterConsistency | No | Maintain a consistent character appearance. Enable when generating the same character in different poses or scenes. | |
prompt | Yes | The prompt for image generation (English recommended for optimal structured prompt enhancement) | |
useWorldKnowledge | No | Use real-world knowledge for accurate context. Enable for historical figures, landmarks, or factual scenarios | |
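
As a rough illustration of the schema above, the following Python sketch builds a `tools/call` request for `generate_image` and checks the two constraints the table states: `prompt` is required, and `inputImagePath` must be an absolute path. The helper name `build_generate_image_call`, the assumption that the three flags are booleans, and the example prompt and file names are all hypothetical; the table does not state parameter types, and how the request is sent depends on your MCP client.

```python
import json
import os.path

# Parameter names taken from the Input Schema table above.
ALLOWED_PARAMS = {
    "prompt",
    "fileName",
    "inputImagePath",
    "blendImages",
    "maintainCharacterConsistency",
    "useWorldKnowledge",
}
# Assumption: these three options are boolean switches (the table gives no types).
BOOLEAN_FLAGS = {"blendImages", "maintainCharacterConsistency", "useWorldKnowledge"}


def build_generate_image_call(arguments, request_id=1):
    """Validate arguments and wrap them in an MCP-style tools/call request."""
    unknown = set(arguments) - ALLOWED_PARAMS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    # prompt is the only required parameter.
    prompt = arguments.get("prompt")
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be a non-empty string")

    # inputImagePath, when given, must be an absolute path.
    path = arguments.get("inputImagePath")
    if path is not None and not os.path.isabs(path):
        raise ValueError("inputImagePath must be an absolute path")

    for flag in BOOLEAN_FLAGS:
        if flag in arguments and not isinstance(arguments[flag], bool):
            raise ValueError(f"{flag} must be true or false")

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "generate_image", "arguments": dict(arguments)},
    }


if __name__ == "__main__":
    request = build_generate_image_call(
        {
            "prompt": "The same knight character standing on a castle wall at dusk",
            "inputImagePath": os.path.abspath("knight_reference.png"),  # hypothetical file
            "maintainCharacterConsistency": True,
            "fileName": "knight_castle.png",  # hypothetical output name
        }
    )
    print(json.dumps(request, indent=2))
```

Validating on the client side is optional, since the server presumably enforces its own schema, but it surfaces mistakes such as a relative `inputImagePath` before a generation request is made.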