generate_image
Create images from text descriptions using Google Gemini's AI models. Choose between a fast model for high-volume tasks and an advanced model for professional assets at up to 4K resolution.
Instructions
Generate an image from a text prompt using Google Gemini's image generation.
Models available:
nano-banana (gemini-2.5-flash-image): Fast, efficient, 1024px resolution. Best for high-volume tasks.
nano-banana-pro (gemini-3-pro-image-preview): Advanced, up to 4K resolution, with thinking mode. Best for professional assets.
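The choice between the two models can be sketched as a small lookup. This is an illustrative helper, not part of the tool: `MODELS` and `pick_model` are hypothetical names, and treating nano-banana's 1024px output as equivalent to the `1K` size label is an assumption.

```python
# Aliases and underlying model IDs as listed above.
# Assumption: nano-banana's 1024px output is treated as the "1K" size label.
MODELS = {
    "nano-banana": {
        "model_id": "gemini-2.5-flash-image",
        "sizes": ("1K",),
    },
    "nano-banana-pro": {
        "model_id": "gemini-3-pro-image-preview",
        "sizes": ("1K", "2K", "4K"),
    },
}


def pick_model(image_size: str = "1K") -> str:
    """Return the first (fastest) alias that supports the requested size.

    Hypothetical helper: relies on MODELS listing the faster model first.
    """
    for alias, info in MODELS.items():
        if image_size in info["sizes"]:
            return alias
    raise ValueError(f"unsupported image_size: {image_size!r}")


if __name__ == "__main__":
    for size in ("1K", "2K", "4K"):
        alias = pick_model(size)
        print(size, alias, MODELS[alias]["model_id"])
```

Defaulting to the faster model and switching to nano-banana-pro only when a larger size is requested mirrors the tool's own default of nano-banana.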
Tips for better results:
Describe the scene narratively rather than just listing keywords
Be specific about lighting, camera angles, and styles
Use photography terms for photorealistic images
Specify an aspect ratio that suits your use case
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | The text prompt describing the image to generate. Be descriptive and specific. | |
| model | No | The model to use. nano-banana is faster, nano-banana-pro is higher quality with up to 4K. | nano-banana |
| aspect_ratio | No | The aspect ratio of the generated image. | 1:1 |
| image_size | No | The resolution of the output (only for nano-banana-pro). Options: 1K, 2K, 4K. | 1K |
| filename | No | Optional filename for the output image (without extension). If not provided, a timestamp-based name will be used. | |
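As a rough illustration of the schema above, the following sketch assembles and checks an arguments object before it is handed to the tool. The function name `build_generate_image_args` is hypothetical, and how the arguments are actually sent depends on your client. Because the schema does not list the accepted aspect ratios, `aspect_ratio` is passed through unvalidated.

```python
import json
from typing import Optional

VALID_MODELS = ("nano-banana", "nano-banana-pro")
VALID_SIZES = ("1K", "2K", "4K")  # image_size options from the schema


def build_generate_image_args(
    prompt: str,
    model: str = "nano-banana",
    aspect_ratio: str = "1:1",
    image_size: Optional[str] = None,
    filename: Optional[str] = None,
) -> dict:
    """Build an arguments dict for generate_image (hypothetical helper)."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if model not in VALID_MODELS:
        raise ValueError(f"unknown model: {model!r}")

    # aspect_ratio is not validated: the schema does not list allowed values.
    args = {"prompt": prompt, "model": model, "aspect_ratio": aspect_ratio}

    if image_size is not None:
        # The schema states image_size applies only to nano-banana-pro.
        if model != "nano-banana-pro":
            raise ValueError("image_size is only supported by nano-banana-pro")
        if image_size not in VALID_SIZES:
            raise ValueError(f"image_size must be one of {VALID_SIZES}")
        args["image_size"] = image_size

    if filename is not None:
        # The schema asks for a name without an extension.
        args["filename"] = filename

    return args


if __name__ == "__main__":
    example = build_generate_image_args(
        prompt=(
            "A lighthouse on a rocky coast at dusk, warm light in the lantern "
            "room, long-exposure waves, shot on a 35mm lens"
        ),
        model="nano-banana-pro",
        aspect_ratio="16:9",
        image_size="4K",
        filename="lighthouse_dusk",
    )
    print(json.dumps(example, indent=2))
```

Omitted optional fields are left out of the dict so that the tool's own defaults apply.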