generate_image
Create images from text prompts with support for variations, character consistency, and real-world grounding. Adjust quality, aspect ratio, and purpose for tailored results.
Instructions
Generate an image from the specified prompt and optional parameters.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | The prompt for image generation (English recommended for optimal structured prompt enhancement) | |
| fileName | No | Custom file name for the output image. Auto-generated if not specified. | |
| inputImagePath | No | Absolute path to a source image for image-to-image generation. Use when generating variations, style transfers, or similar images based on an existing image. Relative paths are not accepted. | |
| blendImages | No | Enable multi-image blending to combine multiple visual elements naturally. Use when the prompt mentions multiple subjects or composite scenes. | |
| maintainCharacterConsistency | No | Maintain consistent character appearance. Enable when generating the same character in different poses or scenes. | |
| useWorldKnowledge | No | Use real-world knowledge for accurate context. Enable for historical figures, landmarks, or factual scenarios | |
| useGoogleSearch | No | Enable Google Search grounding to access real-time web information for factually accurate image generation. Use when the prompt requires current or time-sensitive data that may have changed since the model's knowledge cutoff. Leave disabled for creative, fictional, historical, or timeless content. | |
| aspectRatio | No | Aspect ratio for the generated image | |
| imageSize | No | Image resolution for high-quality output. Specify "1K", "2K", or "4K" when you need specific resolution. Leave unspecified for standard quality. | |
| purpose | No | Intended use for the image (e.g., cookbook cover, social media post, presentation slide). Influences lighting, composition, and detail level to match the context. | |
| quality | No | Quality preset controlling speed/fidelity tradeoff. Only specify when the user explicitly requests a specific quality level; omit to use the server's configured default. "fast": best for drafts and rapid iteration. "balanced": better detail and coherence, moderate latency. "quality": highest fidelity, use for final deliverables where quality matters most. | |
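
The constraints in the table above can be checked on the caller's side before the tool is invoked. The following is a minimal sketch, not part of the tool itself: `build_generate_image_args` is a hypothetical helper, the four "Enable"-style parameters are assumed to be booleans and the rest strings, and how the resulting arguments are sent to the server is left out. The server may apply different or additional validation.

```python
import os
from typing import Any

# Parameter names and allowed values come from the Input Schema table.
# Treating these four flags as booleans is an assumption of this sketch.
BOOLEAN_FLAGS = {
    "blendImages",
    "maintainCharacterConsistency",
    "useWorldKnowledge",
    "useGoogleSearch",
}
STRING_FIELDS = {
    "fileName",
    "inputImagePath",
    "aspectRatio",
    "imageSize",
    "purpose",
    "quality",
}
IMAGE_SIZES = {"1K", "2K", "4K"}
QUALITY_PRESETS = {"fast", "balanced", "quality"}


def build_generate_image_args(prompt: str, **options: Any) -> dict[str, Any]:
    """Hypothetical helper: assemble and sanity-check generate_image arguments."""
    if not isinstance(prompt, str) or not prompt.strip():
        raise ValueError("prompt is required and must be a non-empty string")

    args: dict[str, Any] = {"prompt": prompt}
    for name, value in options.items():
        if value is None:
            continue  # omitted options are left to the server's defaults
        if name in BOOLEAN_FLAGS:
            if not isinstance(value, bool):
                raise ValueError(f"{name} must be a boolean")
        elif name in STRING_FIELDS:
            if not isinstance(value, str):
                raise ValueError(f"{name} must be a string")
        else:
            raise ValueError(f"unknown parameter: {name}")
        args[name] = value

    # Constraints stated in the parameter descriptions.
    if "inputImagePath" in args and not os.path.isabs(args["inputImagePath"]):
        raise ValueError("inputImagePath must be an absolute path")
    if "imageSize" in args and args["imageSize"] not in IMAGE_SIZES:
        raise ValueError('imageSize must be "1K", "2K", or "4K"')
    if "quality" in args and args["quality"] not in QUALITY_PRESETS:
        raise ValueError('quality must be "fast", "balanced", or "quality"')
    return args


# Example: a draft-quality cover image; unspecified options are simply omitted.
example = build_generate_image_args(
    "A watercolor lighthouse at dusk",
    purpose="cookbook cover",
    imageSize="2K",
    quality="fast",
)
print(example)
```

Leaving `quality` and `imageSize` out of the call entirely matches the guidance in the table: both fall back to the server's configured defaults when omitted.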