transform_image
Change an existing image's style or interpretation by providing a reference image and a transformation prompt.
Instructions
Transform an existing image to a new style or interpretation using OpenAI GPT image models. Takes a reference image and a prompt describing the desired transformation. gpt-image-1.5 supports input_fidelity for better face/logo preservation.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | Description of the desired transformation | |
| reference_image_base64 | No | Base64 encoded reference image | |
| reference_image_path | No | Path to reference image file | |
| output_path | No | Output file path (default: transformed_image.png) | |
| model | No | Model to use. gpt-image-2: latest, flexible sizes, input_fidelity is auto-high (field ignored). gpt-image-1.5: supports input_fidelity. gpt-image-1: original. (default: gpt-image-1) | |
| size | No | Image size. gpt-image-1/1.5 only support 1024x1024, 1024x1536, 1536x1024, auto. gpt-image-2 also supports 2K/4K presets plus custom WxH (16px multiples, each edge ≤3840, ratio ≤3:1). (default: auto) | |
| quality | No | Image quality level (default: auto) | |
| output_format | No | Output image format (default: png) | |
| moderation | No | Content moderation level (default: auto) | |
| sample_count | No | Number of images to generate (1-10, default: 1) | |
| return_base64 | No | Return base64 image data in response (default: false) | |
| include_thumbnail | No | Include thumbnail preview in MCP response for LLM recognition (default: false, overrides OPENAI_IMAGE_THUMBNAIL env var) | |
| input_fidelity | No | Input fidelity for preserving faces/logos. gpt-image-1.5 only (gpt-image-2 is always high, gpt-image-1 unsupported). High uses more tokens. (default: low) |