post-prompt-editing-inferences

Edit images using AI by applying text prompts to modify content, adjust styles, or transform visual elements.

Instructions

Edit an image with a prompt.

Input Schema

- originalAssets (optional): If set to true, returns the original asset without transformation.
- dryRun (optional)
- image (required): The image to edit. Must reference an existing AssetId or be a data URL.
- referenceImages (optional)
- seed (optional): The seed to use for the image generation. Only available for the `flux-kontext` model.
- strength (optional): The strength. Only available for the `flux-kontext` LoRA model.
- guidanceScale (optional): The guidance scale to use for the image generation. Only available for the `flux-kontext` model.
- modelId (optional): The model to use. Can be "gemini-2.0-flash", "gemini-2.5-flash", "gpt-image-1", "flux-kontext", "runway-gen4-image", or "seedream-4".
- format (optional): The format of the generated image(s). This parameter is only supported for the `gpt-image-1` model.
- aspectRatio (optional): The aspect ratio of the generated image(s). Supported for `gemini-2.5-flash`, `gpt-image-1`, `flux-kontext`, `runway-gen4-image`, and `seedream-4`; defaults to `auto` for other models and unknown ratios. `gemini-2.5-flash` supports landscape (21:9, 16:9, 4:3, 3:2), square (1:1), portrait (9:16, 3:4, 2:3), flexible (5:4, 4:5), and `auto`. `gpt-image-1` supports `1:1`, `3:2`, `2:3`, and `auto` (unknown ratios fall back to `auto`). `runway-gen4-image` supports `1:1`, `4:3`, `3:4`, `16:9`, `9:16`, and `auto` (unknown ratios fall back to `auto`). `seedream-4` supports `1:1`, `4:3`, `3:4`, `16:9`, `9:16`, `2:3`, `3:2`, `21:9`, and `auto`.
- inputFidelity (optional): When set to `high`, better preserves details from the input images in the output. This is especially useful when the input images contain elements like faces or logos that require accurate preservation in the generated image. You can provide multiple input images that will all be preserved with high fidelity, but the first image is preserved with richer textures and finer details, so consider placing elements such as faces in the first image. Only available for the `gpt-image-1` model.
- quality (optional): The quality of the generated image(s). Only available for the `gpt-image-1`, `flux-kontext`, `runway-gen4-image`, and `seedream-4` models.
- concepts (optional)
- numSamples (optional): The number of samples to generate. The maximum depends on the subscription tier.
- compression (optional): The compression level (0-100%) for the generated images. Only supported for the `gpt-image-1` model with the `webp` or `jpeg` output formats; defaults to 100.
- prompt (required): The prompt to edit the given image.
- mask (optional): A mask indicating where the image should be edited. The black area of the mask will be replaced, while the filled areas will be kept as is. Must reference an existing AssetId or be a data URL. Only available for the `gpt-image-1` model; ignored for other models.
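
For orientation, the following is a minimal sketch of an arguments object for this tool, assembled from the schema above. The asset IDs and parameter values are placeholders, and only an illustrative subset of the optional parameters is shown; `image` and `prompt` are the only required fields.

```json
{
  "image": "asset_1234567890",
  "prompt": "Replace the background with a sunset over the ocean",
  "mask": "asset_0987654321",
  "modelId": "gpt-image-1",
  "aspectRatio": "3:2",
  "inputFidelity": "high",
  "numSamples": 1
}
```

Note the model-specific constraints documented above: `mask` and `inputFidelity` are honored only by `gpt-image-1`; `seed`, `strength`, and `guidanceScale` apply only to `flux-kontext`; and an unsupported `aspectRatio` falls back to `auto`.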
Behavior 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden of behavioral disclosure. It mentions editing an image but does not clarify critical behaviors: whether this is a generative AI operation, whether it modifies the original asset or creates a new one, and whether rate limits, authentication requirements, or particular output formats apply. The description is too minimal to inform safe and effective use, especially for a complex tool with 17 parameters.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is extremely concise: a single sentence with no wasted words. It is front-loaded and to the point, though this brevity contributes to the shortcomings noted in the other dimensions. The single sentence earns its place by stating the core action, but it omits necessary detail.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (17 parameters, no annotations, no output schema), the description is incomplete. It fails to explain the tool's purpose in context, its behavioral traits, or its usage scenarios. While the schema covers the parameters, the description adds nothing that would help an agent understand when or how to use this tool effectively, making it insufficient for such a multifaceted operation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is high at 82%, so the schema already documents most parameters well. The description adds no parameter-specific information beyond the schema: it does not explain key parameters like 'prompt', 'image', or the model-specific options. However, with high schema coverage the baseline score is 3, since the description neither compensates for the schema's gaps nor detracts from the existing documentation.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose 2/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description 'Edit an image with a prompt' states the basic action but is vague and tautological—it essentially restates the tool name 'post-prompt-editing-inferences' without specifying what kind of editing occurs (e.g., generative AI-based transformation, inpainting, style transfer) or distinguishing it from sibling tools like 'post-img2img-inferences' or 'post-inpaint-inferences'. It lacks specificity about the resource being edited (an image asset) and the mechanism (prompt-driven AI editing).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines 1/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives. With many sibling tools for image manipulation (e.g., 'post-img2img-inferences', 'post-inpaint-inferences', 'post-generative-fill-inferences'), there is no indication of this tool's specific role, prerequisites, or exclusions. It fails to help an agent choose appropriately among similar tools.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/pasie15/scenario.com-mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server.