🚀 Quick Start
📋 Prerequisites
🔑 Environment Variables
💻 Example Usage with NPX
🔌 Integration with MCP Clients
🛠️ Setting Up in an MCP Client
Example Configurations for Different Operating Systems
Note: For Windows paths, use double backslashes (
\\
) to escape the backslash character in JSON. For Linux/macOS, use forward slashes (/
).
✨ Features
💡 Enhanced Capabilities
🔄 How It Works
📁 Output Directory Behavior
Installation & Usage
NPM Package
This package is available on npm: @cloudwerxlab/gpt-image-1-mcp
You can install it globally:
Or run it directly with npx as shown in the Quick Start section.
Tool: create_image
Generates a new image based on a text prompt.
Parameters
Parameter | Type | Required | Description |
---|---|---|---|
prompt | string | Yes | The text description of the image to generate (max 32,000 chars) |
size | string | No | Image size: "1024x1024" (default), "1536x1024", or "1024x1536" |
quality | string | No | Image quality: "high" (default), "medium", or "low" |
n | integer | No | Number of images to generate (1-10, default: 1) |
background | string | No | Background style: "transparent", "opaque", or "auto" (default) |
output_format | string | No | Output format: "png" (default), "jpeg", or "webp" |
output_compression | integer | No | Compression level (0-100, default: 0) |
user | string | No | User identifier for OpenAI usage tracking |
moderation | string | No | Moderation level: "low" or "auto" (default) |
Example
Response
The tool returns:
- A formatted text message with details about the generated image(s)
- The image(s) as base64-encoded data
- Metadata including token usage and file paths
Tool: create_image_edit
Edits an existing image based on a text prompt and optional mask.
Parameters
Parameter | Type | Required | Description |
---|---|---|---|
image | string, object, or array | Yes | The image(s) to edit (base64 string or file path object) |
prompt | string | Yes | The text description of the desired edit (max 32,000 chars) |
mask | string or object | No | The mask that defines areas to edit (base64 string or file path object) |
size | string | No | Image size: "1024x1024" (default), "1536x1024", or "1024x1536" |
quality | string | No | Image quality: "high" (default), "medium", or "low" |
n | integer | No | Number of images to generate (1-10, default: 1) |
background | string | No | Background style: "transparent", "opaque", or "auto" (default) |
user | string | No | User identifier for OpenAI usage tracking |
Example with Base64 Encoded Image
Example with File Path
Response
The tool returns:
- A formatted text message with details about the edited image(s)
- The edited image(s) as base64-encoded data
- Metadata including token usage and file paths
🔧 Troubleshooting
🚨 Common Issues
🔍 Error Handling and Reporting
The MCP server includes comprehensive error handling that provides detailed information when something goes wrong. When an error occurs:
- Error Format: All errors are returned with:
- A clear error message describing what went wrong
- The specific error code or type
- Additional context about the error when available
- AI Assistant Behavior: When using this MCP server with AI assistants:
- The AI will always report the full error message to help with troubleshooting
- The AI will explain the likely cause of the error in plain language
- The AI will suggest specific steps to resolve the issue
📄 License
🙏 Acknowledgments
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
Un servidor de protocolo de contexto de modelo que permite generar y editar imágenes utilizando el modelo gpt-image-1 de OpenAI, lo que permite a los asistentes de IA crear y modificar imágenes a partir de indicaciones de texto.
- 📋 Requisitos previos
- 🔑 Variables de entorno
- 💻 Ejemplo de uso con NPX
- 🔌 Integración con clientes MCP
- ✨ Características
- 🔄 Cómo funciona
- Instalación y uso
- 🔧 Solución de problemas
- 📄 Licencia
- 🙏 Agradecimientos
Related Resources
Related MCP Servers
- AsecurityFlicenseAqualityA Model Context Protocol server that provides image generation capabilities using the Ideogram API, allowing users to create images from text prompts with customizable parameters.Last updated -174TypeScript
- -securityAlicense-qualityA server that provides AI-powered image generation, modification, and processing capabilities through the Model Context Protocol, leveraging Google Gemini models and other image services.Last updated -12PythonMIT License
- -securityAlicense-qualityA Model Context Protocol server enabling AI assistants to generate images through OpenAI's DALL-E API with full support for all available options and fine-grained control.Last updated -41JavaScriptMIT License
- -securityAlicense-qualityProvides tools for generating and editing images using OpenAI's gpt-image-1 model via an MCP interface, enabling AI assistants to create and modify images based on text prompts.Last updated -16PythonApache 2.0