Which integrations are available for this server?

Enables AI image generation using Google's Gemini 2.5 Flash and Gemini 3 Pro models, supporting up to 4K resolution output, flexible aspect ratios, Google Search grounding for factual accuracy, and natural language image editing capabilities.

How do I use Banana Image MCP?

1. Click on "Install Server".
2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Banana Image MCP create a 4K professional logo for a coffee shop with a minimalist cat design".

That's it! The server will respond to your query, and you can continue using it as needed.

Banana Image MCP by zengwenliang416

About The Project

Banana Image MCP is a production-ready MCP (Model Context Protocol) server that enables Claude and other AI assistants to generate high-quality images using Google's latest Gemini image models.

Simply describe what you want, and Claude will create it — from quick concept sketches to stunning 4K professional artwork.

Why Banana Image MCP?

Zero Setup Complexity — Just add your API key and start generating
Production Ready — Built with FastMCP framework, fully tested, CI/CD enabled
Best Quality — Leverages Gemini's most advanced image models with 4K support
Smart Defaults — Intelligent model selection based on your prompts
Real-World Knowledge — Google Search grounding for accurate, factual images

Features

4K Ultra HD Output

Generate images up to 3840px with the Pro model. Perfect for professional work, marketing materials, and print-ready assets.

Dual Model Support

Flash: 2-3s, up to 1024px — for quick iterations
Pro: 5-8s, up to 4K — for final deliverables

Smart Model Selection

The server automatically picks the best model based on your prompt. Say "quick sketch" for Flash, or "4K professional" for Pro.

Google Search Grounding

The Pro model uses real-world knowledge from Google Search to generate more accurate and factual images.

Flexible Aspect Ratios

Supports common aspect ratios, including 1:1, 16:9, 9:16, 4:3, 3:2, and 21:9.
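To get a feel for what a ratio means at a given size, the sketch below converts an aspect ratio string and a long-edge length into approximate pixel dimensions. This is plain arithmetic for estimation only: the helper name `approx_dimensions` is hypothetical, and the exact pixel sizes the Gemini models return may differ from these numbers.

```python
def approx_dimensions(aspect_ratio: str, long_edge: int) -> tuple[int, int]:
    """Estimate (width, height) for a ratio like "16:9" and a long edge in pixels.

    Hypothetical helper for illustration; not part of banana-image-mcp.
    """
    width_part, height_part = (int(part) for part in aspect_ratio.split(":"))
    if width_part <= 0 or height_part <= 0:
        raise ValueError(f"invalid aspect ratio: {aspect_ratio!r}")

    if width_part >= height_part:
        # Landscape or square: the width is the long edge.
        return long_edge, round(long_edge * height_part / width_part)
    # Portrait: the height is the long edge.
    return round(long_edge * width_part / height_part), long_edge
```

For example, `approx_dimensions("16:9", 3840)` gives `(3840, 2160)`, and `approx_dimensions("9:16", 3840)` gives `(2160, 3840)`.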

Natural Language Editing

Edit existing images with simple text commands like "make the sky more dramatic" or "remove the background".

Quick Start

Get up and running in under 2 minutes.

Prerequisites

Get a free Gemini API key from Google AI Studio
Have Claude Desktop installed

Installation

Add to your Claude Desktop config file:

{
  "mcpServers": {
    "banana-image": {
      "command": "uvx",
      "args": ["banana-image-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

Platform	Path
macOS	`~/Library/Application Support/Claude/claude_desktop_config.json`
Windows	`%APPDATA%\Claude\claude_desktop_config.json`
Linux	`~/.config/Claude/claude_desktop_config.json`
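
If you prefer to script this step, the following Python sketch resolves the config path for each platform and merges the server entry into an existing config without dropping other servers. The paths and the JSON entry come from the snippet and table above; the function names `claude_config_path`, `add_banana_server`, and `update_config_file` are hypothetical helpers, not part of banana-image-mcp.

```python
import json
import os
import sys
from pathlib import Path

CONFIG_NAME = "claude_desktop_config.json"


def claude_config_path(platform: str = sys.platform) -> Path:
    """Return the Claude Desktop config path for the platforms listed above."""
    if platform == "darwin":
        return Path.home() / "Library" / "Application Support" / "Claude" / CONFIG_NAME
    if platform.startswith("win"):
        # Assumption: fall back to the usual roaming profile if APPDATA is unset.
        appdata = os.environ.get("APPDATA") or str(Path.home() / "AppData" / "Roaming")
        return Path(appdata) / "Claude" / CONFIG_NAME
    return Path.home() / ".config" / "Claude" / CONFIG_NAME


def add_banana_server(config: dict, api_key: str) -> dict:
    """Return a copy of config with the banana-image entry added or replaced."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["banana-image"] = {
        "command": "uvx",
        "args": ["banana-image-mcp"],
        "env": {"GEMINI_API_KEY": api_key},
    }
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path, api_key: str) -> None:
    """Read the config if it exists, merge the entry, and write it back."""
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(add_banana_server(config, api_key), indent=2), encoding="utf-8")
```

Calling `update_config_file(claude_config_path(), "your-api-key-here")` would apply the change; restart Claude Desktop afterwards so it picks up the new entry.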

When using uvx, packages are cached locally. To get the latest version:

# Clear the cache for this package
uv cache clean banana-image-mcp

# Then restart Claude Desktop

Or specify a version explicitly in your config:

"args": ["banana-image-mcp==1.0.1"]

The configuration is the same for other MCP-compatible clients. Just add the server config to your client's MCP configuration file.

Usage

Just ask Claude to generate images naturally:

"Generate a cute cat wearing a space suit"
"Create a professional product photo of a coffee cup, 4K quality"
"Make a 16:9 YouTube thumbnail about cooking"
"Edit this image: make the sky more dramatic"

Model Comparison

Model	Speed	Max Resolution	Best For
Gemini 2.5 Flash	2-3s	1024px	Quick drafts, iterations, prototypes
Gemini 3 Pro	5-8s	4K (3840px)	Final assets, marketing, professional work

The server defaults to the Pro model for best quality. Use these keywords in your prompt to switch models:

Say this...	Model Used
"quick sketch", "draft", "prototype"	Flash
"4K", "professional", "high quality"	Pro
(default)	Pro
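
The keyword table can be read as a simple lookup. The sketch below is one way such a rule could be written; it is not the server's actual selection code. The name `select_model_tier` is hypothetical, and checking the Pro keywords first when a prompt contains both kinds is an assumption, since the table does not say how conflicts are resolved.

```python
FLASH_KEYWORDS = ("quick sketch", "draft", "prototype")
PRO_KEYWORDS = ("4k", "professional", "high quality")


def select_model_tier(prompt: str, default: str = "pro") -> str:
    """Pick "flash" or "pro" from keywords in the prompt (illustrative only)."""
    text = prompt.lower()
    # Assumption: quality keywords take priority over speed keywords.
    if any(keyword in text for keyword in PRO_KEYWORDS):
        return "pro"
    if any(keyword in text for keyword in FLASH_KEYWORDS):
        return "flash"
    # No keyword matched: fall back to the documented default (Pro).
    return default
```

With this rule, "a quick sketch of a cat" maps to Flash, while "a cat" and "4K professional logo" both map to Pro.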

Parameters Reference

Parameter	Type	Default	Description
`prompt`	string	required	Image description
`model_tier`	string	`"pro"`	`"flash"`, `"pro"`, or `"auto"`
`resolution`	string	`"4k"`	`"1k"`, `"2k"`, `"4k"`, `"high"`
`aspect_ratio`	string	-	`"1:1"`, `"16:9"`, `"9:16"`, `"4:3"`, `"21:9"`, etc.
`thinking_level`	string	`"high"`	`"low"` or `"high"` (Pro only)
`enable_grounding`	bool	`true`	Enable Google Search grounding (Pro only)
`n`	int	`1`	Number of images (1-4)
`negative_prompt`	string	-	What to avoid in the image
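
As a rough aid for client-side code, the table above can be mirrored in a small dataclass that applies the documented defaults and rejects out-of-range values before a request is sent. The class name `GenerateImageArgs` and its `to_payload` method are hypothetical; only the parameter names, defaults, and allowed values are taken from the table. `aspect_ratio` is deliberately not validated because the list of accepted ratios is open-ended.

```python
from dataclasses import asdict, dataclass
from typing import Optional

MODEL_TIERS = {"flash", "pro", "auto"}
RESOLUTIONS = {"1k", "2k", "4k", "high"}
THINKING_LEVELS = {"low", "high"}


@dataclass
class GenerateImageArgs:
    """Hypothetical container mirroring the parameter table and its defaults."""

    prompt: str
    model_tier: str = "pro"
    resolution: str = "4k"
    aspect_ratio: Optional[str] = None
    thinking_level: str = "high"
    enable_grounding: bool = True
    n: int = 1
    negative_prompt: Optional[str] = None

    def __post_init__(self) -> None:
        if not self.prompt.strip():
            raise ValueError("prompt is required")
        if self.model_tier not in MODEL_TIERS:
            raise ValueError(f"model_tier must be one of {sorted(MODEL_TIERS)}")
        if self.resolution not in RESOLUTIONS:
            raise ValueError(f"resolution must be one of {sorted(RESOLUTIONS)}")
        if self.thinking_level not in THINKING_LEVELS:
            raise ValueError(f"thinking_level must be one of {sorted(THINKING_LEVELS)}")
        if not 1 <= self.n <= 4:
            raise ValueError("n must be between 1 and 4")

    def to_payload(self) -> dict:
        """Return the arguments as a dict, omitting unset optional fields."""
        return {key: value for key, value in asdict(self).items() if value is not None}
```

For instance, `GenerateImageArgs("a cat", aspect_ratio="16:9").to_payload()` yields a dict with the defaults filled in and no `negative_prompt` key.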

Environment Variables

Variable	Required	Default	Description
`GEMINI_API_KEY`	Yes	-	Your Gemini API key
`IMAGE_OUTPUT_DIR`	No	`~/banana-images`	Where to save generated images
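
A minimal sketch of how a process might read these two variables, assuming the defaults in the table. The function `load_settings` is a hypothetical helper for illustration and is not taken from the server's source.

```python
import os
from pathlib import Path
from typing import Mapping


def load_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Read the documented environment variables, applying the documented default."""
    api_key = env.get("GEMINI_API_KEY", "").strip()
    if not api_key:
        # GEMINI_API_KEY is the only required variable.
        raise RuntimeError("GEMINI_API_KEY is required")

    # IMAGE_OUTPUT_DIR is optional and defaults to ~/banana-images.
    output_dir = Path(env.get("IMAGE_OUTPUT_DIR", "~/banana-images")).expanduser()
    return {"api_key": api_key, "output_dir": output_dir}
```

Passing a plain dict instead of `os.environ` makes the helper easy to test without touching the real environment.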

Roadmap

4K resolution output (up to 3840px)
Dual model support (Flash + Pro)
Google Search grounding
Flexible aspect ratios
Natural language image editing
GitHub Actions CI/CD
Batch image generation
Image-to-image transformation
Video generation support
Local model support (Ollama)

See the open issues for a full list of proposed features and known issues.

Development

# Clone the repository
git clone https://github.com/zengwenliang416/banana-image-mcp.git
cd banana-image-mcp

# Install dependencies
uv sync

# Run in development mode
fastmcp dev banana_image_mcp.server:create_app

# Run tests
pytest

# Lint and format
ruff check .
ruff format .