Enables AI image generation using Google's Gemini 2.5 Flash and Gemini 3 Pro models, supporting up to 4K resolution output, flexible aspect ratios, Google Search grounding for factual accuracy, and natural language image editing capabilities.
About The Project
Banana Image MCP is a production-ready MCP (Model Context Protocol) server that enables Claude and other AI assistants to generate high-quality images using Google's latest Gemini image models.
Simply describe what you want, and Claude will create it — from quick concept sketches to stunning 4K professional artwork.
Why Banana Image MCP?
Zero Setup Complexity — Just add your API key and start generating
Production Ready — Built with FastMCP framework, fully tested, CI/CD enabled
Best Quality — Leverages Gemini's most advanced image models with 4K support
Smart Defaults — Intelligent model selection based on your prompts
Real-World Knowledge — Google Search grounding for accurate, factual images
Built With
Features
4K Ultra HD Output
Generate images up to 3840px with the Pro model. Perfect for professional work, marketing materials, and print-ready assets.
Dual Model Support
Flash: 2-3s, up to 1024px — for quick iterations
Pro: 5-8s, up to 4K — for final deliverables
Smart Model Selection
The server automatically picks the best model based on your prompt. Say "quick sketch" for Flash, or "4K professional" for Pro.
Google Search Grounding
The Pro model draws on real-world knowledge from Google Search to generate more accurate, factual images.
Flexible Aspect Ratios
Supports common aspect ratios, including 1:1, 16:9, 9:16, 4:3, 3:2, and 21:9.
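To make the relationship between aspect ratio and resolution concrete, here is a minimal arithmetic sketch. It assumes the stated maximum (1024px for Flash, 3840px for Pro) applies to the longer edge of the image; the function name `dimensions_for` is hypothetical, and the dimensions the Gemini API actually returns may differ.

```python
def dimensions_for(aspect_ratio: str, long_edge: int) -> tuple:
    """Return (width, height) for a "W:H" ratio whose longer side is long_edge.

    Assumption: the resolution limit applies to the longer edge. This is
    plain arithmetic, not the server's actual sizing logic.
    """
    w_part, h_part = (int(part) for part in aspect_ratio.split(":"))
    if w_part <= 0 or h_part <= 0:
        raise ValueError(f"invalid aspect ratio: {aspect_ratio!r}")
    if w_part >= h_part:
        # Landscape or square: width is the long edge.
        return long_edge, round(long_edge * h_part / w_part)
    # Portrait: height is the long edge.
    return round(long_edge * w_part / h_part), long_edge


print(dimensions_for("16:9", 3840))  # landscape at the Pro limit
print(dimensions_for("9:16", 1024))  # portrait at the Flash limit
```

Under this assumption, a 16:9 image at the Pro limit works out to 3840 by 2160 pixels.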
Natural Language Editing
Edit existing images with simple text commands like "make the sky more dramatic" or "remove the background".
Quick Start
Get up and running in under 2 minutes.
Prerequisites
Get a free Gemini API key from Google AI Studio
Have Claude Desktop installed
Installation
Add to your Claude Desktop config file:
Platform | Path |
macOS | |
Windows | |
Linux | |
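The paths in the table above did not survive in this copy of the README. As a hedged sketch, the helper below resolves the locations commonly documented for Claude Desktop's `claude_desktop_config.json`. The function name is hypothetical, the Linux location in particular is an assumption, and all three should be verified against your own installation.

```python
import os
import sys
from pathlib import Path
from typing import Mapping, Optional

CONFIG_NAME = "claude_desktop_config.json"


def claude_desktop_config_path(platform: str = sys.platform,
                               env: Optional[Mapping[str, str]] = None) -> Path:
    """Guess the Claude Desktop config path for a platform.

    Assumption: these are the commonly documented locations, not paths
    confirmed by this project.
    """
    env = os.environ if env is None else env
    home = Path.home()
    if platform == "darwin":
        return home / "Library" / "Application Support" / "Claude" / CONFIG_NAME
    if platform.startswith("win"):
        # Fall back to the usual roaming profile if APPDATA is not set.
        base = Path(env["APPDATA"]) if "APPDATA" in env else home / "AppData" / "Roaming"
        return base / "Claude" / CONFIG_NAME
    # Assumed location on Linux and other platforms.
    return home / ".config" / "Claude" / CONFIG_NAME


print(claude_desktop_config_path())
```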
When using uvx, packages are cached locally, so the cached copy may lag behind the latest release. Refresh the cache to get the latest version, or specify a version explicitly in your config.
The configuration is the same for other MCP-compatible clients. Just add the server config to your client's MCP configuration file.
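As an illustration of the server config described above, the sketch below builds an entry in the `mcpServers` format that Claude Desktop reads and prints it as JSON. Several names are assumptions: the package name `banana-image-mcp` is taken from the repository name, the server label `banana-image` is arbitrary, and `GEMINI_API_KEY` is an assumed environment variable name. The `package@version` form (including `@latest`) is standard uvx syntax for pinning or refreshing a tool version.

```python
import json
from typing import Optional

PACKAGE = "banana-image-mcp"   # assumed package name, taken from the repo name
API_KEY_VAR = "GEMINI_API_KEY"  # assumed environment variable name


def build_server_entry(api_key: str, version: Optional[str] = None) -> dict:
    """Build one MCP server entry that launches the package through uvx."""
    # "pkg@1.2.3" pins a version; "pkg@latest" asks uvx for the newest release.
    spec = f"{PACKAGE}@{version}" if version else PACKAGE
    return {
        "command": "uvx",
        "args": [spec],
        "env": {API_KEY_VAR: api_key},
    }


config = {
    "mcpServers": {
        # "banana-image" is an arbitrary label chosen for this example.
        "banana-image": build_server_entry("your-api-key-here", version="latest"),
    }
}
print(json.dumps(config, indent=2))
```

Merge the printed `mcpServers` entry into your existing config file rather than overwriting it, since other servers may already be registered there.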
Usage
Just ask Claude to generate images naturally:
Model Comparison
Model | Speed | Max Resolution | Best For |
Gemini 2.5 Flash | 2-3s | 1024px | Quick drafts, iterations, prototypes |
Gemini 3 Pro | 5-8s | 4K (3840px) | Final assets, marketing, professional work |
The server defaults to the Pro model for best quality. Use keywords in your prompt to control the choice:
Say this... | Model Used |
"quick sketch", "draft", "prototype" | Flash |
"4K", "professional", "high quality" | Pro |
(default) | Pro |
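The keyword table above can be sketched as a simple substring check. This is an illustrative approximation, not the server's actual selection logic: the function name `select_model`, the return values `"flash"` and `"pro"`, and the rule that Pro keywords win when both kinds appear are all assumptions.

```python
FLASH_KEYWORDS = ("quick sketch", "draft", "prototype")
PRO_KEYWORDS = ("4k", "professional", "high quality")


def select_model(prompt: str) -> str:
    """Pick "flash" or "pro" from keywords in the prompt (naive matching)."""
    text = prompt.lower()
    wants_pro = any(keyword in text for keyword in PRO_KEYWORDS)
    wants_flash = any(keyword in text for keyword in FLASH_KEYWORDS)
    # Assumption: an explicit quality request overrides a speed hint.
    if wants_flash and not wants_pro:
        return "flash"
    # Pro is the documented default when no Flash keyword applies.
    return "pro"


print(select_model("a quick sketch of a lighthouse"))
print(select_model("4K professional product photo"))
print(select_model("a cat on a windowsill"))
```

Substring matching is deliberately naive here; for example, "draft" would also match inside "draftsman", so a real implementation would likely need word-boundary handling.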
Parameters Reference
Parameter | Type | Default | Description |
| string | required | Image description |
| string | | |
| string | | |
| string | - | |
| string | | (Pro only) |
| bool | | Enable Google Search grounding (Pro only) |
| int | | Number of images (1-4) |
| string | - | What to avoid in the image |
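The constraints stated in the table (a required prompt, 1-4 images, Pro-only search grounding) can be checked before a request is sent. The sketch below is a hypothetical client-side validator: the parameter names and defaults did not survive in this copy of the README, so every field name and default here (`prompt`, `model`, `count`, `use_search_grounding`, `negative_prompt`) is a placeholder, not the server's actual API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ImageRequest:
    """Hypothetical request shape; field names and defaults are placeholders."""
    prompt: str
    model: str = "pro"
    count: int = 1
    use_search_grounding: bool = False
    negative_prompt: Optional[str] = None

    def validate(self) -> None:
        """Raise ValueError if a documented constraint is violated."""
        if not self.prompt.strip():
            raise ValueError("an image description is required")
        if not 1 <= self.count <= 4:
            raise ValueError("number of images must be between 1 and 4")
        if self.use_search_grounding and self.model != "pro":
            raise ValueError("Google Search grounding is Pro only")


request = ImageRequest(prompt="a red bicycle", count=2, use_search_grounding=True)
request.validate()  # passes: Pro model, 2 images, non-empty prompt
```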
Environment Variables
Variable | Required | Default | Description |
| Yes | - | Your Gemini API key |
| No | | Where to save generated images |
What You Can Create
Roadmap
4K resolution output (up to 3840px)
Dual model support (Flash + Pro)
Google Search grounding
Flexible aspect ratios
Natural language image editing
GitHub Actions CI/CD
Batch image generation
Image-to-image transformation
Video generation support
Local model support (Ollama)
See the open issues for a full list of proposed features and known issues.
Development
Contributing
Contributions make the open source community amazing. Any contributions you make are greatly appreciated.
Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request
License
Distributed under the MIT License. See LICENSE for more information.
Contact
Wenliang Zeng - @zengwenliang416
Project Link: https://github.com/zengwenliang416/banana-image-mcp
Acknowledgments
FastMCP - The MCP framework that powers this server
Google Gemini - The AI models behind image generation
Anthropic MCP - The protocol specification
Best-README-Template - README inspiration