Skip to main content
Glama
tasopen

mcp-alphabanana

mcp-alphabanana

English | 日本語
alphananana on the move
A Model Context Protocol (MCP) server for generating image assets using Google Gemini AI (Gemini 3.1 Flash/Nano Banana 2 supported).

Built with FastMCP 3 for a simplified codebase and flexible output options.

Why alphabanana?

  • Zero Watermarks: API-native clean images.

  • Thinking/Grounding Support: Higher prompt adherence and search-backed accuracy.

  • Production Ready: Supports transparent WebP and exact aspect ratios for web/game assets.

Features

  • Ultra-fast image generation (Gemini 3.1 Flash, 0.5K/1K/2K/4K)

  • Advanced multi-image reasoning (up to 14 reference images)

  • Thinking/Grounding support (Flash3.1 only)

  • Transparent PNG/WebP output (color-key post-processing, despill)

  • Multiple output formats: file, base64, or both

  • Flexible resize modes: crop, stretch, letterbox, contain

  • Multiple model tiers: Flash3.1, Flash2.5, Pro3, legacy aliases

Installation

Add @tasopen/mcp-alphabanana to your MCP Servers configuration.

Configuration

Configure the GEMINI_API_KEY in your MCP configuration (e.g. mcp.json).

Examples:

  • Reference an OS environment variable from mcp.json:

{
  "env": {
    "GEMINI_API_KEY": "${env:GEMINI_API_KEY}"
  }
}
  • Provide the key directly in mcp.json:

{
  "env": {
    "GEMINI_API_KEY": "your_api_key_here"
  }
}

VS Code Integration

Add to your VS Code settings (.vscode/settings.json or user settings), configuring the server env in mcp.json or via the VS Code MCP settings.

{
  "mcp": {
    "servers": {
      "mcp-alphabanana": {
        "command": "npx",
        "args":["-y", "@tasopen/mcp-alphabanana"],
        "env": {
          "GEMINI_API_KEY": "${env:GEMINI_API_KEY}"  // or "your_api_key_here"
        }
      }
    }
  }
}

Optional: Set a custom fallback directory for write failures by adding MCP_FALLBACK_OUTPUT to the env object.

Model Selection & Parameters

Input Model ID

Internal Model ID

Description

Flash3.1

gemini-3.1-flash-image-preview

Ultra-fast, supports Thinking/Grounding.

Flash2.5

gemini-2.5-flash-image

Legacy Flash. High stability. Low cost.

Pro3

gemini-3.0-pro-image-preview

High-fidelity Pro model.

flash

gemini-3.1-flash-image-preview

Alias for backward compatibility.

pro

gemini-3.0-pro-image-preview

Alias for backward compatibility.

Parameters (v2.0)

Parameter

Type

Default

Description

prompt

string

required

Description of the image to generate

model

enum

Flash3.1

Model: Flash3.1, Flash2.5, Pro3, flash, pro

output_resolution

enum

1K

0.5K, 1K, 2K, 4K (0.5K/2K/4K: Flash3.1 only)

output_format

enum

png

png, jpg, webp (WebP: alpha supported)

transparent

boolean

false

Transparent background (PNG/WebP only)

grounding_type

enum

none

none, text, image, both (Flash3.1 only)

thinking_mode

enum

minimal

minimal, high (Flash3.1 only)

include_thoughts

boolean

false

Return model's "thoughts" (Flash3.1 only)

include_metadata

boolean

false

Include grounding/reasoning metadata in JSON output

reference_images

array

[]

Up to 14 (Flash3.1/Pro3), 3 (Flash2.5)


Usage Examples

Basic Generation

{
  "prompt": "A pixel art treasure chest, golden trim, wooden texture",
  "model": "Flash3.1",
  "outputFileName": "chest",
  "outputWidth": 64,
  "outputHeight": 64,
  "transparent": true
}

Advanced (WebP, thinking, grounding)

{
  "prompt": "A photorealistic girl with wings flying over a European countryside",
  "model": "Flash3.1",
  "outputFileName": "girl_wings",
  "outputWidth": 632,
  "outputHeight": 424,
  "output_format": "webp",
  "thinking_mode": "high",
  "grounding_type": "both",
  "include_thoughts": true
}

Grounding Sample (Search-backed)

{
  "prompt": "A modern travel poster featuring today's weather and skyline highlights in Kuala Lumpur",
  "model": "Flash3.1",
  "outputFileName": "kl_travel_poster",
  "outputWidth": 1024,
  "outputHeight": 1024,
  "grounding_type": "text",
  "thinking_mode": "high",
  "include_metadata": true,
  "include_thoughts": true
}

This sample enables Google Search grounding and returns grounding/reasoning metadata in JSON.

With Reference Images

{
  "prompt": "A matching treasure chest, open state, same pixel art style as references",
  "model": "Pro3",
  "outputFileName": "chest_open",
  "outputWidth": 64,
  "outputHeight": 64,
  "transparent": true,
  "reference_images": [
    {
      "description": "Closed chest for style reference",
      "data": "...base64..."
    }
  ]
}

Transparency & Output Formats

  • PNG: Full alpha, color-key + despill

  • WebP: Full alpha, better compression (Flash3.1+)

  • JPEG: No transparency (falls back to solid background)


Development

# Development mode with MCP CLI
npm run dev


# MCP Inspector (Web UI)
npm run inspect

# Build for production
npm run build

License

MIT

-
security - not tested
-
license - not tested
-
quality - not tested

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/tasopen/mcp-alphabanana'

If you have feedback or need assistance with the MCP directory API, please join our Discord server