Gemini MCP Image Generation Server

by sanxfxteam

Integrations

  • Enables configuration of the server through environment variables loaded from a .env file, specifically for storing the Google API key required for authentication.

  • Provides image generation capabilities using Google's Gemini 2 API, allowing users to generate images based on text prompts with customizable parameters such as number of samples and aspect ratio.

  • Used for package management, installation of dependencies, and running the server through npm commands.

Gemini Image Generation MCP Server

This is a Model Context Protocol (MCP) server that provides image generation capabilities using Google's Gemini 2 API.

Quick Start

  1. Get Gemini API Key
  2. Configure Claude Desktop
    • Locate your config file:
      Mac: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json Linux: ~/.config/Claude/claude_desktop_config.json
    • Add Gemini configuration:
      { "mcpServers": { "gemini-imagen": { "command": "npx", "args": ["-y", "github:sanxfxteam/gemini-mcp-server"], "env": { "GEMINI_API_KEY": "your_api_key_here" } } } }
  3. Restart Claude Desktop

Setup

  1. Install dependencies:
npm install
  1. Set up your environment variables: Create a .env file in the root directory and add your Google API key:
GEMINI_API_KEY=your_api_key_here

Usage

Run the server:

npm start

To test

npx @modelcontextprotocol/inspector npm run start

Available Tools

generateImage

Generates images using Gemini 2's experimental image generation API.

Parameters:

  • prompt (string, required): The description of the image you want to generate
  • numSamples (number, optional, default: 4): Number of images to generate
  • aspectRatio (string, optional, default: '1:1'): Aspect ratio of the generated images
  • personGeneration (string, optional, default: 'ALLOW_ADULT'): Person generation settings

Example MCP request:

{ "tool": "generateImage", "params": { "prompt": "A serene mountain landscape at sunset", "numSamples": 2, "aspectRatio": "16:9" } }

Notes

  • This server uses the experimental image generation feature of Gemini 2
  • Make sure you have appropriate access and API keys from Google
  • The server communicates using the Model Context Protocol over stdio

You must be authenticated.

A
security – no known vulnerabilities
A
license - permissive license
A
quality - confirmed to work

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

A Model Context Protocol server that provides image generation capabilities using Google's Gemini 2 API, allowing users to generate multiple images with customizable parameters like prompts, aspect ratios, and person generation settings.

  1. Quick Start
    1. Setup
      1. Usage
        1. Available Tools
      2. Notes

        Related MCP Servers

        • A
          security
          F
          license
          A
          quality
          A Model Context Protocol server that provides image generation capabilities using the Ideogram API, allowing users to create images from text prompts with customizable parameters.
          Last updated -
          1
          1
          3
          JavaScript
        • A
          security
          F
          license
          A
          quality
          A Model Context Protocol server that provides an image generation tool using Templated.io, allowing users to create customized images based on templates with text and image layers.
          Last updated -
          TypeScript
          • Apple
        • A
          security
          A
          license
          A
          quality
          A Model Context Protocol server that enables generation of high-quality images using the Flux.1 Schnell model via Together AI, allowing users to create images from text prompts with customizable dimensions.
          Last updated -
          1
          8
          Python
          MIT License
          • Apple
        • -
          security
          A
          license
          -
          quality
          A server that provides AI-powered image generation, modification, and processing capabilities through the Model Context Protocol, leveraging Google Gemini models and other image services.
          Last updated -
          6
          Python
          MIT License
          • Linux
          • Apple

        View all related MCP servers

        ID: l17qmg4mfs