Question 1

What can you do with this server?

Accepted Answer

This server lets an AI agent (e.g., Claude Code) analyze images using Google's Gemini vision models and receive text-based answers without loading raw image data into the agent's context window.

* Analyze a single image: Provide a local file path or HTTP(S) URL and receive a detailed text description or answer.
* Compare multiple images: Pass an array of images (e.g., before/after screenshots) to reason about differences or similarities in a single request.
* Ask custom questions: Supply an optional prompt to guide the analysis (e.g., "What does this chart show?"), or let it default to a general description/comparison.
* Override the Gemini model per request: Choose a specific model (e.g., gemini-pro-latest) for more demanding visual reasoning tasks.
* Wide format support: Works with PNG, JPEG, WebP, GIF, BMP, HEIC/HEIF, and PDF files.
* Context efficiency: Processes raw image bytes server-side and returns only the textual analysis, preventing context window bloat.
* Safe key handling: API key is read from a local .env file and not exposed to the calling agent.

Typical use cases include interpreting screenshots, UI states, diagrams, charts, and performing visual comparisons.

Question 2

Which integrations are available for this server?

Accepted Answer

Analyzes images using Google's Gemini vision models, enabling AI agents to understand visual content via text prompts.

Question 3

How do I use gemini-image-mcp?

Accepted Answer

1. Click on "Install Server".
2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@gemini-image-mcp Describe this UI mockup: /designs/mockup.png"

That's it! The server will respond to your query, and you can continue using it as needed.

Here is a step-by-step guide with screenshots.

Variable	Required	Default	Notes
`GEMINI_API_KEY`	yes	—	Your AI Studio key. Never commit it.
`GEMINI_MODEL`	no	`gemini-flash-latest`	Use `gemini-pro-latest` for harder visual reasoning.

Argument	Type	Required	Description
`image`	string \| string[]	yes	A single local file path or `http(s)` URL, or an array of them to compare/reason about together.
`prompt`	string	no	What to ask. Defaults to a detailed description (one image) or a comparison (several).
`model`	string	no	Per-call model override.

gemini-image-mcp

gemini-image-mcp

Setup

Configuration

Use with Claude Code

Tool: `analyze_image`

Smoke test

License

Maintenance

Resources

Looking for Admin?

Tools

Related MCP Servers

MCP Read Images

vision-mcp

VisionPower

mcp-see

Related MCP Connectors

Latest Blog Posts

MCP directory API

gemini-image-mcp

Setup

Configuration

Use with Claude Code

Tool: analyze_image

Smoke test

License

Maintenance

Resources

Looking for Admin?

Tools

Related MCP Servers

MCP Read Images

vision-mcp

VisionPower

mcp-see

Related MCP Connectors

Latest Blog Posts

MCP directory API

Tool: `analyze_image`