gemini-image-mcp
Analyzes images using Google's Gemini vision models, enabling AI agents to understand visual content via text prompts.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@gemini-image-mcpDescribe this UI mockup: /designs/mockup.png"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
gemini-image-mcp
A tiny MCP server that analyzes images with Google's
Gemini vision models. It exposes one tool, analyze_image, that takes a local image path
(or URL) plus an optional prompt and returns Gemini's text answer.
Why: it lets an agent (e.g. Claude Code) read screenshots, diagrams, charts, or UI states by reference — the raw image bytes go to Gemini, and only the text answer comes back, so they never bloat the calling agent's context window.
It talks straight to the Gemini REST API (generativelanguage.googleapis.com) with fetch —
no Google SDK, no gemini-cli, nothing tied to the deprecated consumer CLI.
Setup
npm install
npm run build
cp .env.example .env # then put your key in .envGet an API key at https://aistudio.google.com/apikey.
Configuration
Set via .env (loaded automatically from the repo root) or ambient environment:
Variable | Required | Default | Notes |
| yes | — | Your AI Studio key. Never commit it. |
| no |
| Use |
The key is sent as an x-goog-api-key header (kept out of URLs/logs) and is never written
to a tracked file — .env is gitignored.
Related MCP server: vision-mcp
Use with Claude Code
claude mcp add gemini-image -- node /absolute/path/to/gemini-image-mcp/dist/index.jsThe server loads its own .env, so no key needs to live in Claude's config. Restart Claude
Code, then it can call analyze_image with an image path and an optional prompt.
Tool: analyze_image
Argument | Type | Required | Description |
| string | yes | Absolute path to a local image file, or an |
| string | no | What to ask about the image. Defaults to a detailed description. |
| string | no | Per-call model override. |
Supported inputs: PNG, JPEG, WebP, GIF, BMP, HEIC/HEIF, and PDF.
Smoke test
Verify the key + API + image path end to end, without the MCP layer:
npm run smoke -- ./test/sample.png "What does this image say?"License
MIT
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Tools
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/AmyJeanes/gemini-image-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server