Gemini MCP Server
Server Configuration
Describes the environment variables required to run the server.
| Name | Required | Description | Default |
|---|---|---|---|
| DEBUG_MCP | No | Log to stderr for debugging tool calls (default: false) | false |
| GEMINI_AGY_BIN | No | Path/name of the agy executable for agent tools (default: agy) | agy |
| GEMINI_API_KEY | Yes | Google AI API key from AI Studio | |
| GEMINI_AGY_MODEL | No | Default agent model label (e.g. Gemini 3.1 Pro (High)) | |
| GEMINI_MCP_LOG_FILE | No | Write logs to ~/.gemini-mcp/logs/ (default: false) | false |
| GEMINI_DEFAULT_MODEL | No | Default model for gemini_chat and analyze_image (default: gemini-3.1-pro-preview) | gemini-3.1-pro-preview |
| GEMINI_AGY_AUTO_APPROVE | No | Auto-approve agent tool calls (default: true) | true |
| GEMINI_IMAGE_OUTPUT_DIR | No | Auto-save directory for generated images and videos | |
| GEMINI_DEFAULT_GROUNDING | No | Enable Google Search grounding by default (default: true) | true |
| GEMINI_ALLOW_EXPERIMENTAL | No | Include experimental/preview models in auto-discovery (default: false) | false |
| GEMINI_AGY_TIMEOUT_SECONDS | No | Default hard timeout per agent run (default: 600) | 600 |
| GEMINI_DEEP_RESEARCH_AGENT | No | Deep Research agent for gemini_deep_research (default: deep-research-pro-preview-12-2025) | deep-research-pro-preview-12-2025 |
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tools | {
"listChanged": true
} |
| resources | {
"listChanged": true
} |
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| gemini_helpA | Get comprehensive help about Gemini MCP features, settings, and best practices |
| gemini_prompt_assistantA | Get expert prompt templates and guidance for Gemini image generation. Covers photography (portraits, products, cinematic), chart/diagram design (9 professional design systems including FT, Bloomberg, Tufte, Du Bois), lighting, colour grading, lens simulation, and style aesthetics. For charts: use chart_design with a color_scheme to get a full professional design system prompt. |
| gemini_chatC | Chat with Google Gemini models |
| gemini_list_modelsA | List available Gemini models and their descriptions |
| gemini_deep_researchA | Conduct real Deep Research on complex topics: Google's autonomous Deep Research agent does multi-step live web search and returns a synthesised, cited report (takes several minutes). Runs the actual Deep Research agents via the Gemini Interactions API. [MCP_RECOMMENDED_TIMEOUT_MS: 900000] |
| describe_imageA | Analyze and describe one or more images using Google Gemini image models (Nano Banana Pro). Returns a text description — no image is generated. Default model: gemini-3-flash-preview. |
| analyze_imageB | Analyze and extract information from one or more images using Gemini multimodal understanding. Returns a text analysis — no image is generated. Default model: gemini-3-pro-preview. |
| load_image_from_pathA | Read a local image file and return it as base64-encoded data ready to pass to generate_image, edit_image, describe_image, or analyze_image tools. Supports JPEG, PNG, GIF, WebP, BMP. |
| generate_imageB | Generate an image using Google Gemini image models (Nano Banana Pro). Returns image with inline preview in Claude Desktop and saves full-resolution to disk. Default model: gemini-3-pro-image-preview. |
| edit_imageA | Edit one or more images using Google Gemini image models (Nano Banana Pro). Provide images and natural-language instructions for how to modify them. Returns edited image with inline preview and saves full-resolution to disk. |
| generate_landing_pageA | Generate a complete, self-contained HTML landing page using Gemini. Returns inline preview with responsive viewport controls. No external dependencies; inline CSS and vanilla JS only. |
| generate_svgA | Generate scalable vector graphics (SVG) using Gemini. Creates clean, production-ready SVG code for diagrams, illustrations, icons, and data visualizations. Returns inline preview with SVG viewer. |
| generate_videoA | Generate videos using Google Veo 3.1 AI model. Creates realistic 4-8 second videos from text prompts with optional first-frame image and reference images for character/style consistency. Supports native audio generation. Processing time: 2-5 minutes for 1080p videos. Returns video file path with optional thumbnail and HTML preview player. ⚠️ IMPORTANT: Video generation is ASYNC and takes 2-5 minutes. The tool will poll for completion automatically. |
| gemini_agentA | Delegate a task to an autonomous Gemini agent (via the |
| gemini_agent_modelsA | List the model labels available to the Gemini agent (gemini_agent). These come from |
| ocrA | Extract text verbatim from images or PDFs using Gemini multimodal OCR. Returns the raw text (as Markdown for structure) — no summarising or analysis. For documents/PDFs, MEDIUM resolution gives the same OCR quality at half the token cost. |
| generate_summaryA | Summarise text or a local file with Gemini. Provide |
| transcribeB | Transcribe speech from an audio or video file to text using Gemini. Optional timestamps, speaker labels (diarization) and language hint. Provide a local file_path (loaded server-side) or inline base64 data. |
| extract_structured_dataA | Extract structured JSON from text or almost any local file (text/code, PDF, image, audio, video) using Gemini JSON mode. Describe the fields you want in |
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
| gemini-video-viewer | |
| gemini-image-viewer | |
| gemini-svg-viewer | |
| gemini-landing-page-viewer |
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/Raindancer118/gemini-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server