Schema | Gemini MCP Server

Gemini MCP Server

Overview Schema Related Servers Score Discussions

Server Configuration

Describes the environment variables required to run the server.

Name	Required	Description	Default
`DEBUG_MCP`	No	Log to stderr for debugging tool calls (default: false)	false
`GEMINI_AGY_BIN`	No	Path/name of the agy executable for agent tools (default: agy)	agy
`GEMINI_API_KEY`	Yes	Google AI API key from AI Studio
`GEMINI_AGY_MODEL`	No	Default agent model label (e.g. Gemini 3.1 Pro (High))
`GEMINI_MCP_LOG_FILE`	No	Write logs to ~/.gemini-mcp/logs/ (default: false)	false
`GEMINI_DEFAULT_MODEL`	No	Default model for gemini_chat and analyze_image (default: gemini-3.1-pro-preview)	gemini-3.1-pro-preview
`GEMINI_AGY_AUTO_APPROVE`	No	Auto-approve agent tool calls (default: true)	true
`GEMINI_IMAGE_OUTPUT_DIR`	No	Auto-save directory for generated images and videos
`GEMINI_DEFAULT_GROUNDING`	No	Enable Google Search grounding by default (default: true)	true
`GEMINI_ALLOW_EXPERIMENTAL`	No	Include experimental/preview models in auto-discovery (default: false)	false
`GEMINI_AGY_TIMEOUT_SECONDS`	No	Default hard timeout per agent run (default: 600)	600
`GEMINI_DEEP_RESEARCH_AGENT`	No	Deep Research agent for gemini_deep_research (default: deep-research-pro-preview-12-2025)	deep-research-pro-preview-12-2025

Capabilities

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": true }
`resources`	{ "listChanged": true }

Tools

Functions exposed to the LLM to take actions

Name	Description
gemini_helpA	Get comprehensive help about Gemini MCP features, settings, and best practices
gemini_prompt_assistantA	Get expert prompt templates and guidance for Gemini image generation. Covers photography (portraits, products, cinematic), chart/diagram design (9 professional design systems including FT, Bloomberg, Tufte, Du Bois), lighting, colour grading, lens simulation, and style aesthetics. For charts: use chart_design with a color_scheme to get a full professional design system prompt.
gemini_chatC	Chat with Google Gemini models
gemini_list_modelsA	List available Gemini models and their descriptions
gemini_deep_researchA	Conduct real Deep Research on complex topics: Google's autonomous Deep Research agent does multi-step live web search and returns a synthesised, cited report (takes several minutes). Runs the actual Deep Research agents via the Gemini Interactions API. [MCP_RECOMMENDED_TIMEOUT_MS: 900000]
describe_imageA	Analyze and describe one or more images using Google Gemini image models (Nano Banana Pro). Returns a text description — no image is generated. Default model: gemini-3-flash-preview.
analyze_imageB	Analyze and extract information from one or more images using Gemini multimodal understanding. Returns a text analysis — no image is generated. Default model: gemini-3-pro-preview.
load_image_from_pathA	Read a local image file and return it as base64-encoded data ready to pass to generate_image, edit_image, describe_image, or analyze_image tools. Supports JPEG, PNG, GIF, WebP, BMP.
generate_imageB	Generate an image using Google Gemini image models (Nano Banana Pro). Returns image with inline preview in Claude Desktop and saves full-resolution to disk. Default model: gemini-3-pro-image-preview.
edit_imageA	Edit one or more images using Google Gemini image models (Nano Banana Pro). Provide images and natural-language instructions for how to modify them. Returns edited image with inline preview and saves full-resolution to disk.
generate_landing_pageA	Generate a complete, self-contained HTML landing page using Gemini. Returns inline preview with responsive viewport controls. No external dependencies; inline CSS and vanilla JS only.
generate_svgA	Generate scalable vector graphics (SVG) using Gemini. Creates clean, production-ready SVG code for diagrams, illustrations, icons, and data visualizations. Returns inline preview with SVG viewer.
generate_videoA	Generate videos using Google Veo 3.1 AI model. Creates realistic 4-8 second videos from text prompts with optional first-frame image and reference images for character/style consistency. Supports native audio generation. Processing time: 2-5 minutes for 1080p videos. Returns video file path with optional thumbnail and HTML preview player. ⚠️ IMPORTANT: Video generation is ASYNC and takes 2-5 minutes. The tool will poll for completion automatically.
gemini_agentA	Delegate a task to an autonomous Gemini agent (via the `agy` CLI). Unlike gemini_chat (a single text response), the agent can read and edit files and run shell commands inside a working directory to actually carry out multi-step work, then returns its final report. Use it to offload self-contained coding/research/automation tasks. Returns a conversation_id you can pass back to continue iterating with the same agent. Pick a "Flash" model for speed, a "Pro" model for harder reasoning (see gemini_agent_models).
gemini_agent_modelsA	List the model labels available to the Gemini agent (gemini_agent). These come from `agy models` and differ from the API models in gemini_list_models — use the exact label as the agent's "model".
ocrA	Extract text verbatim from images or PDFs using Gemini multimodal OCR. Returns the raw text (as Markdown for structure) — no summarising or analysis. For documents/PDFs, MEDIUM resolution gives the same OCR quality at half the token cost.
generate_summaryA	Summarise text or a local file with Gemini. Provide `text`, or `file_path` to almost any file: text/code, PDF, images, audio or video (Office docs must be exported to PDF first). Choose the length and optionally a focus or output language.
transcribeB	Transcribe speech from an audio or video file to text using Gemini. Optional timestamps, speaker labels (diarization) and language hint. Provide a local file_path (loaded server-side) or inline base64 data.
extract_structured_dataA	Extract structured JSON from text or almost any local file (text/code, PDF, image, audio, video) using Gemini JSON mode. Describe the fields you want in `instructions`, optionally constrain the output with a `json_schema` (Gemini/OpenAPI subset). Returns parsed JSON.

Prompts

Interactive templates invoked by user choice

Name	Description
No prompts

Resources

Contextual data attached and managed by the client

Name	Description
`gemini-video-viewer`
`gemini-image-viewer`
`gemini-svg-viewer`
`gemini-landing-page-viewer`

Server Configuration
Capabilities
Tools
Prompts
Resources

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Raindancer118/gemini-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server