Schema | gemini-media-mcp

gemini-media-mcp

Overview Schema Related Servers Score Discussions

Server Configuration

Describes the environment variables required to run the server.

Name	Required	Description	Default
`GEMINI_API_KEY`	No	Gemini API key. GOOGLE_API_KEY is also accepted.
`GOOGLE_API_KEY`	No	Gemini API key. GEMINI_API_KEY is also accepted.
`MEDIA_OUTPUT_DIR`	No	Directory for saved media files.	~/generated_media
`GOOGLE_CLOUD_PROJECT`	No	GCP project ID for Vertex AI backend.
`GOOGLE_CLOUD_LOCATION`	No	GCP region for Vertex AI.	us-central1

Capabilities

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": true }
`logging`	{}

Tools

Functions exposed to the LLM to take actions

Name	Description
animate_imageA	Animate a still image into a video clip. Provide the path to a source image and a prompt guiding the animation. This is an async operation — use video_status to poll progress and download_video to retrieve the result.
compose_imagesA	Compose a new image using 1-3 reference images and a text prompt for style/content guidance.
download_videoA	Download a completed video generation to a local file. Only call this after video_status reports the operation is complete.
edit_imageB	Edit an existing image using a text prompt. Provide the path to the source image and a description of the desired changes.
extend_videoA	Extend a previously generated video with a continuation prompt. Requires the operation ID from the original generation. This is an async operation — use video_status to poll progress and download_video to retrieve the result.
generate_audioA	Generate speech audio from a text prompt using Google's Gemini TTS. Supports voice selection and language configuration.
generate_imageB	Generate an image from a text prompt using Google's Gemini image models.
generate_musicA	Generate music from a text prompt using Google's Lyria models. Supports genre, instruments, BPM, key, mood, structure tags like [Verse] [Chorus] [Bridge], and custom lyrics.
generate_videoB	Generate a video from a text prompt using Google's Gemini video models. This is an async operation — use video_status to poll progress and download_video to retrieve the result.
get_configA	Show current server configuration including active backend and output directory.
list_modelsA	List the models supported by this server with their tiers, capabilities, supported resolutions, and pricing guidance.
video_statusA	Check the status of an async video generation operation. Returns progress info (pending, processing, complete, or failed).

Prompts

Interactive templates invoked by user choice

Name	Description
No prompts

Resources

Contextual data attached and managed by the client

Name	Description
No resources

Server Configuration
Capabilities
Tools
Prompts
Resources

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/mordor-forge/gemini-media-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server