gemini-media-mcp
Server Configuration
Environment variables read by the server. None is individually required; the API key may be supplied as either GEMINI_API_KEY or GOOGLE_API_KEY.
| Name | Required | Description | Default |
|---|---|---|---|
| GEMINI_API_KEY | No | Gemini API key. GOOGLE_API_KEY is also accepted. | |
| GOOGLE_API_KEY | No | Gemini API key. GEMINI_API_KEY is also accepted. | |
| MEDIA_OUTPUT_DIR | No | Directory for saved media files. | ~/generated_media |
| GOOGLE_CLOUD_PROJECT | No | GCP project ID for Vertex AI backend. | |
| GOOGLE_CLOUD_LOCATION | No | GCP region for Vertex AI. | us-central1 |
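As a rough illustration of how the variables in the table fit together, the following preflight check summarizes what a launch environment provides. It is a sketch and not the server's own resolution logic: the function name `summarize_media_env` is hypothetical, and it makes no claim about which backend the server picks when both an API key and a GCP project are set.

```python
import os
from pathlib import Path


def summarize_media_env(env=None):
    """Summarize the documented variables (hypothetical helper, not server code)."""
    env = os.environ if env is None else env
    # Either variable name is documented as accepted for the Gemini API key.
    api_key = env.get("GEMINI_API_KEY") or env.get("GOOGLE_API_KEY")
    # Defaults below are copied from the "Default" column of the table.
    output_dir = env.get("MEDIA_OUTPUT_DIR") or "~/generated_media"
    return {
        "api_key_set": bool(api_key),
        "vertex_project": env.get("GOOGLE_CLOUD_PROJECT") or None,
        "vertex_location": env.get("GOOGLE_CLOUD_LOCATION") or "us-central1",
        "output_dir": str(Path(output_dir).expanduser()),
    }


if __name__ == "__main__":
    for key, value in summarize_media_env().items():
        print(f"{key}: {value}")
```

Running it before starting the server shows whether any credential is present and where media would be saved under the documented defaults.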
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tools | { "listChanged": true } |
| logging | {} |
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| animate_image | Animate a still image into a video clip. Provide the path to a source image and a prompt guiding the animation. This is an async operation: use video_status to poll progress and download_video to retrieve the result. |
| compose_images | Compose a new image using 1-3 reference images and a text prompt for style/content guidance. |
| download_video | Download a completed video generation to a local file. Only call this after video_status reports the operation is complete. |
| edit_image | Edit an existing image using a text prompt. Provide the path to the source image and a description of the desired changes. |
| extend_video | Extend a previously generated video with a continuation prompt. Requires the operation ID from the original generation. This is an async operation: use video_status to poll progress and download_video to retrieve the result. |
| generate_audio | Generate speech audio from a text prompt using Google's Gemini TTS. Supports voice selection and language configuration. |
| generate_image | Generate an image from a text prompt using Google's Gemini image models. |
| generate_music | Generate music from a text prompt using Google's Lyria models. Supports genre, instruments, BPM, key, mood, structure tags like [Verse] [Chorus] [Bridge], and custom lyrics. |
| generate_video | Generate a video from a text prompt using Google's Gemini video models. This is an async operation: use video_status to poll progress and download_video to retrieve the result. |
| get_config | Show current server configuration including active backend and output directory. |
| list_models | List the models supported by this server with their tiers, capabilities, supported resolutions, and pricing guidance. |
| video_status | Check the status of an async video generation operation. Returns progress info (pending, processing, complete, or failed). |
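The async video tools share one pattern: start an operation, poll video_status until it reports complete or failed, then call download_video. A minimal sketch of that loop follows. The `call_tool` callable stands in for whatever MCP client is in use, and the `operation_id` argument name and the `status` result key are assumptions made for illustration; the listing documents only the tool names and the four status values.

```python
import time
from typing import Any, Callable, Dict

# Hypothetical client hook: (tool_name, arguments) -> parsed tool result.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def wait_and_download(
    call_tool: ToolCaller,
    operation_id: str,
    poll_seconds: float = 10.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll video_status, then call download_video once the operation completes."""
    for _ in range(max_polls):
        # "operation_id" and "status" are assumed field names, not confirmed ones.
        result = call_tool("video_status", {"operation_id": operation_id})
        status = result.get("status")
        if status == "complete":
            return call_tool("download_video", {"operation_id": operation_id})
        if status == "failed":
            raise RuntimeError(f"video operation {operation_id} failed")
        # "pending" or "processing": wait before asking again.
        sleep(poll_seconds)
    raise TimeoutError(f"video operation {operation_id} did not finish in time")
```

Capping the number of polls keeps a stuck operation from blocking the caller indefinitely, and the injectable `sleep` makes the loop easy to exercise without real delays.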
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
| No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
| No resources | |
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/mordor-forge/gemini-media-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server.