Gemini Omni MCP
Provides tools to generate and edit videos using Google's Gemini Omni model, supporting text-to-video, image-to-video, reference images, and video editing with batch generation.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Gemini Omni MCPGenerate a 5-second video of a sunset over mountains"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Gemini Omni MCP
FastMCP server for Google's gemini-omni-flash-preview video model. It supports text-to-video, image-to-video, multiple reference images, uploaded-video edits, stateful edits with previous_interaction_id, URI delivery, inline delivery, and batch generation.
Install
uvx gemini-omni-mcpOr from source:
uv sync --all-extras
uv run gemini-omni-mcpRelated MCP server: veo-mcp-server
Configuration
export GEMINI_API_KEY=your_key_here
export OUTPUT_DIR=~/gemini_omni_videosOptional settings include REQUEST_TIMEOUT, FILE_POLL_INTERVAL, FILE_POLL_TIMEOUT, MAX_BATCH_SIZE, DEFAULT_ASPECT_RATIO, DEFAULT_DELIVERY, and DEFAULT_DURATION_SECONDS.
Tools
generate_video
Generates one MP4 and returns JSON with video.path, interaction_id, and metadata.
Important arguments:
prompt: scene, motion, camera, lighting, mood, and audio directiontask:text_to_video,image_to_video,reference_to_video, oreditaspect_ratio:16:9or9:16duration_seconds: optional preview field,3to10reference_image_paths: up to 6 local image pathsinput_video_path: local MP4 to upload and editdelivery:uriorinlineprevious_interaction_id: continue editing a generated video
batch_generate
Runs multiple prompts in conservative parallel batches, capped at 4.
Prompting tips
Use "single continuous shot" and "no scene cuts" for one-scene outputs.
Include audio direction, for example "gentle ambient sound, no dialogue".
For edits, keep the prompt short and add "Keep everything else the same".
Use
<FIRST_FRAME>and<IMAGE_REF_N>tags to bind reference-image roles.Timing cues like
[0-3s],[3-6s], and[6-10s]work well.
Development
uv sync --all-extras
uv run ruff format .
uv run ruff check .
uv run mypy gemini_omni_mcp/
uv run pytest
uv buildThis server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/nikships/gemini-omni-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server