The MCP Video Generation with Veo2 server allows users to generate and manage videos and images using Google's Veo2 and Imagen models. You can:
- Generate videos from text prompts with configurable options like aspect ratio, duration, and person generation settings 
- Generate videos from images (provided as base64 data, URI, or ID), optionally guided by text prompts 
- Generate images from text prompts using Google Imagen, specifying the number of images desired 
- Generate video from generated image in a single step, using separate prompts for image and video creation 
- List generated videos and images that have been previously created 
- Retrieve specific generated images by their unique ID 
Integrates with Google's Veo2 video generation capabilities, allowing generation of videos from text prompts or images with various configuration options such as aspect ratio, duration, and person generation settings.
MCP Video Generation with Veo2
This project implements a Model Context Protocol (MCP) server that exposes Google's Veo2 video generation capabilities. It allows clients to generate videos from text prompts or images, and access the generated videos through MCP resources.
Features
- Generate videos from text prompts 
- Generate videos from images 
- Access generated videos through MCP resources 
- Example video generation templates 
- Support for both stdio and SSE transports 
Example Images
Example Image to Video
Image to Video - from Grok generated puppy
Image to Video - from real cat
Prerequisites
- Node.js 18 or higher 
- Google API key with access to Gemini API and Veo2 model (= You need to set up a credit card with your API key! -> Go to aistudio.google.com ) 
Installation
Installing in FLUJO
- Click Add Server 
- Copy & Paste Github URL into FLUJO 
- Click Parse, Clone, Install, Build and Save. 
Installing via Smithery
To install mcp-video-generation-veo2 for Claude Desktop automatically via Smithery:
Manual Installation
- Clone the repository: git clone https://github.com/yourusername/mcp-video-generation-veo2.git cd mcp-video-generation-veo2
- Install dependencies: npm install
- Create a - .envfile with your Google API key:cp .env.example .env # Edit .env and add your Google API key- The - .envfile supports the following variables:- GOOGLE_API_KEY: Your Google API key (required)
- PORT: Server port (default: 3000)
- STORAGE_DIR: Directory for storing generated videos (default: ./generated-videos)
- LOG_LEVEL: Logging level (default: fatal)- Available levels: verbose, debug, info, warn, error, fatal, none 
- For development, set to - debugor- infofor more detailed logs
- For production, keep as - fatalto minimize console output
 
 
- Build the project: npm run build
Usage
Starting the Server
You can start the server with either stdio or SSE transport:
stdio Transport (Default)
SSE Transport
This will start the server on port 3000 (or the port specified in your .env file).
MCP Tools
The server exposes the following MCP tools:
generateVideoFromText
Generates a video from a text prompt.
Parameters:
- prompt(string): The text prompt for video generation
- config(object, optional): Configuration options- aspectRatio(string, optional): "16:9" or "9:16"
- personGeneration(string, optional): "dont_allow" or "allow_adult"
- numberOfVideos(number, optional): 1 or 2
- durationSeconds(number, optional): Between 5 and 8
- enhancePrompt(boolean, optional): Whether to enhance the prompt
- negativePrompt(string, optional): Text describing what not to generate
 
Example:
generateVideoFromImage
Generates a video from an image.
Parameters:
- image(string): Base64-encoded image data
- prompt(string, optional): Text prompt to guide the video generation
- config(object, optional): Configuration options (same as above, but personGeneration only supports "dont_allow")
listGeneratedVideos
Lists all generated videos.
MCP Resources
The server exposes the following MCP resources:
videos://{id}
Access a generated video by its ID.
videos://templates
Access example video generation templates.
Development
Project Structure
- src/: Source code- index.ts: Main entry point
- server.ts: MCP server configuration
- config.ts: Configuration handling
- tools/: MCP tool implementations
- resources/: MCP resource implementations
- services/: External service integrations
- utils/: Utility functions
 
Building
Development Mode
License
MIT
remote-capable server
The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.
MCP server that exposes Google's Veo2 video generation capabilities, allowing clients to generate videos from text prompts or images.
Related Resources
Related MCP Servers
- -security-license-qualityMCP server to interact with Google produts.Last updated -439MIT License
- Asecurity-licenseAqualityMCP (Model Context Protocol) server that utilizes the Google Gemini Vision API to interact with YouTube videos. It allows users to get descriptions, summaries, answers to questions, and extract key moments from YouTube videos.Last updated -476MIT License
- -security-license-qualityAn MCP server that enables authentication and authorization with Google's OAuth2 API, allowing users to securely authenticate and access Google services through natural language interactions.Last updated -
- -security-license-qualityAn auto-generated MCP server for Google's Serverless VPC Access API, enabling communication with Google Cloud VPC networks through natural language interactions.Last updated -