Integrations
Integrates with Google's Veo2 video generation capabilities, allowing generation of videos from text prompts or images with various configuration options such as aspect ratio, duration, and person generation settings.
MCP Video Generation with Veo2
This project implements a Model Context Protocol (MCP) server that exposes Google's Veo2 video generation capabilities. It allows clients to generate videos from text prompts or images, and access the generated videos through MCP resources.
Features
- Generate videos from text prompts
- Generate videos from images
- Access generated videos through MCP resources
- Example video generation templates
- Support for both stdio and SSE transports
Example Images
Example Image to Video
Image to Video - from Grok generated puppy
Image to Video - from real cat
Prerequisites
- Node.js 18 or higher
- Google API key with access to Gemini API and Veo2 model (= You need to set up a credit card with your API key! -> Go to aistudio.google.com )
Installation
Installing in FLUJO
- Click Add Server
- Copy & Paste Github URL into FLUJO
- Click Parse, Clone, Install, Build and Save.
Installing via Smithery
To install mcp-video-generation-veo2 for Claude Desktop automatically via Smithery:
Manual Installation
- Clone the repository:Copy
- Install dependencies:Copy
- Create a
.env
file with your Google API key:TheCopy.env
file supports the following variables:GOOGLE_API_KEY
: Your Google API key (required)PORT
: Server port (default: 3000)STORAGE_DIR
: Directory for storing generated videos (default: ./generated-videos)LOG_LEVEL
: Logging level (default: fatal)- Available levels: verbose, debug, info, warn, error, fatal, none
- For development, set to
debug
orinfo
for more detailed logs - For production, keep as
fatal
to minimize console output
- Build the project:Copy
Usage
Starting the Server
You can start the server with either stdio or SSE transport:
stdio Transport (Default)
SSE Transport
This will start the server on port 3000 (or the port specified in your .env
file).
MCP Tools
The server exposes the following MCP tools:
generateVideoFromText
Generates a video from a text prompt.
Parameters:
prompt
(string): The text prompt for video generationconfig
(object, optional): Configuration optionsaspectRatio
(string, optional): "16:9" or "9:16"personGeneration
(string, optional): "dont_allow" or "allow_adult"numberOfVideos
(number, optional): 1 or 2durationSeconds
(number, optional): Between 5 and 8enhancePrompt
(boolean, optional): Whether to enhance the promptnegativePrompt
(string, optional): Text describing what not to generate
Example:
generateVideoFromImage
Generates a video from an image.
Parameters:
image
(string): Base64-encoded image dataprompt
(string, optional): Text prompt to guide the video generationconfig
(object, optional): Configuration options (same as above, but personGeneration only supports "dont_allow")
listGeneratedVideos
Lists all generated videos.
MCP Resources
The server exposes the following MCP resources:
videos://{id}
Access a generated video by its ID.
videos://templates
Access example video generation templates.
Development
Project Structure
src/
: Source codeindex.ts
: Main entry pointserver.ts
: MCP server configurationconfig.ts
: Configuration handlingtools/
: MCP tool implementationsresources/
: MCP resource implementationsservices/
: External service integrationsutils/
: Utility functions
Building
Development Mode
License
MIT
You must be authenticated.
MCP server that exposes Google's Veo2 video generation capabilities, allowing clients to generate videos from text prompts or images.