MCP Video Generation with Veo2

This project implements a Model Context Protocol (MCP) server that exposes Google's Veo2 video generation capabilities. It allows clients to generate videos from text prompts or images, and access the generated videos through MCP resources.

Features

Generate videos from text prompts
Generate videos from images
Access generated videos through MCP resources
Example video generation templates
Support for both stdio and SSE transports

Example Images

1dec9c71-07dc-4a6e-9e17-8da355d72ba1

Example Image to Video

Image to Video - from Grok generated puppy

Image to Video - from real cat

Prerequisites

Node.js 18 or higher
Google API key with access to Gemini API and Veo2 model (= You need to set up a credit card with your API key! -> Go to aistudio.google.com )

Installation

Installing in FLUJO

Click Add Server
Copy & Paste Github URL into FLUJO
Click Parse, Clone, Install, Build and Save.

Installing via Smithery

To install mcp-video-generation-veo2 for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @mario-andreschak/mcp-veo2 --client claude

Manual Installation

Clone the repository:
git clone https://github.com/yourusername/mcp-video-generation-veo2.git cd mcp-video-generation-veo2
Install dependencies:
npm install
Create a .env file with your Google API key:
cp .env.example .env # Edit .env and add your Google API key
The .env file supports the following variables:
- GOOGLE_API_KEY: Your Google API key (required)
- PORT: Server port (default: 3000)
- STORAGE_DIR: Directory for storing generated videos (default: ./generated-videos)
- LOG_LEVEL: Logging level (default: fatal)
  - Available levels: verbose, debug, info, warn, error, fatal, none
  - For development, set to debug or info for more detailed logs
  - For production, keep as fatal to minimize console output
Build the project:
npm run build

Usage

Starting the Server

You can start the server with either stdio or SSE transport:

stdio Transport (Default)

npm start
# or
npm start stdio

SSE Transport

npm start sse

This will start the server on port 3000 (or the port specified in your .env file).

MCP Tools

The server exposes the following MCP tools:

generateVideoFromText

Generates a video from a text prompt.

Parameters:

prompt (string): The text prompt for video generation
config (object, optional): Configuration options
- aspectRatio (string, optional): "16:9" or "9:16"
- personGeneration (string, optional): "dont_allow" or "allow_adult"
- numberOfVideos (number, optional): 1 or 2
- durationSeconds (number, optional): Between 5 and 8
- enhancePrompt (boolean, optional): Whether to enhance the prompt
- negativePrompt (string, optional): Text describing what not to generate

Example:

{
  "prompt": "Panning wide shot of a serene forest with sunlight filtering through the trees, cinematic quality",
  "config": {
    "aspectRatio": "16:9",
    "personGeneration": "dont_allow",
    "durationSeconds": 8
  }
}

generateVideoFromImage

Generates a video from an image.

Parameters:

image (string): Base64-encoded image data
prompt (string, optional): Text prompt to guide the video generation
config (object, optional): Configuration options (same as above, but personGeneration only supports "dont_allow")

listGeneratedVideos

Lists all generated videos.

MCP Resources

The server exposes the following MCP resources:

videos://{id}

Access a generated video by its ID.

videos://templates

Access example video generation templates.

Development

Project Structure

src/: Source code
- index.ts: Main entry point
- server.ts: MCP server configuration
- config.ts: Configuration handling
- tools/: MCP tool implementations
- resources/: MCP resource implementations
- services/: External service integrations
- utils/: Utility functions

Building

npm run build

Development Mode

npm run dev

License

MIT

Install Server

HTTP connection URL

security – no known vulnerabilities

license - permissive license

quality - confirmed to work

How are these scores calculated?

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

MCP server that exposes Google's Veo2 video generation capabilities, allowing clients to generate videos from text prompts or images.

Related Resources

Reddit Discussion about this server

Related MCP Servers

YouTube MCP Integration
spolepaka
-
security
F
license
-
quality
An MCP server that enables LLMs to search YouTube, retrieve video information, and access video transcripts through standardized tools.
Last updated -
TypeScript
Image Generator MCP Server
luoshui-coder
-
security
-
license
-
quality
A TypeScript-based MCP server that lets users generate images using OpenAI's dall-e-3 model by providing a prompt and image name.
Last updated -
1
MCP Grareco
iuill
-
security
A
license
-
quality
A MCP server that creates graphic recordings by converting websites or text input into visual summaries using different prompt styles (standard, elementary, timeline).
Last updated -
TypeScript
MIT License
FLUX Image Generator MCP Server
frankdeno
A
security
F
license
A
quality
An MCP server that generates images based on text prompts using Black Forest Lab's FLUX model, allowing for customized image dimensions, prompt upsampling, safety settings, and batch generation.
Last updated -
3
TypeScript

View all related MCP servers

MCP Video Generation with Veo2