AI Video Generator MCP Server
This MCP (Model Context Protocol) server provides tools for generating videos from text prompts and images using AI video generation models.
Compatible Models
- Luma Ray2 Flash: Luma's cutting-edge image-to-video model
- Kling v1.6 Pro: Kling's high-quality image-to-video model
Features
- Generate videos from text prompts
- Generate videos with start and/or end images
- Control video parameters (aspect ratio, resolution, duration, loop)
- Check the status of a generation request
- Choose which AI model to use
Installation
- Clone this repository.
- Install the dependencies.
- Create a .env file and set your FAL.AI API key in it. You can get the API key from FAL.AI.
Building the Server
Running the Server
You can run the server directly:
Integration with Claude Desktop
To use this server with Claude Desktop, add the following to your claude_desktop_config.json file:
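As a rough illustration of what such an entry might look like, the sketch below builds the configuration as a TypeScript object and prints it as JSON. The `mcpServers` / `command` / `args` / `env` layout is the general shape Claude Desktop uses for MCP servers; the server name `ai-video-generator`, the `node` command, the build output path, and the placeholder key are all assumptions for illustration, not values taken from this repository.

```typescript
// Sketch only: the server name, command, path, and key below are placeholders.
const claudeDesktopConfig = {
  mcpServers: {
    // Hypothetical name; any unique label should work here.
    "ai-video-generator": {
      // Assumes the built server is a Node.js script.
      command: "node",
      // Hypothetical path; point this at the actual build output of the server.
      args: ["/absolute/path/to/server/build/index.js"],
      // FAL_KEY is the environment variable named in the troubleshooting section.
      env: { FAL_KEY: "your-fal-ai-api-key" },
    },
  },
};

// Print the JSON that would be merged into claude_desktop_config.json.
console.log(JSON.stringify(claudeDesktopConfig, null, 2));
```

Setting `FAL_KEY` in the `env` block is one option; a .env file next to the server is the other one this document mentions.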
Available Tools
generate-video
Generates a video from a text prompt and/or images using the selected AI model.
Parameters:
- prompt (required): A text description of the video you want to generate.
- image_url (optional): The start image for the video (URL or base64 data URI).
- end_image_url (optional): The end image for the video (URL or base64 data URI).
- aspect_ratio (default "16:9"): Video aspect ratio ("16:9", "9:16", "4:3", "3:4", "21:9", "9:21").
- resolution (default "540p"): Video resolution ("540p", "720p", "1080p").
- duration (default "5s"): Video length ("5s", "9s").
- loop (default false): Whether the video should loop.
- model (default "luma"): AI model to use ("luma"=Ray2, "kling"=Kling v1.6 Pro).
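The parameter list above can be restated as a small validation sketch. This is not the server's actual implementation; the function name `resolveGenerateVideoArgs` and the type names are hypothetical, and only the parameter names, allowed values, and defaults come from the list above.

```typescript
// Allowed values as documented for the generate-video tool.
const ASPECT_RATIOS = ["16:9", "9:16", "4:3", "3:4", "21:9", "9:21"] as const;
const RESOLUTIONS = ["540p", "720p", "1080p"] as const;
const DURATIONS = ["5s", "9s"] as const;
const MODELS = ["luma", "kling"] as const;

// Hypothetical type describing the arguments after defaults are applied.
interface ResolvedGenerateVideoArgs {
  prompt: string;
  image_url?: string;
  end_image_url?: string;
  aspect_ratio: (typeof ASPECT_RATIOS)[number];
  resolution: (typeof RESOLUTIONS)[number];
  duration: (typeof DURATIONS)[number];
  loop: boolean;
  model: (typeof MODELS)[number];
}

// Returns the fallback when the value is missing, otherwise checks it against the allowed set.
function pickOption<T extends string>(
  name: string,
  value: unknown,
  allowed: readonly T[],
  fallback: T
): T {
  if (value === undefined) return fallback;
  if (typeof value === "string" && allowed.indexOf(value as T) !== -1) return value as T;
  throw new Error(`${name} must be one of: ${allowed.join(", ")}`);
}

// Hypothetical helper: applies the documented defaults and rejects undocumented values.
function resolveGenerateVideoArgs(input: Record<string, unknown>): ResolvedGenerateVideoArgs {
  const prompt = input["prompt"];
  if (typeof prompt !== "string" || prompt.trim() === "") {
    throw new Error("prompt is required and must be a non-empty string");
  }
  const loop = input["loop"];
  if (loop !== undefined && typeof loop !== "boolean") {
    throw new Error("loop must be a boolean");
  }
  const resolved: ResolvedGenerateVideoArgs = {
    prompt,
    aspect_ratio: pickOption("aspect_ratio", input["aspect_ratio"], ASPECT_RATIOS, "16:9"),
    resolution: pickOption("resolution", input["resolution"], RESOLUTIONS, "540p"),
    duration: pickOption("duration", input["duration"], DURATIONS, "5s"),
    loop: loop === undefined ? false : loop,
    model: pickOption("model", input["model"], MODELS, "luma"),
  };
  const imageUrl = input["image_url"];
  if (typeof imageUrl === "string") resolved.image_url = imageUrl;
  const endImageUrl = input["end_image_url"];
  if (typeof endImageUrl === "string") resolved.end_image_url = endImageUrl;
  return resolved;
}

// Usage: only prompt and model are given, everything else falls back to the defaults.
const exampleArgs = resolveGenerateVideoArgs({ prompt: "A paper boat drifting down a stream", model: "kling" });
console.log(exampleArgs);
```

Validating on the client side like this is optional; it mainly catches typos such as an unsupported duration before a request is sent.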
check-video-status
Check the status of your video generation request.
Parameters:
- request_id (required): The request ID to check.
- model (default "luma"): AI model used for the request ("luma"=Ray2, "kling"=Kling v1.6 Pro).
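Because generation can take a while, a client will typically call check-video-status repeatedly until the request finishes. The sketch below shows one way to do that with a generic polling helper. The helper name `pollUntil`, its options, and the idea of passing in a status-check function are assumptions; the format of the status response is not documented here, so the caller supplies both the check and the "is it done" test.

```typescript
// Hypothetical helper: repeatedly runs `check` until `isDone` accepts its result.
async function pollUntil<T>(
  check: () => Promise<T>,
  isDone: (result: T) => boolean,
  options: { intervalMs?: number; maxAttempts?: number } = {}
): Promise<T> {
  const intervalMs = options.intervalMs ?? 5000;
  const maxAttempts = options.maxAttempts ?? 60;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const result = await check();
    if (isDone(result)) return result;
    // Wait before the next check, but not after the final attempt.
    if (attempt < maxAttempts) {
      await new Promise<void>((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`Request did not finish after ${maxAttempts} status checks`);
}
```

In practice, `check` would wrap a call to the check-video-status tool with the `request_id` and `model` of the original request, and `isDone` would inspect whatever status field the tool returns.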
Claude Usage Example
Claude calls the generate-video tool with the appropriate parameters and returns the resulting video URL.
Model Comparison
- Luma Ray2 Flash: Excellent for smooth motion and realistic physics, producing natural results.
- Kling v1.6 Pro: Excellent for detailed textures and special effects, producing stylized results.
Depending on the prompt and the desired outcome, different models may work best.
Limitations
- Video generation may take some time (especially at higher resolutions)
- A valid FAL.AI API key and sufficient credits are required
- Higher resolution and longer videos cost more credits
- Both models consume FAL.AI credits (prices may vary per model)
Troubleshooting
API Key Error
Make sure the FAL_KEY environment variable is set correctly. Alternatively, set it directly in the Claude Desktop configuration file.
Video Generation Error
If an error occurs during video generation, a detailed error message will be logged. Common issues are:
- Invalid or expired API key
- Insufficient credits on your account
- Inappropriate prompts or images
- A temporary server-side issue
If the error persists, wait a while and try again, or try changing the prompt.
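The "wait and try again" advice can be automated for temporary failures. The following is a minimal sketch of retrying with exponential backoff; the name `retryWithBackoff`, its parameters, and the `isTransient` predicate are hypothetical. Errors such as an invalid API key or a rejected prompt will not be fixed by retrying, so the caller decides which errors count as temporary.

```typescript
// Hypothetical helper: retries `operation` only for errors the caller marks as transient.
async function retryWithBackoff<T>(
  operation: () => Promise<T>,
  isTransient: (error: unknown) => boolean,
  maxAttempts: number = 3,
  baseDelayMs: number = 2000
): Promise<T> {
  let lastError: unknown = new Error("maxAttempts must be at least 1");
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await operation();
    } catch (error) {
      lastError = error;
      // Give up immediately on permanent errors or when attempts are exhausted.
      if (!isTransient(error) || attempt === maxAttempts - 1) break;
      // Double the wait after each failed attempt: base, 2x base, 4x base, ...
      const delayMs = baseDelayMs * Math.pow(2, attempt);
      await new Promise<void>((resolve) => setTimeout(resolve, delayMs));
    }
  }
  throw lastError;
}
```

Keeping `maxAttempts` small is a deliberate choice here, since each generation attempt may consume FAL.AI credits.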
License
MIT