Generate Video
generate_videoCreate realistic 4-8 second videos from text prompts using AI. Supports optional starting image and reference images for consistency, with synchronized audio.
Instructions
Generate videos using Google Veo 3.1 AI model. Creates realistic 4-8 second videos from text prompts with optional first-frame image and reference images for character/style consistency. Supports native audio generation. Processing time: 2-5 minutes for 1080p videos. Returns video file path with optional thumbnail and HTML preview player. ⚠️ IMPORTANT: Video generation is ASYNC and takes 2-5 minutes. The tool will poll for completion automatically.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | Detailed description of the video to generate. Be specific about actions, camera movements, lighting, and style. Example: "A close-up shot of a futuristic coffee machine brewing a glowing blue espresso, with steam rising dramatically. Cinematic lighting, 4K quality." | |
| model | No | Video generation model (default: veo-3.1-generate-preview) | veo-3.1-generate-preview |
| aspectRatio | No | Video aspect ratio: 16:9 (landscape) or 9:16 (portrait/vertical) | 16:9 |
| resolution | No | Video resolution. Higher resolutions take longer to generate and result in larger files. | 1080p |
| durationSeconds | No | Video duration in seconds (4, 6, or 8 seconds) | |
| generateAudio | No | Generate native synchronized audio effects and dialogue based on the prompt | |
| sampleCount | No | Number of video samples to generate (1-4). Each sample is a separate generation. | |
| seed | No | Optional seed for deterministic output. Use the same seed with the same prompt for consistent results. | |
| outputPath | No | Optional custom output path for the video file (e.g., C:/videos/output.mp4). If not provided, saves to default output directory with timestamped filename. | |
| generateThumbnail | No | Extract thumbnail from video (requires ffmpeg installed). Thumbnail is saved alongside video. | |
| generateHTMLPlayer | No | Generate interactive HTML video player with preview and download options | |
| firstFrameImage | No | Starting frame image for image-to-video generation. Provide via filePath (local file) or data+mimeType (base64). The video will animate from this image. Supports JPEG, PNG, WebP. | |
| referenceImages | No | Up to 3 reference images for character/style consistency. Each needs a referenceType ("asset" or "style") and an image. |