kling_generate_video
Generate AI video from a text description. Specify scene, motion, style, and mood to create high-quality video without reference images.
Instructions
Generate AI video from a text prompt using Kling.
This is the simplest way to create video - just describe what you want and Kling
will generate a high-quality AI video.
Use this when:
- You want to create a video from a text description
- You don't have reference images
- You want quick video generation
For using reference images (start/end frames), use kling_generate_video_from_image instead.
Returns:
Task ID and generated video information including URLs and state.Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| prompt | Yes | Description of the video to generate. Be descriptive about the scene, motion, style, and mood. Examples: 'A cat walking through a garden with butterflies', 'Astronauts shuttle from space to volcano', 'Ocean waves crashing on a beach at sunset' | |
| model | No | Kling model to use. Options: 'kling-v1', 'kling-v1-6', 'kling-v2-master' (default), 'kling-v2-1-master', 'kling-v2-5-turbo', 'kling-v2-6', 'kling-v3', 'kling-v3-omni', 'kling-video-o1'. | kling-v2-master |
| mode | No | Generation mode. 'std' (standard, default) for faster generation, 'pro' for higher quality, '4k' for native 4K (only supported by kling-v3 and kling-v3-omni, not compatible with motion control). | std |
| aspect_ratio | No | Video aspect ratio. Options: '16:9' (landscape, default), '9:16' (portrait), '1:1' (square). | 16:9 |
| duration | No | Video duration in seconds. For kling-v3/kling-v3-omni: 3-15 (integer). Other models: 5 or 10. | |
| generate_audio | No | Whether to generate audio synchronously. Supported by kling-v3, kling-v3-omni, and kling-v2-6 (pro mode only). Default is false. | |
| negative_prompt | No | Things to avoid in the video. Example: 'blurry, low quality, distorted faces' | |
| cfg_scale | No | Classifier-free guidance scale. Higher values follow the prompt more strictly. Typical range: 0.0-1.0. | |
| camera_control | No | Camera control as JSON string. Example: '{"type": "simple", "config": {"horizontal": 5, "vertical": 0, "pan": 0, "tilt": 0, "roll": 0, "zoom": 0}}'. Types: 'simple', 'down_back', 'forward_up', 'left_turn_forward', 'right_turn_forward'. | |
| element_list | No | List of reference subjects from the subject library. Each item should contain an 'element_id'. If a reference video is present, reference subjects + reference images must be ≤ 4; otherwise ≤ 7. | |
| video_list | No | List of reference videos. Each item should contain a 'video_url' (MP4/MOV, 3-10s, 720-2160px, 24-60fps, ≤200MB, max 1 video) and optionally 'refer_type' ('feature' or 'base', default 'base') and 'keep_original_sound' ('yes' or 'no'). | |
| timeout | No | Timeout in seconds for the API to return data. Default is 300. | |
| callback_url | No | Webhook callback URL for asynchronous notifications. When provided, the API will call this URL when the video is generated. |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |