Generate image or video
vicsee_generate
Generate AI images or videos asynchronously by submitting a task and polling for the result. Supports text-to-image, image-to-video, reference-to-video, and video-editing models.
Instructions
Create an AI image or video with VicSee. Generation is asynchronous: this tool returns a task id immediately. Poll vicsee_get_task with that id until status is "completed" (the result URL is in result.url) or "failed". Use vicsee_list_models to pick a model and see its valid options.
For image-to-video and image-to-image models, pass source images in image_urls. For reference-to-video models (e.g. "seedance-2-0-reference-to-video"), pass references in reference_image_urls, reference_video_urls, and reference_audio_urls, and refer to them positionally in the prompt as @Image1, @Image2, …
Image inputs (image_urls, reference_image_urls) may each be a public https URL, a local file path (this server reads and base64-encodes it for you), or a base64 data URI. Video and audio inputs (reference_video_urls, reference_audio_urls) must be public https URLs.
For video-edit models (e.g. "happyhorse-video-edit"), pass the source clip in video_url and optionally set audio_setting ("auto" or "origin").
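
The submit-then-poll flow described above can be sketched roughly as follows. This is a minimal illustration, not the server's own client: `call_tool` is a hypothetical callable standing in for whatever MCP client you use, and the `task_id` field and argument names are assumptions, since the instructions only say that a task id is returned and passed to vicsee_get_task. Only the `status` values and `result.url` come from the description.

```python
import time
from typing import Any, Callable, Dict

# Hypothetical signature: (tool_name, arguments) -> parsed tool result as a dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def generate_and_wait(
    call_tool: ToolCaller,
    arguments: Dict[str, Any],
    poll_interval: float = 5.0,
    timeout: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Submit a generation task, then poll until it completes or fails."""
    submitted = call_tool("vicsee_generate", arguments)
    task_id = submitted["task_id"]  # assumed field name for the returned id

    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        # Assumed argument name; check the vicsee_get_task schema.
        task = call_tool("vicsee_get_task", {"task_id": task_id})
        status = task.get("status")
        if status == "completed":
            return task["result"]["url"]  # result URL, per the instructions
        if status == "failed":
            raise RuntimeError(f"generation task {task_id} failed")
        sleep(poll_interval)  # any other status: still running, wait and retry

    raise TimeoutError(f"task {task_id} did not finish within {timeout} seconds")
```

Passing `sleep` as a parameter keeps the loop easy to test; in normal use the default `time.sleep` applies. The poll interval and timeout are arbitrary placeholder values.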
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model | Yes | Model id from vicsee_list_models, e.g. "nano-banana-pro-text-to-image" or "seedance-2-0-text-to-video" | |
| prompt | No | Text prompt (required for most models) | |
| image_urls | No | Source image(s) for image-to-video / image-to-image. Each may be a public https URL, a local file path (this server reads and base64-encodes it for you), or a base64 data URI. | |
| reference_image_urls | No | Reference-to-video only: up to 7 reference images. Each may be a public https URL, a local file path (this server reads and base64-encodes it for you), or a base64 data URI. Refer to them in the prompt as @Image1, @Image2, … | |
| reference_video_urls | No | Reference-to-video only: up to 3 public https video URLs (2-15s each, ≤15s total). | |
| reference_audio_urls | No | Reference-to-video only: up to 3 public https audio URLs. | |
| video_url | No | Video-edit models (e.g. "happyhorse-video-edit") only: one public https video URL, 3-15s, to edit. | |
| audio_setting | No | Video-edit models only: "auto" (regenerate audio) or "origin" (keep the source audio). | |
| duration | No | Video length in seconds (e.g. 5, 6, 10, 15) — video models only | |
| aspect_ratio | No | e.g. "16:9", "9:16", "1:1", "landscape", "portrait" | |
| resolution | No | e.g. "1K", "2K", "4K", "720P", "1080P" | |
| output_format | No | Image output format, e.g. "png" or "jpeg" | |
| audio | No | Enable native audio (supported video models) | |
| extra | No | Any additional model-specific params (see the model's options from vicsee_list_models) | |
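
As an illustration of the limits in the table, the sketch below assembles arguments for a reference-to-video call and checks them before submission. The helper name `build_reference_to_video_args` is hypothetical, and the checks only mirror the limits stated above (up to 7 reference images, up to 3 videos, up to 3 audio files, https-only video and audio). The server may enforce more, such as clip durations, which this sketch does not inspect.

```python
from typing import Any, Dict, Iterable, Optional


def build_reference_to_video_args(
    model: str,
    prompt: str,
    images: Iterable[str],
    videos: Optional[Iterable[str]] = None,
    audios: Optional[Iterable[str]] = None,
    **options: Any,
) -> Dict[str, Any]:
    """Build a vicsee_generate argument dict, applying the documented limits."""
    image_list = list(images)
    video_list = list(videos or [])
    audio_list = list(audios or [])

    if len(image_list) > 7:
        raise ValueError("reference_image_urls accepts at most 7 images")
    if len(video_list) > 3:
        raise ValueError("reference_video_urls accepts at most 3 videos")
    if len(audio_list) > 3:
        raise ValueError("reference_audio_urls accepts at most 3 audio files")

    # Video and audio references must be public https URLs; images may also
    # be local paths or base64 data URIs, so they are not checked here.
    for url in video_list + audio_list:
        if not url.startswith("https://"):
            raise ValueError(f"video/audio reference must be an https URL: {url}")

    args: Dict[str, Any] = {"model": model, "prompt": prompt}
    if image_list:
        args["reference_image_urls"] = image_list
    if video_list:
        args["reference_video_urls"] = video_list
    if audio_list:
        args["reference_audio_urls"] = audio_list
    args.update(options)  # e.g. duration, aspect_ratio, resolution
    return args


example = build_reference_to_video_args(
    "seedance-2-0-reference-to-video",
    "@Image1 walks through the street shown in @Image2",
    images=["./hero.png", "https://example.com/street.jpg"],  # placeholder inputs
    duration=5,
    aspect_ratio="16:9",
)
```

The file path and URL in the example are placeholders. Valid values for options such as duration and aspect_ratio depend on the model, so confirm them with vicsee_list_models first.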