omni_video_ingest
Ingest raw video files to generate word-level transcripts and a semantic Visual Scene Graph for B-Roll searching. Returns the path to generated project metadata.
Instructions
Ingests a directory of video files, generates word-level audio transcripts, and constructs a semantic Visual Scene Graph for B-Roll searching. Returns the path to the generated project metadata.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| request | Yes |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes |