video_analyze
Extract transcript, metadata, scenes, audio, quality, chapters, and colors from any video. Works with local files or URLs, including streaming platforms.
Instructions
Comprehensive video analysis — transcript, metadata, scenes, audio, quality, chapters, colors.
Accepts a local file path or an HTTP/HTTPS URL. Direct video URLs (e.g. https://example.com/clip.mp4) are downloaded automatically. Streaming-platform URLs (YouTube, Vimeo, TikTok, Twitter/X, Instagram, Twitch, …) require yt-dlp (pip install yt-dlp). Each sub-analysis is independent so one failure will not abort the others.
Args: input_path: Local path or HTTP/HTTPS URL to the video. whisper_model: Whisper model size (tiny, base, small, medium, large, turbo). language: Language code for transcription (auto-detect if None). scene_threshold: Scene change sensitivity 0.0-1.0. include_transcript: Run speech-to-text via Whisper (requires openai-whisper). include_scenes: Detect scene changes and boundaries. include_audio: Analyse audio waveform, peaks, and silence regions. include_quality: Run visual quality check. include_chapters: Auto-generate chapter markers from scene changes. include_colors: Extract dominant colors and extended metadata. output_srt: Optional path to write SRT subtitle file. output_txt: Optional path to write plain-text transcript. output_md: Optional path to write Markdown transcript with timestamps. output_json: Optional path to write full JSON transcript data.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| input_path | Yes | ||
| whisper_model | No | base | |
| language | No | ||
| scene_threshold | No | ||
| include_transcript | No | ||
| include_scenes | No | ||
| include_audio | No | ||
| include_quality | No | ||
| include_chapters | No | ||
| include_colors | No | ||
| output_srt | No | ||
| output_txt | No | ||
| output_md | No | ||
| output_json | No |
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||