extract_youtube_transcript
Extract YouTube video transcripts with timestamps, language preferences, and translation. Supports a page-crawl fallback and saving the full transcript to a Markdown file.
Instructions
Extract YouTube transcripts with timestamps. Works with public videos that have captions. Can fall back to a page crawl when the API fails (see enable_crawl_fallback). Use output_path to persist the full, unsliced transcript to disk as Markdown.
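As a rough illustration of the instructions above, the sketch below assembles an arguments object for this tool. `build_arguments` is a hypothetical helper, not part of the tool. The video ID and file path are placeholders, and the value types (a list of strings for `languages`, a boolean for `include_timestamps`) are assumptions, since the schema does not state types. The absolute-path check assumes POSIX-style paths.

```python
import json
from pathlib import PurePosixPath


def build_arguments(url, output_path=None, **options):
    """Assemble an arguments dict for extract_youtube_transcript.

    Hypothetical helper for illustration only; the tool itself does not ship it.
    """
    args = {"url": url, **options}
    if output_path is not None:
        # The schema asks for an absolute path, so reject relative ones early.
        # Assumption: POSIX-style paths.
        if not PurePosixPath(output_path).is_absolute():
            raise ValueError("output_path must be an absolute path")
        args["output_path"] = output_path
    return args


arguments = build_arguments(
    "https://www.youtube.com/watch?v=VIDEO_ID",  # placeholder URL
    output_path="/tmp/transcripts/talk.md",      # placeholder path
    languages=["en", "de"],                      # assumed type: list of codes
    include_timestamps=True,                     # assumed type: boolean
)
print(json.dumps(arguments, indent=2))
```

Only `url` is required; every other key can be omitted to accept the tool's defaults.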
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | YouTube video URL | |
| languages | No | Language codes in preference order | |
| translate_to | No | Target language for translation | |
| include_timestamps | No | Include timestamps | |
| preserve_formatting | No | Preserve formatting | |
| include_metadata | No | Include video metadata | |
| auto_summarize | No | Auto-summarize large content | |
| max_content_tokens | No | Max tokens before summarization | |
| summary_length | No | One of 'short', 'medium', or 'long' | medium |
| llm_provider | No | LLM provider | |
| llm_model | No | LLM model | |
| enable_crawl_fallback | No | Enable page crawl fallback when API fails | |
| fallback_timeout | No | Fallback crawl timeout in seconds | |
| enrich_metadata | No | Enrich metadata (upload_date, view_count) via page crawl | |
| content_offset | No | Start position for content (0-indexed) | |
| content_limit | No | Max characters to return (0=unlimited) | |
| output_path | No | Absolute file path (auto .md extension) at which to persist the full unsliced transcript. When set, the response is slimmed to metadata plus the file path. content_limit and content_offset still affect the response copy but not the on-disk file. | |
| include_content_in_response | No | When True (with output_path set), keep the transcript in the response too. Note: the response copy is still subject to content_limit/content_offset slicing; only the on-disk file holds the full transcript. | False |
| overwrite | No | Overwrite an existing output file at output_path. | False |
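The interaction between content_offset, content_limit, and output_path can be sketched as follows. This is a minimal illustration of the semantics described in the table, not the tool's actual code. `slice_for_response` is a hypothetical name, and it assumes content_offset counts characters, as content_limit does.

```python
def slice_for_response(transcript: str, content_offset: int = 0,
                       content_limit: int = 0) -> str:
    """Return the response copy of a transcript (illustrative only).

    Assumption: both parameters count characters; content_limit == 0
    means "no limit", matching the table above.
    """
    if content_offset < 0 or content_limit < 0:
        raise ValueError("content_offset and content_limit must be >= 0")
    if content_limit == 0:
        return transcript[content_offset:]
    return transcript[content_offset:content_offset + content_limit]


full = "[00:00] Hello\n[00:05] World\n"

# The response copy is sliced...
response_copy = slice_for_response(full, content_offset=8, content_limit=5)
print(response_copy)  # Hello

# ...while the text written to output_path would stay unsliced.
on_disk = full
```

Under this reading, a caller who needs the whole transcript should read the file at output_path and not rely on the response copy.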
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |