extract_video_entities
Extract structured data and entities from videos using prompts, or fetch pre-existing extractions from collections. Supports YouTube, Cloudglue, and direct video URLs, with paginated results.
Instructions
Extract structured data and entities from a single video, with cost optimization and pagination support. The tool has two modes. (1) Fetch existing entities: provide collection_id (prompt not required) to retrieve entities previously extracted and stored in that entities collection for the given Cloudglue file; the tool returns an error if none are found. (2) Extract new entities: provide prompt (collection_id optional); the tool automatically checks for existing extractions before creating new ones. Supported inputs are YouTube URLs, Cloudglue URLs, and direct HTTP video URLs. Result quality depends heavily on how specific the prompt is. Use the 'page' parameter to retrieve a specific page of segment-level entities. Use this tool for individual video analysis.
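The two modes can be sketched as a small argument builder. This is only an illustration: `build_extract_args` is a hypothetical helper, not part of the tool or any Cloudglue SDK, and the client-side checks simply mirror the rules stated above. Forwarding `prompt` when both `prompt` and `collection_id` are given is an assumption.

```python
from typing import Optional


def build_extract_args(
    url: str,
    prompt: Optional[str] = None,
    collection_id: Optional[str] = None,
    page: int = 0,
) -> dict:
    """Assemble arguments for extract_video_entities (hypothetical helper)."""
    if page < 0:
        raise ValueError("page must be 0 or greater")

    args = {"url": url, "page": page}

    if collection_id is not None:
        # Mode 1: fetch stored entities. Documented as Cloudglue-URL only.
        if not url.startswith("cloudglue://"):
            raise ValueError("collection_id only works with cloudglue:// URLs")
        args["collection_id"] = collection_id
        if prompt:
            # Assumption: a prompt may accompany collection_id and is passed along.
            args["prompt"] = prompt
    elif prompt:
        # Mode 2: extract new entities guided by the prompt.
        args["prompt"] = prompt
    else:
        raise ValueError("provide a prompt when collection_id is not given")

    return args


# Mode 1: fetch from a collection (IDs here are placeholders).
fetch_args = build_extract_args("cloudglue://files/file-id", collection_id="collection-id")

# Mode 2: prompt-driven extraction from a YouTube URL.
extract_args = build_extract_args(
    "https://youtu.be/example",
    prompt="Extract speaker names, key topics, and action items",
)
```

The resulting dictionaries would be passed as the tool's arguments by whatever MCP client is in use; how the call is dispatched depends on that client and is not shown here.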
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| url | Yes | Video URL to extract entities from. Supports multiple formats: • **Cloudglue platform (default)**: `cloudglue://files/file-id` - Use file ID from list_videos • **YouTube URLs**: `https://www.youtube.com/watch?v=...` or `https://youtu.be/...` • **Public HTTP video URLs**: Direct links to MP4 files (e.g., `https://example.com/video.mp4`) • **Data connector URLs** (requires setup in Cloudglue account): - **Dropbox**: Shareable links (`https://www.dropbox.com/scl/fo/...`) or `dropbox://<path>/<to>/<file>` - **Google Drive**: `gdrive://file/<file_id>` - **Zoom**: Meeting UUID (`zoom://uuid/QFwZYEreTl2e6MBFSslXjQ%3D%3D`) or Meeting ID (`zoom://id/81586198865`) See https://docs.cloudglue.dev/data-connectors/overview for data connector setup. | |
| prompt | No | Detailed extraction prompt that guides what entities to find. Examples: 'Extract speaker names, key topics, and action items', 'Find product names, prices, and features mentioned', 'Identify companies, people, and technologies discussed'. Be specific about the data structure you want. Required when collection_id is not provided. | |
| collection_id | No | ID of an entities collection from which to fetch previously extracted entities (saves time and cost). Use a collection ID from list_collections. Only works with Cloudglue URLs: when provided together with a Cloudglue URL, the tool retrieves the entity extractions already stored in that collection, and prompt is not required. | |
| page | No | Page number for paginated segment-level entities. Each page contains 25 segment entities. Defaults to 0 (first page). Use this to retrieve segment entities for specific pages of longer videos. | 0 |
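
Because each page holds 25 segment entities and numbering starts at 0, the page for a given segment is a simple integer division. The sketch below assumes segments are counted from zero in order; the helper names are hypothetical and not part of the tool.

```python
import math

PAGE_SIZE = 25  # segment entities per page, per the `page` description


def page_for_segment(segment_index: int) -> int:
    """Return the zero-based page holding a zero-based segment index."""
    if segment_index < 0:
        raise ValueError("segment_index must be 0 or greater")
    return segment_index // PAGE_SIZE


def page_count(total_segments: int) -> int:
    """Return how many pages are needed to cover all segments."""
    if total_segments < 0:
        raise ValueError("total_segments must be 0 or greater")
    return math.ceil(total_segments / PAGE_SIZE)


# A video with 60 segments spans pages 0, 1, and 2.
pages_needed = page_count(60)
last_segment_page = page_for_segment(59)
```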