transcribe
Generate text from audio files in over 90 languages, with the option to identify different speakers.
Instructions
Transcribe audio to text using ElevenLabs Scribe with optional speaker diarization. Supports 90+ languages. Trigger: 'elevenlabs transcribe', 'elevenlabs stt', or 'elevenlabs speech to text'.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| audio_path | Yes | Absolute path to the audio file to transcribe | |
| model | No | Transcription model: "scribe_v2" (90+ languages) or "scribe_v1" | "scribe_v2" |
| language_code | No | Language code (ISO 639-1, e.g., "en", "es", "fr") to improve accuracy | |
| diarize | No | Identify which speaker is talking | false |
| num_speakers | No | Expected number of speakers (up to 32). Helps improve diarization accuracy. | |
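
As a rough illustration of how the parameters in the table fit together, the Python sketch below assembles an arguments object for this tool and checks it against the constraints stated above. The helper name `build_transcribe_args`, the example file path, and the exact validation rules (including a lower bound of 1 on `num_speakers`) are assumptions made for illustration; they are not part of the tool, and how the arguments are actually sent depends on the client.

```python
import os
from typing import Any, Dict, Optional

# Values taken from the input schema table.
ALLOWED_MODELS = {"scribe_v2", "scribe_v1"}
MAX_SPEAKERS = 32


def build_transcribe_args(
    audio_path: str,
    model: Optional[str] = None,
    language_code: Optional[str] = None,
    diarize: Optional[bool] = None,
    num_speakers: Optional[int] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: validate inputs and return an arguments dict.

    Optional parameters left as None are omitted so the tool's own
    defaults apply.
    """
    # The schema asks for an absolute path to the audio file.
    if not os.path.isabs(audio_path):
        raise ValueError("audio_path must be an absolute path")

    args: Dict[str, Any] = {"audio_path": audio_path}

    if model is not None:
        if model not in ALLOWED_MODELS:
            raise ValueError(f"model must be one of {sorted(ALLOWED_MODELS)}")
        args["model"] = model

    if language_code is not None:
        # Passed through unchanged; the schema expects an ISO 639-1 code.
        args["language_code"] = language_code

    if diarize is not None:
        args["diarize"] = diarize

    if num_speakers is not None:
        # Assumed lower bound of 1; the schema only states the upper bound.
        if not 1 <= num_speakers <= MAX_SPEAKERS:
            raise ValueError(f"num_speakers must be between 1 and {MAX_SPEAKERS}")
        args["num_speakers"] = num_speakers

    return args


# Example: a two-speaker English recording with diarization enabled.
example_args = build_transcribe_args(
    "/tmp/meeting.mp3",  # placeholder path
    language_code="en",
    diarize=True,
    num_speakers=2,
)
```

Leaving `model` unset in the example relies on the documented default rather than hard-coding it, so the call keeps working if the default changes.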