gemini-transcribe-audio
Convert audio files (MP3, WAV, etc.) into text with enhanced accuracy using contextual hints and learned preferences for improved transcription results.
Instructions
Transcribe audio files to text using Gemini's multimodal capabilities (with learned user preferences)
Input Schema
Name | Required | Description | Default |
---|---|---|---|
context | No | Optional context for intelligent enhancement (e.g., "medical", "legal", "technical") | |
file_path | Yes | Path to the audio file to transcribe (supports MP3, WAV, FLAC, AAC, OGG, WEBM) | |
language | No | Optional language hint for better transcription accuracy (e.g., "en", "es", "fr") | |
preserve_spelled_acronyms | No | Keep spelled-out letters (U-R-L) instead of converting to acronyms (URL) |