gemini-transcribe-audio

Convert audio files (MP3, WAV, etc.) into text with enhanced accuracy using contextual hints and learned preferences for improved transcription results.

Instructions

Transcribe audio files to text using Gemini's multimodal capabilities (with learned user preferences)

Input Schema

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| context | No | Optional context for intelligent enhancement (e.g., "medical", "legal", "technical") | |
| file_path | Yes | Path to the audio file to transcribe (supports MP3, WAV, FLAC, AAC, OGG, WEBM) | |
| language | No | Optional language hint for better transcription accuracy (e.g., "en", "es", "fr") | |
| preserve_spelled_acronyms | No | Keep spelled-out letters (U-R-L) instead of converting to acronyms (URL) | |
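
As an illustration of how these parameters fit together, the sketch below builds the JSON-RPC 2.0 `tools/call` message that an MCP client would send to invoke this tool. It is a minimal example under stated assumptions, not the server's own code: the helper name `build_transcribe_request`, the sample path `/recordings/visit.mp3`, the client-side extension check, and the boolean type of `preserve_spelled_acronyms` are all assumptions that the listing above does not confirm.

```python
import json
from pathlib import Path

# Extensions taken from the file_path description in the input schema.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".flac", ".aac", ".ogg", ".webm"}


def build_transcribe_request(file_path, context=None, language=None,
                             preserve_spelled_acronyms=None, request_id=1):
    """Build a JSON-RPC 2.0 `tools/call` message for gemini-transcribe-audio.

    Hypothetical helper: the extension check is a client-side convenience,
    not something the listing says the server requires.
    """
    suffix = Path(file_path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported audio format: {suffix or '(none)'}")

    arguments = {"file_path": file_path}  # the only required field
    if context is not None:
        arguments["context"] = context
    if language is not None:
        arguments["language"] = language
    if preserve_spelled_acronyms is not None:
        # Assumption: the schema types this flag as a boolean.
        arguments["preserve_spelled_acronyms"] = bool(preserve_spelled_acronyms)

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "gemini-transcribe-audio", "arguments": arguments},
    }


if __name__ == "__main__":
    # Sample path and hints are placeholders for illustration only.
    request = build_transcribe_request(
        "/recordings/visit.mp3", context="medical", language="en"
    )
    print(json.dumps(request, indent=2))
```

Optional fields are omitted from `arguments` when unset, so the server's own defaults (not shown in the table) would apply.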

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Garblesnarff/gemini-mcp-server'
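
The same request can be made from a script. The following is a rough Python equivalent of the curl command, using only the standard library. The helper names `server_url` and `fetch_server` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://glama.ai/api/mcp/v1/servers"


def server_url(owner, name):
    """Return the directory API URL for one MCP server (hypothetical helper)."""
    return f"{API_BASE}/{quote(owner, safe='')}/{quote(name, safe='')}"


def fetch_server(owner, name, timeout=10.0):
    """GET the server record and decode it.

    Assumption: the response body is JSON; the page does not document the format.
    """
    request = urllib.request.Request(server_url(owner, name), method="GET")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    info = fetch_server("Garblesnarff", "gemini-mcp-server")
    print(json.dumps(info, indent=2))
```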

If you have feedback or need assistance with the MCP directory API, please join our Discord server.