Skip to main content
Glama

Gemini MCP Server for Claude Desktop

gemini-transcribe-audio

Convert audio files (MP3, WAV, etc.) into text with enhanced accuracy using contextual hints and learned preferences for improved transcription results.

Instructions

Transcribe audio files to text using Gemini's multimodal capabilities (with learned user preferences)

Input Schema

NameRequiredDescriptionDefault
contextNoOptional context for intelligent enhancement (e.g., "medical", "legal", "technical")
file_pathYesPath to the audio file to transcribe (supports MP3, WAV, FLAC, AAC, OGG, WEBM)
languageNoOptional language hint for better transcription accuracy (e.g., "en", "es", "fr")
preserve_spelled_acronymsNoKeep spelled-out letters (U-R-L) instead of converting to acronyms (URL)

Input Schema (JSON Schema)

{ "properties": { "context": { "description": "Optional context for intelligent enhancement (e.g., \"medical\", \"legal\", \"technical\")", "type": "string" }, "file_path": { "description": "Path to the audio file to transcribe (supports MP3, WAV, FLAC, AAC, OGG, WEBM)", "type": "string" }, "language": { "description": "Optional language hint for better transcription accuracy (e.g., \"en\", \"es\", \"fr\")", "type": "string" }, "preserve_spelled_acronyms": { "description": "Keep spelled-out letters (U-R-L) instead of converting to acronyms (URL)", "type": "boolean" } }, "required": [ "file_path" ], "type": "object" }

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Garblesnarff/gemini-mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server