Skip to main content
Glama

Gemini MCP Server for Claude Desktop

gemini-transcribe-audio

Convert audio files into text with intelligent transcription, supporting multiple formats and optional language or context hints for enhanced accuracy using multimodal capabilities.

Instructions

Transcribe audio files to text using Gemini's multimodal capabilities (with learned user preferences)

Input Schema

NameRequiredDescriptionDefault
contextNoOptional context for intelligent enhancement (e.g., "medical", "legal", "technical")
file_pathYesPath to the audio file to transcribe (supports MP3, WAV, FLAC, AAC, OGG, WEBM)
languageNoOptional language hint for better transcription accuracy (e.g., "en", "es", "fr")
preserve_spelled_acronymsNoKeep spelled-out letters (U-R-L) instead of converting to acronyms (URL)

Input Schema (JSON Schema)

{ "properties": { "context": { "description": "Optional context for intelligent enhancement (e.g., \"medical\", \"legal\", \"technical\")", "type": "string" }, "file_path": { "description": "Path to the audio file to transcribe (supports MP3, WAV, FLAC, AAC, OGG, WEBM)", "type": "string" }, "language": { "description": "Optional language hint for better transcription accuracy (e.g., \"en\", \"es\", \"fr\")", "type": "string" }, "preserve_spelled_acronyms": { "description": "Keep spelled-out letters (U-R-L) instead of converting to acronyms (URL)", "type": "boolean" } }, "required": [ "file_path" ], "type": "object" }
Install Server

Other Tools from Gemini MCP Server for Claude Desktop

Related Tools

    MCP directory API

    We provide all the information about MCP servers via our MCP API.

    curl -X GET 'https://glama.ai/api/mcp/v1/servers/Garblesnarff/gemini-mcp-server'

    If you have feedback or need assistance with the MCP directory API, please join our Discord server