Glama

audio.transcribe.submit

Submit audio files for transcription to convert speech to text. Supports multiple formats, auto-detects languages, and optionally identifies speakers.

Instructions

Submit an audio file URL for speech-to-text transcription. Returns a transcript_id that is used to check status and retrieve results. Supported formats: MP3, WAV, M4A, FLAC, OGG, WebM. The language is auto-detected from 99 supported languages. Optional speaker diarization (AssemblyAI).
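
Because the tool takes a URL rather than an uploaded file, a quick client-side check can catch obviously unsuitable inputs before submitting. The following is a minimal sketch, not part of the tool itself: the helper name `looks_like_supported_audio` is hypothetical, and it assumes the file extension in the URL path reflects the actual audio format, which a server is not required to honor.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions for the formats listed above (assumption: extension matches format).
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg", ".webm"}


def looks_like_supported_audio(audio_url: str) -> bool:
    """Return True if the URL is http(s) and its path ends in a listed extension."""
    parsed = urlparse(audio_url)
    # The tool requires a publicly accessible URL, so only http(s) is accepted here.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        return False
    # Query strings are ignored because only the path component is inspected.
    suffix = PurePosixPath(parsed.path).suffix.lower()
    return suffix in SUPPORTED_EXTENSIONS
```

A URL without a recognizable extension may still point to valid audio, so this check is best treated as a warning rather than a hard failure.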

Input Schema

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| audio_url | Yes | Publicly accessible URL of the audio file to transcribe (MP3, WAV, M4A, FLAC, OGG, WebM) | |
| model | No | Speech model: "universal-2" (fast, 99 languages) or "universal-3-pro" (highest accuracy, promptable) | "universal-2" |
| language_code | No | Language code (e.g. "en", "es", "de", "fr", "ja") | Auto-detected if omitted |
| speaker_labels | No | Enable speaker diarization: detect who said what | false |
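
As an illustration of how these parameters fit together, here is a minimal sketch that builds an MCP `tools/call` JSON-RPC request for this tool. The helper name `build_submit_call` and the example URL are hypothetical. How the request is transported to the server (stdio, HTTP) depends on the client and is not shown.

```python
from typing import Any, Dict, Optional

# Model names as listed in the input schema above.
ALLOWED_MODELS = {"universal-2", "universal-3-pro"}


def build_submit_call(
    audio_url: str,
    model: Optional[str] = None,
    language_code: Optional[str] = None,
    speaker_labels: Optional[bool] = None,
    request_id: int = 1,
) -> Dict[str, Any]:
    """Build a JSON-RPC 2.0 `tools/call` request for audio.transcribe.submit."""
    if not audio_url:
        raise ValueError("audio_url is required")
    arguments: Dict[str, Any] = {"audio_url": audio_url}
    # Optional fields are omitted when unset so the server-side defaults apply.
    if model is not None:
        if model not in ALLOWED_MODELS:
            raise ValueError(f"unknown model: {model!r}")
        arguments["model"] = model
    if language_code is not None:
        arguments["language_code"] = language_code
    if speaker_labels is not None:
        arguments["speaker_labels"] = speaker_labels
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "audio.transcribe.submit", "arguments": arguments},
    }
```

For example, `build_submit_call("https://example.com/meeting.mp3", speaker_labels=True)` produces a request that relies on the default model and language auto-detection while enabling diarization.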

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/whiteknightonhorse/APIbase'
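
The same request can be prepared from Python with the standard library. This is only a sketch: the helper name `build_server_request` is hypothetical, the owner/name path layout is inferred from the single URL above, and the `Accept` header assumes the API returns JSON, which this page does not state.

```python
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
API_BASE = "https://glama.ai/api/mcp/v1/servers"


def build_server_request(owner: str, name: str) -> urllib.request.Request:
    """Prepare (but do not send) a GET request for one server's directory entry."""
    # Path segments are percent-encoded in case they contain reserved characters.
    url = f"{API_BASE}/{quote(owner, safe='')}/{quote(name, safe='')}"
    return urllib.request.Request(
        url, method="GET", headers={"Accept": "application/json"}
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request.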

If you have feedback or need assistance with the MCP directory API, please join our Discord server.