APIbase


audio.transcribe.submit

Submit audio files for transcription to convert speech to text. Supports multiple formats, auto-detects languages, and optionally identifies speakers.

Instructions

Submit an audio file URL for speech-to-text transcription. The call returns a transcript_id that is used to check status and retrieve results. Supported formats are MP3, WAV, M4A, FLAC, OGG, and WebM. The language is auto-detected from 99 supported languages. Speaker diarization is optional (AssemblyAI).

Input Schema


Name	Required	Description
`audio_url`	Yes	Publicly accessible URL of the audio file to transcribe (MP3, WAV, M4A, FLAC, OGG, WebM)
`model`	No	Speech model: "universal-2" (default, fast, 99 languages) or "universal-3-pro" (highest accuracy, promptable)
`language_code`	No	Language code (e.g. "en", "es", "de", "fr", "ja"). Auto-detected if omitted
`speaker_labels`	No	Enable speaker diarization — detect who said what (default false)
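
As an illustration of how the parameters in the table fit together, here is a minimal Python sketch that assembles the arguments and wraps them in an MCP `tools/call` JSON-RPC request. The helper names `build_submit_arguments` and `build_tools_call` are hypothetical, the example URL is a placeholder, and the HTTP(S) check is an assumption about what "publicly accessible URL" means in practice. The sketch only builds the request body; how it is sent depends on the MCP client and transport in use.

```python
import json

# Model names taken from the input schema table above.
ALLOWED_MODELS = {"universal-2", "universal-3-pro"}


def build_submit_arguments(audio_url, model=None, language_code=None, speaker_labels=None):
    """Assemble the arguments object for audio.transcribe.submit (hypothetical helper).

    Optional fields are omitted when not given, so the defaults
    described in the table apply.
    """
    # Assumption: a publicly accessible URL is an http:// or https:// URL.
    if not audio_url.startswith(("http://", "https://")):
        raise ValueError("audio_url must be a publicly accessible HTTP(S) URL")
    if model is not None and model not in ALLOWED_MODELS:
        raise ValueError(f"unknown model: {model!r}")

    arguments = {"audio_url": audio_url}
    if model is not None:
        arguments["model"] = model
    if language_code is not None:
        arguments["language_code"] = language_code
    if speaker_labels is not None:
        arguments["speaker_labels"] = bool(speaker_labels)
    return arguments


def build_tools_call(arguments, request_id=1):
    """Wrap the arguments in an MCP JSON-RPC 2.0 tools/call request (hypothetical helper)."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "audio.transcribe.submit", "arguments": arguments},
    }


if __name__ == "__main__":
    args = build_submit_arguments(
        "https://example.com/interview.mp3",  # placeholder URL
        language_code="en",
        speaker_labels=True,
    )
    print(json.dumps(build_tools_call(args), indent=2))
```

Leaving `model` and `language_code` unset keeps the request minimal and relies on the documented defaults ("universal-2" and language auto-detection).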


MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/whiteknightonhorse/APIbase'

If you have feedback or need assistance with the MCP directory API, please join our Discord server.