Skip to main content
Glama
ZahiriNatZuke

whisper-transcribe-mcp

transcribe_base64

Transcribe base64-encoded audio into text using Whisper models, with automatic language detection and optional GPT post-processing for spelling and grammar corrections.

Instructions

Transcribe audio provided as a base64-encoded string.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
audio_base64YesBase64-encoded audio data.
extensionNoFile extension for the temp file (mp3, wav, m4a, ogg, etc.).mp3
languageNoLanguage code. Auto-detected if not provided.
model_sizeNoLocal model size. Ignored when using the OpenAI backend.
post_processNoIf True, passes the transcription through GPT to fix spelling, grammar, and punctuation. Requires the openai package.
post_process_promptNoCustom system prompt for post-processing. Use this to provide domain-specific context, proper nouns, or product names.

Output Schema

TableJSON Schema
NameRequiredDescriptionDefault

No arguments

Tool Definition Quality

Score is being calculated. Check back soon.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/ZahiriNatZuke/whisper-transcribe-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server