Audio Transcriber MCP Server

OpenAI Speech-to-Text transcriptions MCP Server

An MCP server that provides audio transcription capabilities using OpenAI's API.

Installation

Setup

Clone the repository:

git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git
cd audio-transcriber-mcp

Install dependencies:

npm install

Build the server:

npm run build

Set your OpenAI API key as an environment variable, or supply it in the env block of the server configuration.
Add the server configuration to your MCP client's configuration:

{
  "mcpServers": {
    "audio-transcriber": {
      "command": "node",
      "args": [
        "/path/to/audio-transcriber-mcp/build/index.js"
      ],
      "env": {
        "OPENAI_API_KEY": "",
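        "OPENAI_BASE_URL": "",
        "OPENAI_MODEL": ""
      }
    }
  }
}

Replace /path/to/audio-transcriber-mcp with the actual path where you cloned the repository. OPENAI_API_KEY is required; OPENAI_BASE_URL and OPENAI_MODEL are optional and can be left out. Strict JSON does not allow comments, so none are included in the snippet.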
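To make the required/optional split of the three env keys concrete, here is a minimal sketch of how such settings could be read and validated. It is not taken from the server's source: the TranscriberSettings interface and the readSettings and nonEmpty helpers are hypothetical names, and treating an empty string the same as an unset variable is an assumption made because the configuration template ships with empty values.

```typescript
// Hypothetical settings shape; the field names are illustrative, not the server's own.
interface TranscriberSettings {
  apiKey: string;
  baseUrl?: string;
  model?: string;
}

// Assumption: an empty or whitespace-only value counts as "not set".
function nonEmpty(value: string | undefined): string | undefined {
  const trimmed = value?.trim();
  return trimmed ? trimmed : undefined;
}

// Read the three keys from an env-like object, rejecting a missing API key.
function readSettings(env: Record<string, string | undefined>): TranscriberSettings {
  const apiKey = nonEmpty(env["OPENAI_API_KEY"]);
  if (apiKey === undefined) {
    throw new Error("OPENAI_API_KEY is required");
  }
  const settings: TranscriberSettings = { apiKey };

  // Optional keys are only copied over when they carry a real value.
  const baseUrl = nonEmpty(env["OPENAI_BASE_URL"]);
  if (baseUrl !== undefined) settings.baseUrl = baseUrl;
  const model = nonEmpty(env["OPENAI_MODEL"]);
  if (model !== undefined) settings.model = model;

  return settings;
}
```

In a Node process this would be called as readSettings(process.env). Passing the environment in as a parameter keeps the function easy to exercise with plain objects.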

Features

Tools

transcribe_audio - Transcribe audio files using OpenAI's API
- Takes filepath as a required parameter
- Optional parameters:
  - save_to_file: Boolean to save transcription to a file
  - language: ISO-639-1 language code (e.g., "en", "es")
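
As an illustration of the parameters listed above, the following sketch builds the JSON-RPC request an MCP client would send to invoke transcribe_audio. The tools/call method and the name/arguments layout come from the Model Context Protocol; the buildTranscribeRequest helper, the interface names, the example file path, and the client-side check on the language code are assumptions added for illustration and are not part of this server.

```typescript
// Arguments for transcribe_audio as documented: filepath is required, the rest optional.
interface TranscribeAudioArgs {
  filepath: string;
  save_to_file?: boolean;
  language?: string;
}

// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: string;
  id: number;
  method: string;
  params: { name: string; arguments: TranscribeAudioArgs };
}

// Hypothetical helper: assemble the request and sanity-check the language code.
function buildTranscribeRequest(id: number, args: TranscribeAudioArgs): ToolCallRequest {
  // Assumption: reject anything that is not two lowercase letters (ISO-639-1 form).
  if (args.language !== undefined && !/^[a-z]{2}$/.test(args.language)) {
    throw new Error(`language must be a two-letter ISO-639-1 code, got "${args.language}"`);
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "transcribe_audio", arguments: args },
  };
}

const request = buildTranscribeRequest(1, {
  filepath: "/path/to/recording.mp3", // hypothetical path
  save_to_file: true,
  language: "en",
});
console.log(JSON.stringify(request, null, 2));
```

Most MCP clients construct this request for you; the sketch only shows which fields carry the tool name and its parameters.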

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.



