Whisper CLI MCP Server

An MCP (Model Context Protocol) server that exposes two tools: shell command execution and audio transcription with OpenAI Whisper.

Features

whisper_transcribe: Transcribe audio files using OpenAI Whisper
shell_command: Execute shell commands, with a basic pattern check that rejects some dangerous commands

Installation

Install dependencies:

pip install -r requirements.txt

Make the server executable:

chmod +x server.py

Usage

Running the Server

python server.py

Tools Available

whisper_transcribe

Transcribe audio files using whisper-cli.

Parameters:

audio_file (required): Path to the audio file
model (optional): Whisper model (base, small, medium, large, large-v2, large-v3)
language (optional): Language code for transcription
output_format (optional): Output format (txt, vtt, srt, json)
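
The parameter rules above can be checked on the client side before a call is sent. The following is a minimal sketch, not code from server.py: validate_transcribe_args is a hypothetical helper, and it assumes the model and output format lists above are exhaustive.

```python
# Hypothetical client-side check; server.py may validate differently.
ALLOWED_MODELS = {"base", "small", "medium", "large", "large-v2", "large-v3"}
ALLOWED_FORMATS = {"txt", "vtt", "srt", "json"}


def validate_transcribe_args(args: dict) -> dict:
    """Return a cleaned copy of the arguments, or raise ValueError."""
    audio_file = args.get("audio_file")
    if not isinstance(audio_file, str) or not audio_file:
        raise ValueError("audio_file is required")
    cleaned = {"audio_file": audio_file}

    model = args.get("model")
    if model is not None:
        if model not in ALLOWED_MODELS:
            raise ValueError(f"unsupported model: {model}")
        cleaned["model"] = model

    output_format = args.get("output_format")
    if output_format is not None:
        if output_format not in ALLOWED_FORMATS:
            raise ValueError(f"unsupported output_format: {output_format}")
        cleaned["output_format"] = output_format

    # The language code is passed through unchecked in this sketch.
    language = args.get("language")
    if language is not None:
        cleaned["language"] = language
    return cleaned
```

Optional parameters that are omitted stay out of the result, so the server's own defaults (whatever they are) would still apply.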

shell_command

Execute shell commands with basic security validation.

Parameters:

command (required): Shell command to execute
working_directory (optional): Working directory for the command
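
An MCP client normally builds the tool call for you, but it can help to see the shape of the message. The sketch below serializes a tools/call request in JSON-RPC 2.0 form, as defined by the MCP specification. build_tool_call is a hypothetical helper, and the command and directory values are placeholders.

```python
import json


def build_tool_call(request_id: int, name: str, arguments: dict) -> str:
    """Serialize an MCP tools/call request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }
    return json.dumps(message)


# Placeholder values; any command is still subject to the server's checks.
request = build_tool_call(
    1,
    "shell_command",
    {"command": "ls -la", "working_directory": "/tmp"},
)
print(request)
```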

Security

The shell_command tool applies basic security validation before running a command, to prevent execution of potentially dangerous commands. A command that contains any of the following patterns is blocked:

rm -rf
sudo
chmod 777
dd if=
> /dev/
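
A check like the one described above can be sketched as follows. This assumes plain substring matching against the listed patterns; server.py may implement the check differently (for example with regular expressions), and is_blocked is a hypothetical name.

```python
# Patterns copied from the list above.
BLOCKED_PATTERNS = ["rm -rf", "sudo", "chmod 777", "dd if=", "> /dev/"]


def is_blocked(command: str) -> bool:
    """Return True if the command contains any blocked pattern."""
    return any(pattern in command for pattern in BLOCKED_PATTERNS)


for candidate in ["ls -la", "sudo apt update", "echo sudoku", "rm -r -f build"]:
    print(candidate, "->", "blocked" if is_blocked(candidate) else "allowed")
```

Under this substring assumption, a harmless command such as echo sudoku is rejected because it contains sudo, while rm -r -f build passes because it does not contain the exact text rm -rf. A pattern list of this kind reduces accidents but is not a sandbox.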

Configuration

To use this server with Claude Desktop, add the following to your claude_desktop_config.json:

{
  "mcpServers": {
    "whisper-cli-mcp": {
      "command": "python",
      "args": ["/path/to/whisper-cli-mcp/server.py"]
    }
  }
}
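
If claude_desktop_config.json already lists other servers, the new entry belongs under the existing mcpServers key instead of replacing the file. A minimal sketch of that merge, where add_server is a hypothetical helper and other-server stands in for whatever is already configured:

```python
import json


def add_server(config: dict, name: str, script_path: str) -> dict:
    """Return a copy of config with an mcpServers entry for the script."""
    updated = dict(config)
    servers = dict(updated.get("mcpServers", {}))
    servers[name] = {"command": "python", "args": [script_path]}
    updated["mcpServers"] = servers
    return updated


# Example of an existing configuration with one unrelated server.
existing = {"mcpServers": {"other-server": {"command": "node", "args": ["index.js"]}}}
merged = add_server(existing, "whisper-cli-mcp", "/path/to/whisper-cli-mcp/server.py")
print(json.dumps(merged, indent=2))
```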
