Whisper CLI MCP Server

by yhsung
MIT License

An MCP server that provides shell command execution and OpenAI Whisper transcription capabilities.

Features

  • whisper_transcribe: Transcribe audio files using OpenAI Whisper
  • shell_command: Execute shell commands, with basic security validation that blocks a small set of dangerous patterns

Installation

  1. Install dependencies:
     pip install -r requirements.txt
  2. Make the server executable:
     chmod +x server.py

Usage

Running the Server

python server.py

Tools Available

whisper_transcribe

Transcribe audio files using whisper-cli.

Parameters:

  • audio_file (required): Path to the audio file
  • model (optional): Whisper model (base, small, medium, large, large-v2, large-v3)
  • language (optional): Language code for transcription
  • output_format (optional): Output format (txt, vtt, srt, json)

shell_command

Execute shell commands with basic security validation.

Parameters:

  • command (required): Shell command to execute
  • working_directory (optional): Working directory for the command
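
As a rough illustration of the parameters listed for both tools, the sketch below builds argument dictionaries in the shape a client might pass and checks the whisper_transcribe ones against the documented options. The helper `check_whisper_args`, the file paths, and the sample values are hypothetical and not taken from server.py; the server's own validation may differ.

```python
# Option values copied from the documented parameter lists.
WHISPER_MODELS = {"base", "small", "medium", "large", "large-v2", "large-v3"}
OUTPUT_FORMATS = {"txt", "vtt", "srt", "json"}


def check_whisper_args(args: dict) -> list:
    """Return a list of problems found in whisper_transcribe arguments.

    Hypothetical helper for illustration only.
    """
    problems = []
    if not args.get("audio_file"):
        problems.append("audio_file is required")
    model = args.get("model")
    if model is not None and model not in WHISPER_MODELS:
        problems.append(f"unknown model: {model}")
    fmt = args.get("output_format")
    if fmt is not None and fmt not in OUTPUT_FORMATS:
        problems.append(f"unknown output_format: {fmt}")
    return problems


# Example arguments for each tool; paths and values are placeholders.
whisper_args = {
    "audio_file": "/path/to/audio.wav",
    "model": "base",
    "language": "en",
    "output_format": "srt",
}
shell_args = {"command": "ls -la", "working_directory": "/tmp"}

print(check_whisper_args(whisper_args))       # no problems expected
print(check_whisper_args({"model": "tiny"}))  # missing file, unknown model
```

Only audio_file and command are required; the optional keys can be left out entirely.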

Security

The shell_command tool includes basic security validation to prevent execution of potentially dangerous commands. Commands containing the following patterns are blocked:

  • rm -rf
  • sudo
  • chmod 777
  • dd if=
  • > /dev/
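
The blocklist above can be sketched as a plain substring check. This is a minimal sketch that assumes simple substring matching; the README does not say how the server matches patterns, and `find_blocked_pattern` is a hypothetical name rather than a function from server.py.

```python
from typing import Optional

# Patterns listed in the Security section.
BLOCKED_PATTERNS = ["rm -rf", "sudo", "chmod 777", "dd if=", "> /dev/"]


def find_blocked_pattern(command: str) -> Optional[str]:
    """Return the first blocked pattern contained in the command, or None.

    Assumption: matching is a case-sensitive substring test.
    """
    for pattern in BLOCKED_PATTERNS:
        if pattern in command:
            return pattern
    return None


for cmd in ["ls -la", "sudo apt update", "echo hi > /dev/null"]:
    hit = find_blocked_pattern(cmd)
    print(cmd, "->", "blocked by " + repr(hit) if hit else "allowed")
```

Under this assumption a variant such as `rm -r -f` would not match `rm -rf`, which is consistent with the README describing the validation as basic.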

Configuration

To use this server with Claude Desktop, add the following to your claude_desktop_config.json:

{
  "mcpServers": {
    "whisper-cli-mcp": {
      "command": "python",
      "args": ["/path/to/whisper-cli-mcp/server.py"]
    }
  }
}
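
If it helps to avoid hand-editing the path, the same entry can be generated with a short script. This is a sketch only: `build_config` is a hypothetical helper, and the path passed to it is the placeholder from the snippet above, which should be replaced with the real location of server.py.

```python
import json
from pathlib import Path


def build_config(server_path: str) -> dict:
    """Build the mcpServers entry shown above, with an absolute path to server.py."""
    absolute = Path(server_path).expanduser().resolve()
    return {
        "mcpServers": {
            "whisper-cli-mcp": {
                "command": "python",
                "args": [str(absolute)],
            }
        }
    }


# Placeholder path; substitute the actual checkout location.
config = build_config("/path/to/whisper-cli-mcp/server.py")
print(json.dumps(config, indent=2))
```

The printed JSON can then be merged into an existing claude_desktop_config.json by hand, so that other configured servers are not overwritten.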