Skip to main content
Glama

Audio Transcriber MCP Server

by Ichigo3766

OpenAI 语音转文本转录 MCP 服务器

使用 OpenAI 的 API 提供音频转录功能的 MCP 服务器。

安装

设置

  1. 克隆存储库:
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git cd audio-transcriber-mcp
  1. 安装依赖项:
npm install
  1. 构建服务器:
npm run build
  1. 在您的环境变量中设置您的 OpenAI API 密钥。
  2. 将服务器配置添加到您的环境中:
{ "mcpServers": { "audio-transcriber": { "command": "node", "args": [ "/path/to/audio-transcriber-mcp/build/index.js" ], "env": { "OPENAI_API_KEY": "", "OPENAI_BASE_URL": "", // Optional "OPENAI_MODEL": "" // Optional } } } }

/path/to/audio-transcriber-mcp替换为您克隆存储库的实际路径。

特征

工具

  • transcribe_audio - 使用 OpenAI 的 API 转录音频文件
    • 将文件路径作为必需参数
    • 可选参数:
      • save_to_file:布尔值,用于将转录保存到文件
      • 语言:ISO-639-1 语言代码(例如“en”、“es”)

执照

此 MCP 服务器采用 MIT 许可证。这意味着您可以自由使用、修改和分发该软件,但须遵守 MIT 许可证的条款和条件。更多详情,请参阅项目仓库中的 LICENSE 文件。

Install Server
A
security – no known vulnerabilities
A
license - permissive license
A
quality - confirmed to work

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

MCP 服务器可使用 OpenAI 的语音转文本 API 转录音频文件,并支持多种语言和文件保存选项。

  1. 安装
    1. 设置
  2. 特征
    1. 工具
  3. 执照

    Related MCP Servers

    • -
      security
      A
      license
      -
      quality
      Enables recording audio from a microphone and transcribing it using OpenAI's Whisper model. Works as both a standalone MCP server and a Goose AI agent extension.
      Last updated -
      4
      Python
      MIT License
    • -
      security
      A
      license
      -
      quality
      An MCP server that enables LLMs to generate spoken audio from text using OpenAI's Text-to-Speech API, supporting various voices, models, and audio formats.
      Last updated -
      4
      1
      JavaScript
      MIT License
    • A
      security
      A
      license
      A
      quality
      A Model Context Protocol server that enables AI models to generate and play high-quality text-to-speech audio through your device's native audio system using Rime's voice synthesis API.
      Last updated -
      1
      15
      4
      JavaScript
      The Unlicense
      • Apple
      • Linux
    • A
      security
      A
      license
      A
      quality
      An official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.
      Last updated -
      19
      700
      Python
      MIT License
      • Apple

    View all related MCP servers

    MCP directory API

    We provide all the information about MCP servers via our MCP API.

    curl -X GET 'https://glama.ai/api/mcp/v1/servers/Ichigo3766/audio-transcriber-mcp'

    If you have feedback or need assistance with the MCP directory API, please join our Discord server