MCP 音频转录器

一个 Dockerized Python 工具，通过 AssemblyAI 的 API 实现模型上下文协议 (MCP)。上传或指向音频文件，即可接收结构化的 JSON 转录。

特征

AssemblyMCP ：使用 AssemblyAI 的 REST API 的具体 MCP 实现
命令行界面（ app.py ）：
python app.py <input_audio> <output_json>
Streamlit 网页用户界面( streamlit_app.py )：
- 上传本地文件或粘贴 URL
- 点击转录
- 预览成绩单并下载 JSON
Docker 对环境一致性和可移植性的支持

先决条件

Python 3.10+
AssemblyAI API 密钥
ffmpeg（用于本地解码，如果使用本地文件）
（可选）Docker 桌面/引擎
（可选）Streamlit（ pip install streamlit ）

🔧 安装

克隆 repo
git clone https://github.com/ShreyasTembhare/MCP---Audio-Transcriber.git cd MCP---Audio-Transcriber
创建.env
ASSEMBLYAI_API_KEY=your_assemblyai_api_key_here
确保.gitignore包含：
.env
安装 Python 依赖项
pip install --upgrade pip pip install -r requirements.txt
安装 ffmpeg
- Ubuntu/Debian： sudo apt update && sudo apt install ffmpeg -y
- Windows：从https://ffmpeg.org下载并将其bin/添加到您的 PATH

用法

1. CLI 转录

python app.py <input_audio> <output_json>

<input_audio> ：AssemblyAI 支持的任何文件或 URL
<output_json> ：生成的 JSON 的路径

例子：

python app.py data/input.ogg data/output.json
cat data/output.json

2. Streamlit Web UI

streamlit run streamlit_app.py

打开http://localhost:8501
上传或输入音频 URL
点击转录
下载 JSON 结果

3. Docker

构建图像：

docker build -t mcp-transcriber .

运行它（挂载你的数据/文件夹）：

docker run --rm \
  -e ASSEMBLYAI_API_KEY="$ASSEMBLYAI_API_KEY" \
  -v "$(pwd)/data:/data" \
  mcp-transcriber:latest \
  /data/input.ogg /data/output.json

然后检查：

ls data/output.json
cat data/output.json

Windows PowerShell：

docker run --rm `
  -e ASSEMBLYAI_API_KEY=$env:ASSEMBLYAI_API_KEY `
  -v "${PWD}\data:/data" `
  mcp-transcriber:latest `
  /data/input.ogg /data/output.json

项目结构

MCP-Audio-Transcriber/
├── app.py               # CLI entrypoint (AssemblyMCP only)
├── mcp.py               # ModelContextProtocol + AssemblyMCP
├── streamlit_app.py     # Streamlit interface
├── requirements.txt     # assemblyai, python-dotenv, streamlit, etc.
├── Dockerfile           # builds the container
├── .gitignore           # ignores .env, __pycache__, etc.
├── LICENSE              # MIT license
└── data/                # sample input and output
    ├── input.ogg
    └── output.json

This server cannot be installed

security - not tested

license - permissive license

quality - not tested

How are these scores calculated?

local-only server

The server can only run on the client's local machine because it depends on local resources.

一种可移植的 Dockerized Python 工具，使用 Whisper 模型实现用于音频转录的模型上下文协议，具有 CLI 和 Web UI 界面，用于将音频文件转换为 JSON 转录。

Related MCP Servers

Kokoro TTS MCP Server
giannisanni
-
security
F
license
-
quality
Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
Last updated -
2
Python
Sonic Pi MCP
abhishekjairath
-
security
A
license
-
quality
A Model Context Protocol server that allows AI assistants like Claude and Cursor to create music and control Sonic Pi programmatically through OSC messages.
Last updated -
10
7
TypeScript
MIT License
Audio Transcriber MCP Server
Ichigo3766
A
security
A
license
A
quality
A MCP server that enables transcription of audio files using OpenAI's Speech-to-Text API, with support for multiple languages and file saving options.
Last updated -
1
2
JavaScript
MIT License
Rime MCP
MatthewDailey
A
security
A
license
A
quality
A Model Context Protocol server that enables AI models to generate and play high-quality text-to-speech audio through your device's native audio system using Rime's voice synthesis API.
Last updated -
1
15
4
JavaScript
The Unlicense

View all related MCP servers

MCP Audio Transcriber

MCP 音频转录器

特征

先决条件

🔧 安装

用法

1. CLI 转录

2. Streamlit Web UI

3. Docker

项目结构

Related MCP Servers

Kokoro TTS MCP Server

Sonic Pi MCP

Audio Transcriber MCP Server

Rime MCP

New MCP Servers

MCP directory API