Skip to main content
Glama

VOICEVOX MCP Server

Integrations
  • Provides Docker Compose setup for easily running the required VOICEVOX engine locally.

  • The MCP server is implemented in TypeScript for type safety and developer experience.

  • Uses Zod for runtime schema validation within the MCP server implementation.

voicevox-mcp

该项目是一个MCP(模型上下文协议)服务器,与VOICEVOX引擎配合,合成语音并获取说话人信息。它是用 TypeScript 实现的,并使用 MCP SDK。

功能

  • 获取 VOICEVOX 引擎的扬声器信息 (/speakers)
  • 在指定的扬声器上将文本合成为语音并在本地播放(/speak)
    • 仅限 Mac

设置

启动 VOICEVOX 引擎(推荐使用 Docker)

docker compose up -d

这将在 localhost:50021 上启动 VOICEVOX 引擎。

安装并构建依赖项

npm install npm run build

如何使用

光标设置示例

{ "mcpServers": { "voicevox-mcp": { "command": "node", "args": ["${Path to Repository}/dist/index.js"], "env": { "SPEAKER_ID": 8, "SPEED_SCALE": 1.2, "VOICEVOX_API_URL": "http://localhost:50021" } } } }

根据需要设置VOICEVOX_API_URL。

  • 您可以使用发言者工具从 MCP 客户端获取发言者列表。
  • 语音工具可以将文本合成为语音并在本地播放(建议使用 Mac,因为它使用 afplay 命令)。

主要依赖项

  • @modelcontextprotocol/sdk
  • zod
  • typescript

防范措施

  • 未来的改进
    • 除非 VOICEVOX 引擎在 localhost:50021 上运行,否则语音合成将不可用。
    • 如果您使用Mac以外的环境,请相应地更改afplay部分。

执照

MIT 许可证

You must be authenticated.

A
security – no known vulnerabilities
A
license - permissive license
A
quality - confirmed to work

local-only server

The server can only run on the client's local machine because it depends on local resources.

模型上下文协议服务器与 VOICEVOX 引擎集成,提供文本到语音合成和说话人信息检索,允许用户从文本生成和播放语音音频。

  1. 功能
    1. 设置
      1. 启动 VOICEVOX 引擎(推荐使用 Docker)
      2. 安装并构建依赖项
    2. 如何使用
      1. 光标设置示例
    3. 防范措施
      1. 执照

        Related MCP Servers

        • -
          security
          F
          license
          -
          quality
          Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
          Last updated -
          2
          Python
        • -
          security
          F
          license
          -
          quality
          A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
          Last updated -
          239
          JavaScript
          • Apple
          • Linux
        • -
          security
          A
          license
          -
          quality
          A Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.
          Last updated -
          TypeScript
          MIT License
        • A
          security
          A
          license
          A
          quality
          A Model Context Protocol server that enables AI models to generate and play high-quality text-to-speech audio through your device's native audio system using Rime's voice synthesis API.
          Last updated -
          1
          176
          4
          JavaScript
          The Unlicense
          • Apple
          • Linux

        View all related MCP servers

        MCP directory API

        We provide all the information about MCP servers via our MCP API.

        curl -X GET 'https://glama.ai/api/mcp/v1/servers/Yuki10Kobayashi/voicevox-mcp'

        If you have feedback or need assistance with the MCP directory API, please join our Discord server