Enables configuration of the server through environment variables, allowing customization of API endpoints and speaker settings
Used for cloning the repository and managing the source code
Hosts the repository and provides issue tracking for bug reports and feature requests
Runtime environment required to run the MCP server (version 18.x or higher)
Package manager used for installing dependencies and running scripts (version 9.x or higher)
Provides type-safe design for the server implementation
AivisSpeech MCP Server
This is an implementation of the Model Context Protocol (MCP) server for AivisSpeech. This server works with the AivisSpeech Engine to provide an interface for speech synthesis. Through the MCP protocol, it becomes easy to use AivisSpeech's speech synthesis function from applications such as AI assistants.
overview
AivisSpeech MCP Server provides the following features:
- MCP protocol compliant API endpoints
- High-quality voice synthesis in cooperation with AivisSpeech Engine
- Type-safe design with TypeScript
- Easy to configure and highly scalable architecture
Prerequisites
- Node.js 18.x or higher
- npm 9.x or higher
- AivisSpeech Engine (requires separate installation)
install
Environment settings
In your .env
file, set the following:
Cursor MCP Settings
In .cursor/mcp.json
file, configure the following:
Replace /path/to/aivis-speech-mcp/dist/index.js
with the path to your project. For Windows, escape the backslashes or use forward slashes. For example: "C:/Users/username/path/to/aivis-speech-mcp/dist/index.js"
How to use
Development Mode
During development you can start the server with hot reloading with the following command:
Build
To build for production run the following command:
Production Mode
After building, start the server in production mode with the following command:
test
To run the tests use the following command:
architecture
AivisSpeech MCP Server consists of the following components:
- MCP Service : Provides a server that complies with the Model Context Protocol and processes requests from clients.
- AivisSpeech Service : Communicates with the AivisSpeech Engine API and performs speech synthesis.
API Specifications
We provide API endpoints that comply with the MCP protocol. The main features are as follows:
- Speech synthesis (generating speech from text)
- Get speaker information
- Audio style settings
For detailed API specifications, please refer to AivisSpeech Engine API Specification .
Integration with MCP protocol
This server implements the Model Context Protocol (MCP) and can be used seamlessly with applications such as AI assistants. For more information about the MCP protocol, see the official MCP documentation .
troubleshooting
Common issues and solutions:
- Can't connect to AivisSpeech Engine : Check if
AIVIS_SPEECH_API_URL
in.env
file is set correctly - No sound : Check your system's audio settings and make sure the correct audio device is selected
- Speaker ID not found : Make sure AivisSpeech Engine is running properly and check for available speaker IDs.
contribution
Please report bugs and feature requests via our GitHub issue tracker. Pull requests are also welcome.
license
Acknowledgements
- Aivis Speech Engine Team
- Developer of Model Context Protocol
This server cannot be installed
hybrid server
The server is able to function both locally and remotely, depending on the configuration or use case.
A Model Context Protocol server that enables AI assistants to utilize AivisSpeech Engine's high-quality voice synthesis capabilities through a standardized API interface.
Related MCP Servers
- AsecurityAlicenseAqualityA Model Context Protocol server that allows AI assistants to interact with the VRChat API, enabling retrieval of user information, friends lists, and other VRChat data through a standardized interface.Last updated -72239TypeScriptMIT License
- -securityFlicense-qualityA comprehensive Model Context Protocol server implementation that enables AI assistants to interact with file systems, databases, GitHub repositories, web resources, and system tools while maintaining security and control.Last updated -161TypeScript
- AsecurityFlicenseAqualityA Model Context Protocol server that enables AI assistants to interact with the Omi API for retrieving and creating conversations and memories for users.Last updated -42TypeScript
- -securityFlicense-qualityA Model Context Protocol server that provides AI models with structured access to external data and services, acting as a bridge between AI assistants and applications, databases, and APIs in a standardized, secure way.Last updated -Python