TTS-MCP

tts-mcp

OpenAI TTS API를 사용하여 고품질 텍스트-음성 생성을 위한 MCP(모델 컨텍스트 프로토콜) 서버 및 명령줄 도구입니다.

주요 특징

MCP 서버 : Claude Desktop 및 기타 MCP 호환 클라이언트와 텍스트-음성 변환 기능 통합
음성 옵션 : 다양한 음성 캐릭터 지원(합금, 노바, 에코 등)
고품질 오디오 : 다양한 출력 포맷 지원(MP3, WAV, OPUS, AAC)
사용자 정의 가능 : 음성 속도, 음성 특성 및 추가 지침 구성
CLI 도구 : 텍스트-음성 직접 변환을 위한 명령줄 유틸리티로도 사용 가능

설치

방법 1: 저장소에서 설치

지엑스피1

방법 2: npx로 직접 실행(설치 필요 없음)

# Start the MCP server directly
npx tts-mcp tts-mcp-server --voice nova --model tts-1-hd

# Use the CLI tool directly
npx tts-mcp -t "Hello, world" -o hello.mp3

MCP 서버 사용

MCP 서버를 사용하면 Claude Desktop과 같은 MCP(Model Context Protocol) 호환 클라이언트와 텍스트-음성 기능을 통합할 수 있습니다.

MCP 서버 시작

# Start with default settings
npm run server

# Start with custom settings
npm run server -- --voice nova --model tts-1-hd

# Or directly with API key
node bin/tts-mcp-server.js --voice echo --api-key your-openai-api-key

MCP 서버 옵션

Options:
  -V, --version       Display version information
  -m, --model <model> TTS model to use (default: "gpt-4o-mini-tts")
  -v, --voice <voice> Voice character (default: "alloy")
  -f, --format <format> Audio format (default: "mp3")
  --api-key <key>     OpenAI API key (can also be set via environment variable)
  -h, --help          Display help information

MCP 클라이언트와 통합

MCP 서버는 Claude Desktop 및 기타 MCP 호환 클라이언트와 함께 사용할 수 있습니다. Claude Desktop 통합:

Claude Desktop 구성 파일을 엽니다(일반적으로 ~/Library/Application Support/Claude/claude_desktop_config.json 에 있음)
OpenAI API 키를 포함하여 다음 구성을 추가합니다.

{
  "mcpServers": {
    "tts-mcp": {
      "command": "node",
      "args": ["full/path/to/bin/tts-mcp-server.js", "--voice", "nova", "--api-key", "your-openai-api-key"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}

또는 npx를 사용하면 설정이 더 간편해집니다.

{
  "mcpServers": {
    "tts-mcp": {
      "command": "npx",
      "args": ["-p", "tts-mcp", "tts-mcp-server", "--voice", "nova", "--model", "gpt-4o-mini-tts"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}

API 키는 두 가지 방법으로 제공할 수 있습니다.

직접 방법 (테스트용으로 권장): --api-key 매개변수를 사용하여 args 배열에 포함합니다.
환경 변수 방법 (더욱 안전함): 위에 표시된 대로 env 객체에 설정합니다.

보안 참고 사항 : API 키를 포함할 때 구성 파일을 보호해야 합니다.

Claude Desktop을 다시 시작하세요
Claude에게 "이 텍스트를 소리내어 읽어주세요" 또는 이와 유사한 요청을 하면 텍스트가 음성으로 변환됩니다.

사용 가능한 MCP 도구

텍스트 음성 변환 : 텍스트를 음성으로 변환하고 재생하는 도구

CLI 도구 사용법

tts-mcp를 독립 실행형 명령줄 도구로 사용할 수도 있습니다.

# Convert text directly
tts-mcp -t "Hello, world" -o hello.mp3

# Convert from a text file
tts-mcp -f speech.txt -o speech.mp3

# Specify custom voice
tts-mcp -t "Welcome to the future" -o welcome.mp3 -v nova

CLI 도구 옵션

Options:
  -V, --version           Display version information
  -t, --text <text>       Text to convert
  -f, --file <path>       Path to input text file
  -o, --output <path>     Path to output audio file (required)
  -m, --model <n>         Model to use (default: "gpt-4o-mini-tts")
  -v, --voice <n>         Voice character (default: "alloy")
  -s, --speed <number>    Speech speed (0.25-4.0) (default: 1)
  --format <format>       Output format (default: "mp3")
  -i, --instructions <text> Additional instructions for speech generation
  --api-key <key>         OpenAI API key (can also be set via environment variable)
  -h, --help              Display help information

지원되는 음성

지원되는 음성 캐릭터는 다음과 같습니다.

합금(기본값)
금연 건강 증진 협회
산호
에코
우화
오닉스
신성
세이지
반짝임

지원 모델

tts-1
tts-1-hd
gpt-4o-mini-tts(기본값)

출력 형식

다음과 같은 출력 형식이 지원됩니다.

mp3(기본값)
작
aac
FLAC
웨이브
피씨엠

환경 변수

시스템 환경 변수를 사용하여 도구를 구성할 수도 있습니다.

OPENAI_API_KEY=your-api-key-here

특허

MIT

This server cannot be installed

security - not tested

license - permissive license

quality - not tested

How are these scores calculated?

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Claude Desktop 및 기타 MCP 호환 클라이언트와 고품질 텍스트-음성 변환 기능을 통합하여 다양한 음성 옵션과 오디오 형식을 지원하는 모델 컨텍스트 프로토콜 서버입니다.

Related MCP Servers

Kokoro TTS MCP Server
giannisanni
-
security
F
license
-
quality
Provides text-to-speech capabilities through the Model Context Protocol, allowing applications to easily integrate speech synthesis with customizable voices, adjustable speech speed, and cross-platform audio playback support.
Last updated -
2
Python
Speech MCP Server
hammeiam
-
security
F
license
-
quality
A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
Last updated -
239
JavaScript
ElevenLabs MCP Serverofficial
elevenlabs
A
security
A
license
A
quality
An official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.
Last updated -
19
771
Python
MIT License
Gladia MCPofficial
gladiaio
-
security
-
license
-
quality
Official Model Context Protocol server that enables interaction with powerful Speech-to-Text and Audio Intelligence APIs, allowing clients like Claude Desktop to transcribe audio, analyze speech, translate content, and more.
Last updated -
Python
MIT License

View all related MCP servers