대화형 음성 MCP 서버

대화형 음성 MCP 서버(Kokoro TTS + NeMo ASR)

Kokoro를 사용한 텍스트-음성(TTS) 기능과 NVIDIA NeMo Parakeet 모델을 사용한 음성-텍스트(STT) 기능을 제공하는 모델 컨텍스트 프로토콜 서버로, 대화형 음성 대화가 가능합니다.

사용 가능한 도구

interactive_voice_dialog - 텍스트를 음성으로 합성하고 재생한 다음, 사용자 음성 입력을 듣고 음성 변환본을 반환합니다.
- 필수 인수:
  - text_to_speak (문자열): 도우미가 말할 텍스트입니다.
- 선택 인수:
  - voice (문자열): TTS에 사용할 음성(예: 'af_heart'). 기본값은 'af_heart'입니다.

설치

필수 조건

일부 기본 TTS 모델을 사용하려면 시스템에 espeak-ng 설치해야 합니다.

Windows 설치:

espeak-ng 릴리스 로 이동하세요.
"최신 릴리스"를 클릭하세요.
적절한 *.msi 파일(예: espeak-ng-20191129-b702b03-x64.msi )을 다운로드하세요.
다운로드한 설치 프로그램을 실행합니다.

지역 개발 설치

Claude Desktop에서 python -m mcp_server_tts 명령을 사용하여 이 서버를 실행하려면 Python 모듈로 설치해야 합니다. 개발 환경에서는 "편집 가능" 모드( -e )로 설치하는 것이 좋습니다. 이렇게 하면 소스 코드 변경 사항이 재설치 없이 즉시 반영됩니다.

pyproject.toml 파일이 있는 디렉토리(이 서버 프로젝트의 루트)로 이동하여 다음을 실행합니다.

지엑스피1

설치 후 다음을 사용하여 스크립트로 실행할 수 있습니다.

python -m mcp_server_tts.server # Assuming the main module is still server.py within mcp_server_tts
# Or, if you create a new package structure:
# python -m mcp_interactive_voice_server

구성

이 서버를 Claude Desktop과 함께 사용하려면 claude_desktop_config.json 파일에 추가해야 합니다. 이 파일의 위치는 일반적으로 C:\Users\<YourUsername>\AppData\Roaming\Claude\claude_desktop_config.json 입니다.

claude_desktop_config.json 의 mcpServers 개체 아래에 다음 항목을 추가합니다.

    "tts": {
      "command": "python",
      "args": ["-m", "mcp_server_tts"]
    }

예를 들어, mcpServers 섹션은 다음과 같습니다.

{
  // ... other configurations ...
  "mcpServers": {
    // ... other servers ...
    "tts": {
      "command": "python",
      "args": ["-m", "mcp_server_tts"]
    }
    // ... other servers ...
  }
  // ... other configurations ...
}

This server cannot be installed

security - not tested

license - not found

quality - not tested

How are these scores calculated?

local-only server

The server can only run on the client's local machine because it depends on local resources.

Kokoro TTS를 사용하여 텍스트를 음성으로 변환하고 NVIDIA NeMo ASR을 사용하여 사용자 응답을 필사하여 Claude와의 음성 기반 상호작용을 가능하게 하고, 대화형 음성 대화를 생성합니다.

Related MCP Servers

mcp-server-neonofficial
neondatabase-labs
-
security
A
license
-
quality
Lets you use Claude Desktop, or any MCP Client, to use natural language to accomplish things with Neon.
Last updated -
664
386
TypeScript
MIT License
Zonos TTS MCP Server
PhialsBasement
A
security
F
license
A
quality
Facilitates direct speech generation using Claude for multiple languages and emotions, integrating with a Zonos TTS setup via the Model Context Protocol.
Last updated -
1
0
12
TypeScript
Azure MCP Server
kalivaraprasad-gonapa
A
security
A
license
A
quality
Enables natural language interaction with Azure services through Claude Desktop, supporting resource management, subscription handling, and tenant selection with secure authentication.
Last updated -
3
7
14
TypeScript
MIT License
TranscriptionTools MCP Server
MushroomFleet
A
security
A
license
A
quality
Provides intelligent transcript processing capabilities for Claude, featuring natural formatting, contextual repair, and smart summarization powered by Deep Thinking LLMs.
Last updated -
4
15
TypeScript
MIT License

View all related MCP servers

Interactive Voice MCP Server