synthesize_speech

synthesize_speech

Convert text into natural-sounding speech with customizable voice, emotion, speed, and pacing. Ideal for creating expressive audio content in various formats for professional use.

Instructions

Convert text to speech with advanced voice controls and natural expression

Input Schema

Name	Required	Description	Default
`emotion`	No	Voice emotion	neutral
`filename`	No	Custom filename for saved audio
`outputFormat`	No	Audio output format	wav
`pacing`	No	Speech pacing style	natural
`saveFile`	No	Save audio to file
`speed`	No	Speech speed (0.25-3.0)
`text`	Yes	Text to convert to speech
`voiceId`	No	Voice to use for synthesis	af_heart
`volume`	No	Audio volume (0.1-2.0)

Input Schema (JSON Schema)

{
  "properties": {
    "emotion": {
      "default": "neutral",
      "description": "Voice emotion",
      "enum": [
        "neutral",
        "happy",
        "excited",
        "calm",
        "serious",
        "casual",
        "confident"
      ],
      "type": "string"
    },
    "filename": {
      "description": "Custom filename for saved audio",
      "type": "string"
    },
    "outputFormat": {
      "default": "wav",
      "description": "Audio output format",
      "enum": [
        "wav",
        "mp3",
        "flac",
        "ogg"
      ],
      "type": "string"
    },
    "pacing": {
      "default": "natural",
      "description": "Speech pacing style",
      "enum": [
        "natural",
        "conversational",
        "presentation",
        "tutorial",
        "narrative",
        "fast",
        "slow"
      ],
      "type": "string"
    },
    "saveFile": {
      "default": false,
      "description": "Save audio to file",
      "type": "boolean"
    },
    "speed": {
      "default": 1,
      "description": "Speech speed (0.25-3.0)",
      "maximum": 3,
      "minimum": 0.25,
      "type": "number"
    },
    "text": {
      "description": "Text to convert to speech",
      "maxLength": 10000,
      "type": "string"
    },
    "voiceId": {
      "default": "af_heart",
      "description": "Voice to use for synthesis",
      "enum": [
        "af_heart",
        "af_sky",
        "af_bella",
        "af_sarah",
        "af_nicole",
        "am_adam",
        "am_michael",
        "bf_emma",
        "bf_isabella",
        "bm_lewis"
      ],
      "type": "string"
    },
    "volume": {
      "default": 1,
      "description": "Audio volume (0.1-2.0)",
      "maximum": 2,
      "minimum": 0.1,
      "type": "number"
    }
  },
  "required": [
    "text"
  ],
  "type": "object"
}

Advanced TTS MCP Server

Instructions

Input Schema

Input Schema (JSON Schema)

Other Tools from Advanced TTS MCP Server

Related Tools

MCP directory API