ElevenLabs MCP Server

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
ELEVENLABS_STYLENoValue for styling of generated speech, default is 0.1.0.1
ELEVENLABS_API_KEYYesYour ElevenLabs API key required for accessing the text-to-speech service.
ELEVENLABS_MODEL_IDNoThe model ID for ElevenLabs, default is 'eleven_flash_v2'.eleven_flash_v2
ELEVENLABS_VOICE_IDYesThe ID of the voice you wish to use for text-to-speech conversion.
ELEVENLABS_STABILITYNoValue representing the stability of the generated speech, default is 0.5.0.5
ELEVENLABS_OUTPUT_DIRNoDirectory where output audio files will be stored, default is 'output'.output
ELEVENLABS_SIMILARITY_BOOSTNoValue representing the similarity boost, default is 0.75.0.75

Schema

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Tools

Functions exposed to the LLM to take actions

NameDescription
generate_audio_simpleGenerate audio from plain text using default voice settings
generate_audio_scriptGenerate audio from a structured script with multiple voices and actors. Accepts either: 1. Plain text string 2. JSON string with format: { "script": [ { "text": "Text to speak", "voice_id": "optional-voice-id", "actor": "optional-actor-name" }, ... ] }
delete_jobDelete a voiceover job and its associated files
get_audio_fileGet the audio file content for a specific job
list_voicesGet a list of all available ElevenLabs voices with metadata
get_voiceover_historyGet voiceover job history. Optionally specify a job ID for a specific job.