ElevenLabs MCP Server

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
ELEVENLABS_STYLENoValue for styling of generated speech, default is 0.1.0.1
ELEVENLABS_API_KEYYesYour ElevenLabs API key required for accessing the text-to-speech service.
ELEVENLABS_MODEL_IDNoThe model ID for ElevenLabs, default is 'eleven_flash_v2'.eleven_flash_v2
ELEVENLABS_VOICE_IDYesThe ID of the voice you wish to use for text-to-speech conversion.
ELEVENLABS_STABILITYNoValue representing the stability of the generated speech, default is 0.5.0.5
ELEVENLABS_OUTPUT_DIRNoDirectory where output audio files will be stored, default is 'output'.output
ELEVENLABS_SIMILARITY_BOOSTNoValue representing the similarity boost, default is 0.75.0.75

Schema

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Tools

Functions exposed to the LLM to take actions

NameDescription
generate_audio_simple

Generate audio from plain text using default voice settings

generate_audio_script

Generate audio from a structured script with multiple voices and actors. Accepts either: 1. Plain text string 2. JSON string with format: { "script": [ { "text": "Text to speak", "voice_id": "optional-voice-id", "actor": "optional-actor-name" }, ... ] }

delete_job

Delete a voiceover job and its associated files

get_audio_file

Get the audio file content for a specific job

list_voices

Get a list of all available ElevenLabs voices with metadata

get_voiceover_history

Get voiceover job history. Optionally specify a job ID for a specific job.