text_to_sound_effects
Generate sound effects from text descriptions using ElevenLabs API. Specify duration, output format, and save location for audio files used in projects requiring custom sound design.
Instructions
Convert text description of a sound effect to sound effect with a given duration and save the output audio file to a given directory. Directory is optional, if not provided, the output file will be saved to $HOME/Desktop. Duration must be between 0.5 and 5 seconds.
⚠️ COST WARNING: This tool makes an API call to ElevenLabs which may incur costs. Only use when explicitly requested by the user.
Args:
text: Text description of the sound effect
duration_seconds: Duration of the sound effect in seconds
output_directory: Directory where files should be saved.
Defaults to $HOME/Desktop if not provided.
loop: Whether to loop the sound effect. Defaults to False.
output_format (str, optional): Output format of the generated audio. Formatted as codec_sample_rate_bitrate. So an mp3 with 22.05kHz sample rate at 32kbs is represented as mp3_22050_32. MP3 with 192kbps bitrate requires you to be subscribed to Creator tier or above. PCM with 44.1kHz sample rate requires you to be subscribed to Pro tier or above. Note that the μ-law format (sometimes written mu-law, often approximated as u-law) is commonly used for Twilio audio inputs.
Defaults to "mp3_44100_128". Must be one of:
mp3_22050_32
mp3_44100_32
mp3_44100_64
mp3_44100_96
mp3_44100_128
mp3_44100_192
pcm_8000
pcm_16000
pcm_22050
pcm_24000
pcm_44100
ulaw_8000
alaw_8000
opus_48000_32
opus_48000_64
opus_48000_96
opus_48000_128
opus_48000_192
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| duration_seconds | No | ||
| loop | No | ||
| output_directory | No | ||
| output_format | No | mp3_44100_128 | |
| text | Yes |
Implementation Reference
- elevenlabs_mcp/server.py:360-386 (handler)The handler function that implements the core logic of the 'text_to_sound_effects' tool. It validates input, generates sound effects audio using the ElevenLabs client, saves the audio file, and returns the file path.def text_to_sound_effects( text: str, duration_seconds: float = 2.0, output_directory: str | None = None, output_format: str = "mp3_44100_128", loop: bool = False, ) -> TextContent: if duration_seconds < 0.5 or duration_seconds > 5: make_error("Duration must be between 0.5 and 5 seconds") output_path = make_output_path(output_directory, base_path) output_file_name = make_output_file("sfx", text, output_path, "mp3") audio_data = client.text_to_sound_effects.convert( text=text, output_format=output_format, duration_seconds=duration_seconds, loop=loop, ) audio_bytes = b"".join(audio_data) with open(output_path / output_file_name, "wb") as f: f.write(audio_bytes) return TextContent( type="text", text=f"Success. File saved as: {output_path / output_file_name}", )
- elevenlabs_mcp/server.py:325-359 (registration)The @mcp.tool decorator registers the 'text_to_sound_effects' tool, including its detailed description, parameters, and usage instructions.@mcp.tool( description="""Convert text description of a sound effect to sound effect with a given duration and save the output audio file to a given directory. Directory is optional, if not provided, the output file will be saved to $HOME/Desktop. Duration must be between 0.5 and 5 seconds. ⚠️ COST WARNING: This tool makes an API call to ElevenLabs which may incur costs. Only use when explicitly requested by the user. Args: text: Text description of the sound effect duration_seconds: Duration of the sound effect in seconds output_directory: Directory where files should be saved. Defaults to $HOME/Desktop if not provided. loop: Whether to loop the sound effect. Defaults to False. output_format (str, optional): Output format of the generated audio. Formatted as codec_sample_rate_bitrate. So an mp3 with 22.05kHz sample rate at 32kbs is represented as mp3_22050_32. MP3 with 192kbps bitrate requires you to be subscribed to Creator tier or above. PCM with 44.1kHz sample rate requires you to be subscribed to Pro tier or above. Note that the μ-law format (sometimes written mu-law, often approximated as u-law) is commonly used for Twilio audio inputs. Defaults to "mp3_44100_128". Must be one of: mp3_22050_32 mp3_44100_32 mp3_44100_64 mp3_44100_96 mp3_44100_128 mp3_44100_192 pcm_8000 pcm_16000 pcm_22050 pcm_24000 pcm_44100 ulaw_8000 alaw_8000 opus_48000_32 opus_48000_64 opus_48000_96 opus_48000_128 opus_48000_192 """ )