Create custom voice profiles from audio samples for text-to-speech and speech-to-speech applications. Analyze MP3 or WAV files to generate voice replicas that match the characteristics of your audio input.
Convert audio files from one voice to another using ElevenLabs voice transformation technology. Provide an input audio file and specify the target voice to generate new audio output.
Convert audio files to different voices while preserving speech content. Supports MP3 and WAV formats up to 50MB, with options for voice similarity and background noise removal.
Convert text to audio with customizable voice, speed, and emotion, and save the resulting file to a specified directory. Integrates with the MiniMax API for high-quality speech synthesis.
A server that allows Claude to control audio playback on your computer, supporting MP3, WAV, and OGG files with play, list, and stop commands.
A voice-to-text transcription service that converts audio files to transcripts using SiliconFlow, accepting audio either as multipart/form-data uploads or as base64-encoded data.
A portable, Dockerized Python tool that implements the Model Context Protocol for audio transcription using Whisper models, with both a CLI and a web UI for converting audio files to JSON transcriptions.
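
For the SiliconFlow transcription entry above, which accepts audio as either multipart/form-data or base64, the difference between the two encodings can be sketched as follows. This is a generic illustration using only the Python standard library, not that server's actual interface: the function names and the field names `file`, `filename`, and `audio_base64` are hypothetical.

```python
import base64
import json
import uuid
from typing import Optional, Tuple


def build_base64_payload(audio: bytes, filename: str) -> str:
    """Return a JSON body that carries the audio as base64 text."""
    return json.dumps({
        "filename": filename,  # hypothetical field name
        "audio_base64": base64.b64encode(audio).decode("ascii"),  # hypothetical field name
    })


def build_multipart_body(
    audio: bytes, filename: str, boundary: Optional[str] = None
) -> Tuple[bytes, str]:
    """Return (body, content_type) for a single-file multipart/form-data upload."""
    boundary = boundary or uuid.uuid4().hex
    # Part header: the form field is named "file" here (an assumption).
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    # Closing delimiter: the boundary followed by two hyphens.
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + audio + tail, f"multipart/form-data; boundary={boundary}"
```

Base64 keeps the whole request as JSON text at the cost of roughly a one-third size increase, while multipart sends the raw bytes unchanged, which tends to matter more for large audio files.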