Audio Processing
Services for manipulating, generating, and working with audio content. Includes audio synthesis, processing, playback control, and format conversion capabilities.
MCP ServersBrowse all →
- AsecurityAlicenseAqualityGaudio Lab Audio AI — Stem Separation, DME Separation, AI Text SyncLast updated27183MIT
- AsecurityAlicenseAqualitySuno AI music generation with custom lyrics, song extension, cover/remix creation, lyrics generation, and persona management for reusable voice styles.Last updated273MIT
- AsecurityAlicenseAqualityEnables AI assistants to control Audacity for real-time local audio editing, mastering, and transcription through over 90 specialized tools. It allows users to perform complex audio processing tasks like noise reduction and podcast cleanup using natural language commands.Last updated10019Apache 2.0
- AsecurityAlicense-qualityTranscribes videos from 1000+ platforms (YouTube, TikTok, Vimeo, etc.) and local video files using OpenAI's Whisper model, with support for 90+ languages and multiple output formats.Last updated14111MIT
- AsecurityAlicenseAqualityAI-powered speech tools by Brainiall: pronunciation assessment with phoneme-level feedback, speech-to-text with language detection, and text-to-speech with multiple voices.Last updated4MIT
- AsecurityAlicenseAqualityGemini Audio MCP is a high-performance Model Context Protocol (MCP) server that leverages the power of the Gemini 2.0 Multimodal Live API to generate high-fidelity, environmental soundscapes on-demand.Last updated9MIT
- AsecurityAlicenseBqualityAgent-native media processing: video encoding, image manipulation, document conversion, audio transcription, and more via 86+ cloud Robots.Last updated771MIT
- AsecurityAlicenseAqualityAn MCP server that enables transcribing local audio files and Telegram voice messages using OpenAI's Whisper via local inference or cloud API. It supports multiple audio formats, automatic language detection, and optional word-level timestamps for AI-powered audio analysis.Last updated5MIT

ElevenLabs MCP Serverofficial
AsecurityAlicense-qualityAn official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.Last updated241,310MIT
MCP ConnectorsBrowse all →
AI music and podcast platform for autonomous agents. SoundCloud for AI bots.
AudioAlpha turns 100+ daily finance and crypto podcasts into structured intelligence — α-sentiment scores, narrative signals, asset mentions, transcripts, and market snapshots with 40+ custom metrics. Built for AI-driven research and trading workflows.
Financial podcast intelligence platform — sentiment, narrative, and asset signals from 100+ podcasts
Generate game assets with AI: sprites, 3D models, animations, sound effects, music, and voices.
AI image, video & music generation. Flux, Veo 3.1, Suno V5. Free tier included.
125+ browser tools for PDF, Image, Video, Audio, AI, Scanner. Files never leave your device.
Transcribe and summarize audio and video. Pay per job via Stripe or crypto.
Media intelligence analysis for audio, video, and images via the Echosaw MCP server.
Pronunciation scoring, speech-to-text, and text-to-speech for language learning