Audio Processing

MCP Servers for Audio Processing

Services for manipulating, generating, and working with audio content. Includes audio synthesis, processing, playback control, and format conversion capabilities.


ElevenLabs MCP Server (official)
elevenlabs
A
security
A
license
A
quality
An official Model Context Protocol (MCP) server that enables AI clients to interact with ElevenLabs' Text to Speech and audio processing APIs, allowing for speech generation, voice cloning, audio transcription, and other audio-related tasks.
Last updated -
19
843
Python
MIT License
Kokoro Text to Speech MCP Server
mberg
A
security
A
license
A
quality
A server that generates MP3 audio files from text using Kokoro TTS technology with optional S3 upload capabilities.
Last updated -
1
48
Python
Apache 2.0
Ableton Copilot MCP
xiaolaa2
A
security
A
license
A
quality
A Model Context Protocol server that enables real-time interaction with Ableton Live, allowing AI assistants to control song creation, track management, clip operations, and audio recording workflows.
Last updated -
23
29
32
TypeScript
MIT License
MCP Make Sound
nocoo
A
security
A
license
A
quality
A Model Context Protocol server for macOS that enables AI assistants to play system sounds for audio feedback, offering informational, warning, and error sound options.
Last updated -
3
JavaScript
MIT License
Audio Transcriber MCP Server
Ichigo3766
A
security
A
license
A
quality
An MCP server that enables transcription of audio files using OpenAI's Speech-to-Text API, with support for multiple languages and file saving options.
Last updated -
1
2
7
JavaScript
MIT License
mcp-svstudio
ocadaruma
A
security
A
license
A
quality
An MCP server for Synthesizer V AI Vocal Studio that allows LLMs to create and edit vocal tracks, for example by adding lyrics to a melody.
Last updated -
6
5
Apache 2.0
Say MCP Server
bmorphism
A
security
A
license
A
quality
Enables text-to-speech functionality on macOS using the say command, offering extensive control over speech parameters like voice, rate, volume, and pitch for a customizable auditory experience.
Last updated -
2
6
15
JavaScript
MIT License
Decent-Sampler Drums MCP Server
dandeliongold
A
security
A
license
A
quality
Facilitates the creation of DecentSampler drum kit configurations, supporting WAV file analysis and XML generation to ensure accurate sample lengths and well-structured presets.
Last updated -
5
1
1
TypeScript
MIT License
supercollider-mcp
Synohara
A
security
A
license
A
quality
supercollider-mcp
Last updated -
2
0
4
JavaScript
MIT License
ElevenLabs MCP Server
mamertofabian
A
security
A
license
A
quality
Integrates with ElevenLabs text-to-speech API.
Last updated -
6
104
Python
MIT License
mcp-audio-analysis
hugohow
A
security
A
license
A
quality
An MCP server for analysing local audio files.
Last updated -
8
14
Python
MIT License
VOICEVOX MCP Server
Yuki10Kobayashi
A
security
A
license
A
quality
A Model Context Protocol server that integrates with VOICEVOX engine to provide text-to-speech synthesis and speaker information retrieval, allowing users to generate and play voice audio from text.
Last updated -
2
TypeScript
MIT License
MIDI File MCP
xiaolaa2
A
security
A
license
A
quality
A powerful MCP tool for parsing and manipulating MIDI files that allows users to read, analyze, and modify MIDI files through natural language commands, supporting operations like reading file information, modifying tracks, adding notes, and setting tempo.
Last updated -
11
254
4
JavaScript
MIT License
mcp-hfspace
evalstate
A
security
A
license
A
quality
Use HuggingFace Spaces directly from Claude. Use Open Source Image Generation, Chat, Vision tasks and more. Supports Image, Audio and text uploads/downloads.
Last updated -
3
397
335
TypeScript
MIT License
Zoom Recordings No-Auth
peakmojo
A
security
A
license
A
quality
An MCP server for accessing Zoom recordings and transcripts without requiring direct authentication from the end user.
Last updated -
4
1
3
Python
Apache 2.0
Advanced TTS MCP Server
samihalawa
A
security
A
license
A
quality
Provides high-quality text-to-speech synthesis with 10 natural voices, emotion control, and dynamic pacing for professional applications requiring expressive speech output.
Last updated -
5
1
JavaScript
MIT License
MCP Video Recognition Server
mario-andreschak
A
security
A
license
A
quality
Provides tools for image, audio, and video recognition using Google's Gemini AI through the Model Context Protocol.
Last updated -
3
9
TypeScript
MIT License
Bouyomichan MCP Server (Node.js version)
uraoz
A
security
A
license
A
quality
A Model Context Protocol server that enables AI assistants like Claude to use Bouyomichan (a Japanese text-to-speech program) for voice reading with adjustable voice types, volume, speed, and pitch.
Last updated -
1
1
JavaScript
MIT License
Gladia MCP (official)
gladiaio
-
security
A
license
-
quality
Official Model Context Protocol server that enables interaction with powerful Speech-to-Text and Audio Intelligence APIs, allowing clients like Claude Desktop to transcribe audio, analyze speech, translate content, and more.
Last updated -
2
Python
MIT License
MCP FFmpeg Video Processor
bitscorp-mcp
A
security
F
license
A
quality
A Node.js server that enables video manipulation through natural language requests, including resizing videos to different resolutions (360p to 1080p) and extracting audio in various formats (MP3, AAC, WAV, OGG).
Last updated -
4
97
25
TypeScript
Flyworks MCP (official)
Flyworks-AI
-
security
A
license
-
quality
A Model Context Protocol server that enables fast and free lipsync video creation for a wide range of digital avatars, supporting both audio and text inputs to generate synchronized lip movements.
Last updated -
90
Python
MIT License
MCPollinations Multimodal MCP Server
pinkpixel-dev
A
security
F
license
A
quality
A Model Context Protocol server that enables AI assistants to generate images, text, and audio through the Pollinations APIs without requiring authentication.
Last updated -
7
564
27
JavaScript
play-sound-mcp-server
PetitBaguette
A
security
F
license
A
quality
An MCP server that plays local sound files on macOS using the afplay command, allowing AI assistants to trigger audio notifications after responding.
Last updated -
1
TypeScript
Cursor Sound MCP
bcharleson
A
security
F
license
A
quality
Plays sound effects when Cursor AI completes code generation, providing audio feedback for a more interactive coding experience.
Last updated -
1
2
TypeScript
Suno-MCP
lioensky
A
security
F
license
A
quality
A Model Context Protocol server that allows AI assistants to generate music through the Suno API, supporting custom lyrics and style inputs or inspiration-based creation.
Last updated -
1
9
JavaScript
gong-mcp
MaPa07
A
security
F
license
A
quality
gong-mcp
Last updated -
6
JavaScript
Voice Recorder MCP Server
DefiBax
-
security
A
license
-
quality
Enables recording audio from a microphone and transcribing it using OpenAI's Whisper model. Works as both a standalone MCP server and a Goose AI agent extension.
Last updated -
6
Python
MIT License
TTS-MCP
nakamurau1
-
security
A
license
-
quality
A Model Context Protocol server that integrates high-quality text-to-speech capabilities with Claude Desktop and other MCP-compatible clients, supporting multiple voice options and audio formats.
Last updated -
14
1
TypeScript
MIT License
MCP FFmpeg Helper
sworddut
-
security
A
license
-
quality
A lightweight server that exposes FFmpeg's video processing capabilities to AI assistants through the Model Context Protocol (MCP), supporting operations like video format conversion, audio extraction, and adding watermarks.
Last updated -
279
15
TypeScript
MIT License
Sonic Pi MCP
abhishekjairath
-
security
A
license
-
quality
A Model Context Protocol server that allows AI assistants like Claude and Cursor to create music and control Sonic Pi programmatically through OSC messages.
Last updated -
331
7
TypeScript
MIT License
MCP-Audio Plugin
AIO-2030
-
security
A
license
-
quality
A voice-to-text transcription service that converts audio files to transcripts using SiliconFlow, supporting both multipart/form-data and base64 formats.
Last updated -
7
Python
Apache 2.0
Video & Audio Editing MCP Server
misbahsy
-
security
A
license
-
quality
Provides powerful video and audio editing capabilities through FFmpeg, enabling AI assistants to perform professional-grade operations including format conversion, trimming, overlays, transitions, and advanced audio processing.
Last updated -
16
Python
MIT License
AbletonMCP
ahujasid
-
security
A
license
-
quality
Connects Ableton Live to Claude AI through the Model Context Protocol, enabling AI-assisted music production by allowing Claude to directly interact with and control Ableton Live sessions.
Last updated -
1,794
Python
MIT License
MCP Audio Transcriber
ShreyasTembhare
-
security
A
license
-
quality
A portable, Dockerized Python tool that implements the Model Context Protocol for audio transcription using Whisper models, featuring both a CLI and a web UI for converting audio files to JSON transcriptions.
Last updated -
Python
MIT License
MCP Video Digest
R-lz
-
security
A
license
-
quality
A service that extracts and transcribes audio content from videos across 1000+ streaming websites including YouTube, Bilibili, TikTok, and Twitter, supporting multiple transcription providers like Deepgram, Gladia, Speechmatics, and AssemblyAI.
Last updated -
20
Python
MIT License
Freesound MCP Server
timjrobinson
-
security
A
license
-
quality
An MCP server that enables AI assistants to search, analyze, and retrieve information about audio samples from Freesound.org through their API.
Last updated -
JavaScript
MIT License
Voice Recognition MCP Service
yangsenessa
-
security
A
license
-
quality
Provides voice recognition and text extraction capabilities with support for both stdio and MCP modes, processing audio files or base64 encoded data and returning structured results with language, emotion, and speaker information.
Last updated -
Python
MIT License
MCP FishAudio Server
da-okazaki
-
security
A
license
-
quality
An MCP (Model Context Protocol) server that provides seamless integration between Fish Audio's Text-to-Speech API and LLMs like Claude, enabling natural language-driven speech synthesis.
Last updated -
27
6
TypeScript
MIT License
melrōse musical expression player
emicklei
-
security
A
license
-
quality
melrōse musical expression player
Last updated -
192
Go
MIT License
MCP Play Sound Server
davidteren
-
security
A
license
-
quality
Provides audio playback functionality for AI agents, allowing them to play notification sounds when coding tasks are completed.
Last updated -
1
Python
MIT License
MCP Sound Tool
tijs
-
security
A
license
-
quality
A Model Context Protocol implementation that plays sound effects (completion, error, notification) for Cursor AI and other MCP-compatible environments, providing audio feedback for a more interactive coding experience.
Last updated -
1
Python
MIT License
Mureka MCP Server
SkyworkAI
-
security
A
license
-
quality
A Model Context Protocol server that enables AI assistants like Claude to generate lyrics, songs, and background music through Mureka's APIs.
Last updated -
45
Python
MIT License
REAPER MCP Server
shiehn
-
security
A
license
-
quality
A Model Context Protocol server that exposes REAPER digital audio workstation functionality through a clean API interface, enabling programmatic control of 169+ REAPER operations across track management, MIDI editing, effects, automation and more.
Last updated -
4
Python
MIT License
Audio MCP Server
GongRzhe
-
security
A
license
-
quality
Enables Claude and other AI assistants to interact with your computer's audio system, allowing for recording from microphones and playing audio through speakers.
Last updated -
3
Python
MIT License
ElevenLabs MCP Server
nguyendinhsinh361
-
security
A
license
-
quality
Official ElevenLabs Model Context Protocol server that enables AI assistants like Claude to interact with Text to Speech and audio processing APIs, allowing them to generate speech, clone voices, transcribe audio, and create soundscapes.
Last updated -
1
Python
MIT License
Audio Player MCP Server
Here-and-Tomorrow-LLC
-
security
A
license
-
quality
A server that allows Claude to control audio playback on your computer, supporting MP3, WAV, and OGG files with features like play, list, and stop commands.
Last updated -
3
Python
MIT License
Daisys MCP Server
daisys-ai
-
security
F
license
-
quality
A beta server that enables integration with Daisys.ai services via the Model Context Protocol (MCP), allowing AI clients like Claude Desktop and Cursor to use Daisys features through a standardized interface.
Last updated -
6
Python
MCP Voice/Text-Controlled Q-SYS Demo
charliem716
-
security
F
license
-
quality
An AI-powered system that enables voice and text control of Q-SYS audio systems using the OpenAI Agents SDK and the Model Context Protocol.
Last updated -
TypeScript
Sound Notification MCP
ks0318-p
-
security
F
license
-
quality
An MCP server that plays notification sounds when AI coding assistants like Windsurf or Cursor require user attention, such as when coding is complete or when user approval is needed.
Last updated -
9
2
TypeScript
Cursor Sound MCP
ericlistin
-
security
F
license
-
quality
Provides audio feedback by playing sound effects when Cursor AI completes code generation, creating a more interactive coding experience.
Last updated -
18
TypeScript
REAPER MCP Server
itsuzef
-
security
F
license
-
quality
A Model Context Protocol server that enables AI agents to create fully mixed and mastered tracks in REAPER DAW, supporting project management, MIDI composition, audio recording, and mixing automation.
Last updated -
10
Python
MCP MIDI Server
sandst1
-
security
F
license
-
quality
A FastMCP server that creates a virtual MIDI output port, allowing LLMs to generate and send MIDI data to any software that accepts MIDI input.
Last updated -
7
Python
FL Studio MCP
veenastudio
-
security
F
license
-
quality
An MCP server that connects Claude to FL Studio, allowing the AI to compose music, control instruments, and live record melodies, chords, and drums to the piano roll.
Last updated -
46
Python
Flyworks MCP
Flyworks-AI
-
security
-
license
-
quality
A Model Context Protocol server that provides a convenient interface for creating lipsynced videos by matching digital avatar videos with audio inputs.
Last updated -
1
Python
FluidSynth MCP Server
kimjune01
-
security
F
license
-
quality
A MIDI composition system that enables AI assistants to create music through FluidSynth, with capabilities for playing notes, creating melodies, managing tracks, and exporting audio.
Last updated -
Python
Typecast API MCP Server
neosapience
-
security
F
license
-
quality
Enables seamless integration with Typecast API through the Model Context Protocol, allowing clients to manage voices, convert text to speech, and play audio in a standardized way.
Last updated -
2
Python
BirdNet-Pi MCP Server
DMontgomery40
-
security
F
license
-
quality
A Python-based server that enables accessing and analyzing bird detection data through the Model Context Protocol, offering features like filtering detections, accessing audio recordings, and generating reports.
Last updated -
3
Python
Speech MCP Server
hammeiam
-
security
F
license
-
quality
A Model Context Protocol server that provides text-to-speech capabilities using the Kokoro TTS model, offering multiple voice options and customizable speech parameters.
Last updated -
31
1
JavaScript
FastAPI SSE MCP Random
hk4crprasad
-
security
F
license
-
quality
A FastAPI server implementing the Model Context Protocol (MCP) for structured tool use, providing utility tools including random number generation, image generation via Azure OpenAI DALL-E, and AI podcast generation.
Last updated -
Python
FL Studio MCP
ohhalim
-
security
F
license
-
quality
An MCP server that connects Claude to FL Studio, allowing the AI to send melodies, chords, and drum patterns directly to the DAW via virtual MIDI ports.
Last updated -
2
Python
