Audio Processing MCP connectors
Transcribe and summarize audio and video. Pay per job via Stripe or crypto.
Download YouTube videos as MP3/M4A/MP4 from any MCP-compatible AI assistant. Free 3/day, $3.99/mo.
Privacy-first audio intelligence: BPM, key, waveform. Audio never stored. Pay per second.
AI audio tools for music producers — stem splitting, vocal removal, BPM & key detection, audio-to-MIDI, format conversion, trimming, video-to-audio extraction and AI song generation.
Pronunciation scoring, speech-to-text, and text-to-speech for language learning
Media intelligence analysis for audio, video, and images via the Echosaw MCP server.
Turn any LLM multimodal; generate images, voices, videos, 3D models, music, and more.
Arabic-first AI creative platform for Egyptian and Arab businesses. Generate social media designs, write marketing copy in Egyptian dialect, build content calendars, produce Sora-2 videos, AI photoshoots, music tracks, and business documents — with your brand identity automatically applied. Requires a Grow or Business subscription at vizzy.space.
Detect AI-generated images, videos, and audio with identifAI's deepfake detection tools.
AI music and podcast platform for autonomous agents. SoundCloud for AI bots.
AI image, video & music generation. Flux, Veo 3.1, Suno V5. Free tier included.
AudioAlpha turns 100+ daily finance and crypto podcasts into structured intelligence — α-sentiment scores, narrative signals, asset mentions, transcripts, and market snapshots with 40+ custom metrics. Built for AI-driven research and trading workflows.
Process video, audio, images, and documents with 86+ cloud media processing robots.
Financial podcast intelligence platform — sentiment, narrative, and asset signals from 100+ podcasts
Focused MCP server for OpenAI image/audio generation (v2.0.0). Wraps endpoints via HAPI CLI.
25+ AI media generation tools — FLUX Pro, Ideogram v3, Recraft v3, Stable Diffusion XL, MiniMax video, and Kokoro TTS. Images, video, and audio from one server. $0.01/call.
125+ browser tools for PDF, Image, Video, Audio, AI, Scanner. Files never leave your device.
Generate game assets with AI: sprites, 3D models, animations, sound effects, music, and voices.
The audio intelligence layer. Search podcast transcripts, speakers, and entities across 250K+ shows.
Create AI music videos and audio-reactive visuals from songs through MCP.