Speech Processing MCP connectors
Pronunciation scoring, speech-to-text, and text-to-speech for language learning
AI voice agents on SMB websites — fully autonomous build in 2–3 min. 23 MCP tools. EU, GDPR.
OCR, transcription, file extraction, and image generation for AI agents via MCP.
An MCP server that fetches video transcripts/subtitles, with pagination for large responses. Supports YouTube, Twitter/X, Instagram, TikTok, Twitch, Vimeo, Facebook, Bilibili, VK, Dailymotion, Reddit. Whisper fallback — transcribes audio when subtitles are unavailable.
Give AI agents real phone numbers, messages, and voice calls via MCP.
AI-powered calorie tracking with photo recognition, barcode scanning, and voice logging
Access your Cosmonote audio notes, transcriptions, summaries, and action items.
Create and manage AI voice agents, real-time conversations, and analytics with eigi.ai
Free hosted API serving 10 professional AI voice clones powered by ElevenLabs. Browse, search, and get platform-ready configurations for voice integration across 29 platforms. Endpoints include voice listing, search by keyword/language/use-case, natural language recommendations with platform-specific configs, audio previews, and OpenAPI documentation. Zero authentication required, zero integration fee.
Give your AI a face, a voice, and a personality. 3D avatars with custom personas.
Voice-led, FSRS-scheduled flashcards from YouTube, PDFs, web, or text. Auto-graded quizzes.
Search recordings, summarize meetings, create clips, and automate workflows from your AI assistant.
YouTube video search with transcript extraction as first-class output.