The Minimax MCP Tools server provides AI-powered capabilities through the Model Context Protocol:
Image Generation: Create high-quality images from text prompts with customizable aspect ratio, number of images, and subject reference images for character consistency.
Text-to-Speech: Convert text to natural-sounding speech with extensive customization options including voice selection, emotion, speed, volume, pitch, and audio format settings (sample rate, bitrate, channels).
Advanced Features: Utilize voice mixing (timbre weights), LaTeX reading, pronunciation dictionaries, streaming mode, language boosting for improved accuracy, and subtitle generation for accessibility.
Integration: Works seamlessly with Windsurf and Cursor editors via MCP server configuration.
LaTeX Support: The text-to-speech functionality can read LaTeX formulas, with configurable pronunciation options.
The MCP server requires a runtime environment of version 16 or higher as a prerequisite.
Minimax MCP Tools

A Model Context Protocol (MCP) server for Minimax AI integration, providing async image generation and text-to-speech with advanced rate limiting and error handling.
MCP Configuration
Add to your MCP settings:
Async Design - Perfect for Content Production at Scale
This MCP server uses an asynchronous submit-and-barrier pattern designed for batch content creation:
🎬 Narrated Slideshow Production - Generate dozens of slide images and corresponding narration in parallel
📚 AI-Driven Audiobook Creation - Produce chapters with multiple voice characters simultaneously
🖼️ Website Asset Generation - Create consistent visual content and audio elements for web projects
🎯 Multimedia Content Pipelines - Perfect for LLM-driven content workflows requiring both visuals and audio
Architecture Benefits:
Submit Phase: Tools return immediately with task IDs while the tasks execute in the background
Smart Rate Limiting: Adaptive rate limiting (10 RPM images, 20 RPM speech) with burst capacity
Barrier Synchronization: task_barrier waits for all tasks and returns comprehensive results
Batch Optimization: Submit multiple tasks to saturate rate limits, then barrier once for maximum throughput
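One common way to enforce per-minute limits with burst capacity, such as the 10 RPM and 20 RPM figures above, is a token bucket. The following is a minimal sketch and not the server's actual implementation: the `TokenBucket` class, the burst size, and the injectable clock are hypothetical names and values introduced for illustration, and the adaptive behavior is not modeled.

```typescript
// Hypothetical token bucket: refills at `ratePerMinute`, holds at most `burst` tokens.
class TokenBucket {
  private tokens: number;
  private lastRefill: number;

  constructor(
    private readonly ratePerMinute: number,
    private readonly burst: number,
    private readonly now: () => number = Date.now, // clock in ms, injectable for tests
  ) {
    this.tokens = burst; // start full so an initial burst is allowed
    this.lastRefill = now();
  }

  // Credit tokens for the time elapsed since the last refill, capped at `burst`.
  private refill(): void {
    const current = this.now();
    const elapsedMs = current - this.lastRefill;
    this.tokens = Math.min(this.burst, this.tokens + (elapsedMs * this.ratePerMinute) / 60000);
    this.lastRefill = current;
  }

  // Take one token if available; false means the caller should wait.
  tryAcquire(): boolean {
    this.refill();
    if (this.tokens >= 1) {
      this.tokens -= 1;
      return true;
    }
    return false;
  }

  // Milliseconds until the next token is available (0 if one is ready now).
  msUntilNextToken(): number {
    this.refill();
    if (this.tokens >= 1) return 0;
    return Math.ceil(((1 - this.tokens) * 60000) / this.ratePerMinute);
  }
}

// 10 RPM comes from the text above; the burst size of 3 is an assumption.
const imageLimiter = new TokenBucket(10, 3);
console.log(imageLimiter.tryAcquire()); // true: the bucket starts full
```

A caller that receives `false` can sleep for `msUntilNextToken()` milliseconds before retrying, which keeps background tasks queued instead of failing them.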
Tools
submit_image_generation
Submit Image Generation Task - Generate images asynchronously.
Required: prompt, outputFile
Optional: aspectRatio, customSize, seed, subjectReference, style
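As a rough illustration, the arguments for this tool could be modeled as a TypeScript type. Only the parameter names come from this README. The value types, the example values, and the `missingRequired` helper are assumptions made for the sketch.

```typescript
// Parameter names are from the README; every value type below is an assumption.
type ImageGenerationArgs = {
  prompt: string; // required
  outputFile: string; // required
  aspectRatio?: string; // assumed format, e.g. "16:9"
  customSize?: { width: number; height: number }; // assumed shape
  seed?: number;
  subjectReference?: string; // assumed: path or URL of a reference image
  style?: unknown; // shape not documented here
};

const REQUIRED_IMAGE_FIELDS = ["prompt", "outputFile"] as const;

// Hypothetical helper: list the required fields that are absent or empty.
function missingRequired(args: Record<string, unknown>, required: readonly string[]): string[] {
  return required.filter((key) => {
    const value = args[key];
    return value === undefined || value === null || value === "";
  });
}

const imageArgs: ImageGenerationArgs = {
  prompt: "A lighthouse at dusk",
  outputFile: "slides/01.png",
  aspectRatio: "16:9",
};
console.log(missingRequired(imageArgs, REQUIRED_IMAGE_FIELDS)); // []
```

The same helper would apply to submit_speech_generation with `text` and `outputFile` as the required fields.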
submit_speech_generation
Submit Speech Generation Task - Convert text to speech asynchronously.
Required: text, outputFile
Optional: highQuality, voiceId, speed, volume, pitch, emotion, format, sampleRate, bitrate, languageBoost, intensity, timbre, sound_effects
task_barrier
Wait for Task Completion - Wait for ALL submitted tasks to complete and retrieve results. Essential for batch processing.
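The submit-then-barrier flow can be sketched with a small in-memory registry. This is an illustrative model of the pattern, not the server's code: `TaskRegistry`, `submit`, `barrier`, and the `task-N` ID format are hypothetical, and the stand-in jobs replace real image or speech API calls.

```typescript
type TaskResult<T> =
  | { id: string; status: "fulfilled"; value: T }
  | { id: string; status: "rejected"; error: string };

class TaskRegistry<T> {
  private nextId = 1;
  private pending: Promise<TaskResult<T>>[] = [];

  // Submit phase: start the job in the background and return an ID right away.
  submit(job: () => Promise<T>): string {
    const id = `task-${this.nextId++}`;
    // Capture both outcomes now so a failure before the barrier is never unhandled.
    const settled = job().then(
      (value): TaskResult<T> => ({ id, status: "fulfilled", value }),
      (reason): TaskResult<T> => ({ id, status: "rejected", error: String(reason) }),
    );
    this.pending.push(settled);
    return id;
  }

  // Barrier phase: wait for ALL submitted tasks, return every outcome, then reset.
  async barrier(): Promise<TaskResult<T>[]> {
    const batch = this.pending;
    this.pending = [];
    return Promise.all(batch);
  }
}

async function main(): Promise<void> {
  const registry = new TaskRegistry<string>();
  // Stand-in jobs; a real server would call the image or speech API here.
  registry.submit(async () => "slide-01.png");
  registry.submit(async () => "narration-01.mp3");
  registry.submit(async () => {
    throw new Error("rate limited");
  });
  const results = await registry.barrier(); // one wait for the whole batch
  for (const r of results) console.log(r.id, r.status);
}
void main();
```

Because each task's failure is recorded instead of thrown, one rejected task does not hide the results of the others in the batch.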
Architecture
License
MIT