What can you do with this server?

The Minimax MCP Tools server provides AI-powered capabilities through the Model Context Protocol:

Image Generation: Create high-quality images from text prompts with customizable aspect ratio, number of images, and subject reference images for character consistency.
Text-to-Speech: Convert text to natural-sounding speech with customization options including voice selection, emotion, speed, volume, pitch, and audio format settings (sample rate, bitrate, channels).
Advanced Features: Use voice mixing (timbre weights), LaTeX reading, pronunciation dictionaries, streaming mode, language boosting for improved accuracy, and subtitle generation for accessibility.
Integration: Works with the Windsurf and Cursor editors via MCP server configuration.

Which integrations are available for this server?

LaTeX: The text-to-speech functionality can read LaTeX formulas aloud, with configurable pronunciation options.
Node.js: Runtime environment required by the MCP server; version 16 or higher is a prerequisite.

How do I use Minimax MCP Tools?

1. Click on "Install Server".
2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Minimax MCP Tools generate an image of a futuristic city skyline at sunset".

That's it! The server will respond to your query, and you can continue using it as needed.

Minimax MCP Tools

A Model Context Protocol (MCP) server for Minimax AI integration, providing async image generation and text-to-speech with advanced rate limiting and error handling.

MCP Configuration

Add to your MCP settings:

{
  "mcpServers": {
    "minimax-mcp-tools": {
      "command": "npx",
      "args": ["minimax-mcp-tools"],
      "env": {
        "MINIMAX_API_KEY": "your_api_key_here"
      }
    }
  }
}
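
If you maintain the settings file with a script rather than by hand, the entry above can be merged in programmatically. The following is a minimal sketch, assuming the settings file is plain JSON with a top-level `mcpServers` object. The helper name `add_minimax_server` is hypothetical, and where the settings file lives depends on your editor.

```python
import json


def add_minimax_server(settings: dict, api_key: str) -> dict:
    """Return a copy of `settings` with the minimax-mcp-tools entry added.

    Hypothetical helper: only the shape of the entry is taken from the
    configuration shown above.
    """
    merged = json.loads(json.dumps(settings))  # deep copy via a JSON round trip
    servers = merged.setdefault("mcpServers", {})
    servers["minimax-mcp-tools"] = {
        "command": "npx",
        "args": ["minimax-mcp-tools"],
        "env": {"MINIMAX_API_KEY": api_key},
    }
    return merged


if __name__ == "__main__":
    # Example: an existing settings object with one unrelated server.
    existing = {"mcpServers": {"other-server": {"command": "node", "args": ["server.js"]}}}
    print(json.dumps(add_minimax_server(existing, "your_api_key_here"), indent=2))
```

The helper returns a new object instead of mutating its input, so existing server entries are preserved and the original settings stay untouched until you write the result back.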

Async Design - Perfect for Content Production at Scale

This MCP server uses an asynchronous submit-and-barrier pattern designed for batch content creation:

🎬 Narrated Slideshow Production - Generate dozens of slide images and corresponding narration in parallel
📚 AI-Driven Audiobook Creation - Produce chapters with multiple voice characters simultaneously
🖼️ Website Asset Generation - Create consistent visual content and audio elements for web projects
🎯 Multimedia Content Pipelines - Perfect for LLM-driven content workflows requiring both visuals and audio

Architecture Benefits:

Submit Phase: Tools return immediately with task IDs while the tasks execute in the background
Smart Rate Limiting: Adaptive rate limiting (10 RPM images, 20 RPM speech) with burst capacity
Barrier Synchronization: task_barrier waits for all tasks and returns comprehensive results
Batch Optimization: Submit multiple tasks to saturate rate limits, then barrier once for maximum throughput
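
To make "rate limiting with burst capacity" concrete, here is a minimal token-bucket sketch in Python. It is not the server's implementation: the class name, the burst size, and the injectable clock are assumptions for illustration, and the adaptive behavior mentioned above is left out. Only the 10 RPM figure comes from the list above.

```python
import time


class TokenBucket:
    """Illustrative token bucket: a steady refill rate plus a burst allowance."""

    def __init__(self, rate_per_minute: float, burst: int, clock=time.monotonic):
        self.refill_per_second = rate_per_minute / 60.0
        self.capacity = burst          # maximum tokens that can accumulate
        self.tokens = float(burst)     # start full so an initial burst is allowed
        self.clock = clock             # injectable for testing
        self.last = clock()

    def _refill(self) -> None:
        now = self.clock()
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_second)
        self.last = now

    def try_acquire(self) -> bool:
        """Take one token if available; return False when the caller must wait."""
        self._refill()
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False

    def seconds_until_available(self) -> float:
        """How long until the next token exists (0.0 if one is ready now)."""
        self._refill()
        if self.tokens >= 1.0:
            return 0.0
        return (1.0 - self.tokens) / self.refill_per_second


if __name__ == "__main__":
    images = TokenBucket(rate_per_minute=10, burst=3)  # burst of 3 is an assumed value
    print([images.try_acquire() for _ in range(4)])
    print(round(images.seconds_until_available(), 1))
```

With a bucket like this, a batch of submissions drains the burst allowance at once and the remaining requests are released at the steady rate, which is why submitting everything first and calling the barrier once keeps the limiter saturated.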

Tools

`submit_image_generation`

Submit Image Generation Task - Generate images asynchronously.

Required: prompt, outputFile
Optional: aspectRatio, customSize, seed, subjectReference, style
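
MCP clients invoke a tool by sending a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a request for `submit_image_generation` and checks the two required arguments first. The helper name, the example prompt, the output path, and the `"16:9"` value for `aspectRatio` are illustrative assumptions; the accepted values for the optional parameters are not listed here.

```python
import json

REQUIRED_IMAGE_ARGS = ("prompt", "outputFile")  # from the tool description above


def build_image_call(request_id: int, **arguments) -> dict:
    """Build a JSON-RPC 2.0 `tools/call` request for submit_image_generation.

    Hypothetical helper; raises ValueError if a required argument is
    missing or empty.
    """
    missing = [name for name in REQUIRED_IMAGE_ARGS if not arguments.get(name)]
    if missing:
        raise ValueError(f"missing required argument(s): {', '.join(missing)}")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "submit_image_generation", "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_image_call(
        1,
        prompt="a futuristic city skyline at sunset",
        outputFile="/tmp/skyline.png",  # assumed example path
        aspectRatio="16:9",             # assumed example value
    )
    print(json.dumps(request, indent=2))
```

In practice an MCP client library or the editor builds this envelope for you; the sketch only shows which fields carry the tool name and its arguments.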

`submit_speech_generation`

Submit Speech Generation Task - Convert text to speech asynchronously.

Required: text, outputFile
Optional: highQuality, voiceId, speed, volume, pitch, emotion, format, sampleRate, bitrate, languageBoost, intensity, timbre, sound_effects

`task_barrier`

Wait for Task Completion - Wait for ALL submitted tasks to complete and retrieve results. Essential for batch processing.
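
The submit-and-barrier flow can be sketched with Python's `asyncio`. This is a toy model, not the server's code: `TaskManager`, `fake_generate`, and the file names are hypothetical, and the remote API call is replaced by a short sleep. The task ID format follows the style used in the architecture diagram.

```python
import asyncio
import itertools


class TaskManager:
    """Toy submit-and-barrier manager; not the server's actual code."""

    def __init__(self) -> None:
        self._counter = itertools.count(1)
        self._tasks = {}  # task ID -> asyncio.Task

    def submit(self, kind: str, work) -> str:
        """Schedule the coroutine `work` in the background and return its ID at once."""
        task_id = f"{kind}-{next(self._counter):03d}"
        self._tasks[task_id] = asyncio.create_task(work)
        return task_id

    async def barrier(self) -> dict:
        """Wait for every submitted task; map each ID to its result or its exception."""
        ids = list(self._tasks)
        outcomes = await asyncio.gather(*self._tasks.values(), return_exceptions=True)
        self._tasks.clear()
        return dict(zip(ids, outcomes))


async def fake_generate(path: str, delay: float) -> str:
    await asyncio.sleep(delay)  # stand-in for the remote API call
    return path


async def main() -> dict:
    tm = TaskManager()
    tm.submit("img", fake_generate("slide1.png", 0.02))
    tm.submit("tts", fake_generate("slide1.mp3", 0.01))
    tm.submit("img", fake_generate("slide2.png", 0.02))
    return await tm.barrier()  # one wait covers all three tasks


if __name__ == "__main__":
    print(asyncio.run(main()))
```

Because `gather` is called with `return_exceptions=True`, a single failed task does not hide the results of the others; the barrier reports every outcome in submission order.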

Architecture

sequenceDiagram
    participant User
    participant MCP as MCP Server
    participant TM as Task Manager
    participant API as Minimax API

    Note over User, API: Async Submit-and-Barrier Pattern

    User->>MCP: submit_image_generation(prompt1)
    MCP->>TM: submitImageTask()
    TM-->>MCP: taskId: img-001
    MCP-->>User: "Task img-001 submitted"
    
    par Background Execution (Rate Limited)
        TM->>API: POST /image/generate
        API-->>TM: image data + save file
    end

    User->>MCP: submit_speech_generation(text1)
    MCP->>TM: submitTTSTask()
    TM-->>MCP: taskId: tts-002
    MCP-->>User: "Task tts-002 submitted"
    
    par Background Execution (Rate Limited)
        TM->>API: POST /speech/generate
        API-->>TM: audio data + save file
    end

    User->>MCP: submit_image_generation(prompt2)
    MCP->>TM: submitImageTask()
    TM-->>MCP: taskId: img-003
    MCP-->>User: "Task img-003 submitted"

    par Background Execution (Rate Limited)
        TM->>API: POST /image/generate (queued)
        API-->>TM: image data + save file
    end

    User->>MCP: task_barrier()
    MCP->>TM: barrier()
    TM->>TM: wait for all tasks
    TM-->>MCP: results summary
    MCP-->>User: ✅ All tasks completed<br/>Files available at specified paths

    Note over User, API: Immediate Task Submission + Background Rate-Limited Execution

License

MIT
