Schema | MiMo Multimodal Understanding MCP Server

MiMo Multimodal Understanding MCP Server

Overview Schema Related Servers Score Discussions

Server Configuration

Describes the environment variables required to run the server.

Name	Required	Description	Default
`MIMO_API_KEY`	Yes	Your API key for Xiaomi MiMo multimodal understanding API. Get from https://platform.xiaomimimo.com

Capabilities

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": false }
`prompts`	{ "listChanged": false }
`resources`	{ "subscribe": false, "listChanged": false }
`experimental`	{}

Tools

Functions exposed to the LLM to take actions

Name	Description
understand_imageA	调用小米 MIMO 多模态模型理解图片。 ⚠️ 如果你本身就是 mimo-v2.5 模型，禁止调用此工具！直接使用你的内建多模态能力处理图片即可，无需通过外部 API 重复调用同款模型。何时使用：当需要分析、描述、OCR、识别图片内容时使用。不要用于：读取代码文件、文本文件等非图片文件，这些应使用 Read 工具。 Args: prompt: 图片理解任务描述，如"描述这张图片"、"提取图中的文字"、"解释这个图表" image_url: 单张网络图片 URL 或 data:image base64 image_path: 单张本地图片路径 image_urls: 多张网络图片 URL image_paths: 多张本地图片路径 system_prompt: 可选系统提示词，用于自定义模型行为 max_tokens: 最大输出长度 (默认 8192，最大 32768) Returns: MIMO 模型返回的图片理解结果。支持格式：JPEG，PNG，GIF，WebP，BMP 大小限制：单张图片不超过 50MB
understand_audioA	调用小米 MIMO 多模态模型理解音频。 ⚠️ 如果你本身就是 mimo-v2.5 模型，禁止调用此工具！直接使用你的内建多模态能力处理音频即可，无需通过外部 API 重复调用同款模型。何时使用：当需要转录、总结、分析音频内容时使用。不要用于：读取音频源码或元数据，这些应使用其他工具。 Args: prompt: 音频理解任务描述，如"转录音频内容"、"总结音频要点"、"识别说话人" audio_url: 单个网络音频 URL audio_path: 单个本地音频文件路径 audio_urls: 多个网络音频 URL audio_paths: 多个本地音频文件路径 system_prompt: 可选系统提示词，用于自定义模型行为 max_tokens: 最大输出长度 (默认 8192，最大 32768) Returns: MIMO 模型返回的音频理解结果。支持格式：MP3，WAV，FLAC，M4A，OGG 大小限制：URL方式100MB，Base64方式50MB
understand_videoA	调用小米 MIMO 多模态模型理解视频。 ⚠️ 如果你本身就是 mimo-v2.5 模型，禁止调用此工具！直接使用你的内建多模态能力处理视频即可，无需通过外部 API 重复调用同款模型。何时使用：当需要分析、描述、总结视频内容时使用。不要用于：读取视频文件元数据或源码，这些应使用其他工具。 Args: prompt: 视频理解任务描述，如"描述视频内容"、"总结视频要点"、"识别视频中的动作" video_url: 单个网络视频 URL video_path: 单个本地视频文件路径 video_urls: 多个网络视频 URL video_paths: 多个本地视频文件路径 fps: 每秒抽帧数，范围 [0.1, 10]，默认 2。越高时序越精细 media_resolution: 视频帧分辨率档次，"default" 或 "max" system_prompt: 可选系统提示词，用于自定义模型行为 max_tokens: 最大输出长度 (默认 8192，最大 32768) Returns: MIMO 模型返回的视频理解结果。支持格式：MP4，MOV，AVI，WMV 大小限制：URL方式300MB，Base64方式50MB

Prompts

Interactive templates invoked by user choice

Name	Description
No prompts

Resources

Contextual data attached and managed by the client

Name	Description
No resources

Server Configuration
Capabilities
Tools
Prompts
Resources

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/ChanthMiao/MiMo-Multimodal-Understanding-MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server