multimodal-mcp
Server Configuration
Describes the environment variables required to run the server.
| Name | Required | Description | Default |
|---|---|---|---|
| BFL_API_KEY | No | BFL API key — enables image generation and editing | |
| XAI_API_KEY | No | xAI API key — enables image and video generation | |
| GEMINI_API_KEY | No | Gemini API key — enables image, video, and audio generation | |
| GOOGLE_API_KEY | No | Alias for GEMINI_API_KEY | |
| OPENAI_API_KEY | No | OpenAI API key — enables image, video, audio generation, and transcription | |
| MEDIA_OUTPUT_DIR | No | Directory for saved media files. Defaults to current working directory | |
| ELEVENLABS_API_KEY | No | ElevenLabs API key — enables audio generation and transcription |
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tools | {
"listChanged": true
} |
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| generate_imageB | Generate an image from a text prompt using AI. Providers: openai (DALL-E), xai (Aurora), google (Imagen), bfl (FLUX). Available: none configured |
| edit_imageB | Edit an existing image using AI. Provide the path to an image and a text prompt describing the desired edits. Providers: openai, xai, google, bfl (FLUX Kontext). Available: none configured |
| generate_videoC | Generate a video from a text prompt using AI. Optionally provide an image as the first frame. Available providers: none configured |
| generate_audioB | Generate audio from text using AI. Supports text-to-speech and sound effects. Providers: openai, google, elevenlabs. ElevenLabs: use providerOptions.mode = "sound-effect" for sound effects. Available: none configured |
| transcribe_audioA | Transcribe audio to text using AI (speech-to-text). Providers: openai (Whisper), elevenlabs (Scribe). Available: none configured |
| list_providersA | List all configured media generation providers and their capabilities |
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
No resources | |
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/rsmdt/multimodal-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server