generate_multimodal_response
Generates an AI response from a text prompt combined with optional images or files, routing the request to one of the supported providers (OpenAI or DashScope) for multimodal content analysis.
Instructions
Generate response from multimodal model.
Args:
- model: Model name to use
- prompt: Text prompt
- image_urls: Optional list of image URLs
- file_paths: Optional list of local file paths
- system_prompt: Optional system prompt
- max_tokens: Maximum tokens to generate (default: 1000)
- temperature: Generation temperature (default: 0.7)
- provider: Optional provider name (openai, dashscope); auto-detected from the model name if omitted

Returns:
- Generated response text
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model | Yes | Model name to use | |
| prompt | Yes | Text prompt | |
| image_urls | No | Optional list of image URLs | |
| file_paths | No | Optional list of local file paths | |
| system_prompt | No | Optional system prompt | |
| max_tokens | No | Maximum tokens to generate | 1000 |
| temperature | No | Generation temperature | 0.7 |
| provider | No | Provider name (openai, dashscope); auto-detected from the model name if omitted | |
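
As a usage illustration, the sketch below calls this tool through the Python MCP client SDK over stdio. The server launch command (`python -m vllm_mcp.server`), the model name, and the image URL are assumptions; substitute whatever matches your deployment.

```python
import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

# Assumed launch command for the server; adjust to your installation.
server_params = StdioServerParameters(command="python", args=["-m", "vllm_mcp.server"])

async def main():
    async with stdio_client(server_params) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            result = await session.call_tool(
                "generate_multimodal_response",
                arguments={
                    "model": "qwen-vl-plus",  # example model; auto-detects the dashscope provider
                    "prompt": "Describe this image in one sentence.",
                    "image_urls": ["https://example.com/photo.jpg"],
                    "max_tokens": 300,
                },
            )
            # The tool returns plain text, wrapped in MCP content items.
            print(result.content)

asyncio.run(main())
```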
Implementation Reference
- src/vllm_mcp/server.py:131-252 (handler): The main handler function for the `generate_multimodal_response` tool. It is decorated with `@self.server.tool()`, processes input parameters, constructs a `MultimodalRequest`, delegates to the appropriate provider for generation, and formats the response.

```python
@self.server.tool()
def generate_multimodal_response(
    model: str,
    prompt: str,
    image_urls: Optional[List[str]] = None,
    file_paths: Optional[List[str]] = None,
    system_prompt: Optional[str] = None,
    max_tokens: Optional[int] = 1000,
    temperature: Optional[float] = 0.7,
    provider: Optional[str] = None
) -> str:
    """Generate response from multimodal model.

    Args:
        model: Model name to use
        prompt: Text prompt
        image_urls: Optional list of image URLs
        file_paths: Optional list of file paths
        system_prompt: Optional system prompt
        max_tokens: Maximum tokens to generate
        temperature: Generation temperature
        provider: Optional provider name (openai, dashscope)

    Returns:
        Generated response text
    """
    try:
        # Auto-detect provider if not specified
        if not provider:
            if model.startswith("gpt"):
                provider = "openai"
            elif model.startswith("qwen"):
                provider = "dashscope"
            else:
                provider = list(self.providers.keys())[0] if self.providers else None

        if not provider or provider not in self.providers:
            return f"Error: Provider '{provider}' not available"

        # Build multimodal request
        text_contents = [TextContent(text=prompt)]
        image_contents = []
        file_contents = []

        # Add image content
        if image_urls:
            for url in image_urls:
                image_contents.append(ImageContent(
                    url=url,
                    mime_type="image/jpeg"  # Default, will be updated if needed
                ))

        # Add file content
        if file_paths:
            for file_path in file_paths:
                path = Path(file_path)
                if path.exists():
                    import mimetypes
                    mime_type, _ = mimetypes.guess_type(file_path)
                    if mime_type and mime_type.startswith("image/"):
                        image_contents.append(ImageContent(
                            image_path=file_path,
                            mime_type=mime_type
                        ))
                    elif mime_type and mime_type.startswith("text/"):
                        with open(path, 'r', encoding='utf-8') as f:
                            content = f.read()
                        file_contents.append(FileContent(
                            filename=path.name,
                            text=content,
                            mime_type=mime_type
                        ))

        request = MultimodalRequest(
            model=model,
            text_contents=text_contents,
            image_contents=image_contents,
            file_contents=file_contents,
            system_prompt=system_prompt,
            max_tokens=max_tokens,
            temperature=temperature
        )

        # Generate response
        try:
            # Check if we're already in an event loop
            try:
                loop = asyncio.get_running_loop()
                # We're already in a loop, create a task
                task = asyncio.create_task(
                    self.providers[provider].generate_response(request)
                )
                # Wait for the task to complete
                while not task.done():
                    asyncio.sleep(0.01)
                response = task.result()
            except RuntimeError:
                # No running loop, create a new one
                loop = asyncio.new_event_loop()
                asyncio.set_event_loop(loop)
                response = loop.run_until_complete(
                    self.providers[provider].generate_response(request)
                )
                loop.close()

            if response.error:
                return f"Error: {response.error}"

            result = response.text
            if response.usage:
                result += f"\n\n[Token usage: {response.usage}]"

            return result
        finally:
            loop.close()
    except Exception as e:
        logger.error(f"Error generating response: {e}")
        return f"Error: {str(e)}"
```
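As the file-handling branch above shows, local `file_paths` are routed by guessed MIME type: image files become `ImageContent`, text files are read inline into `FileContent`, and anything else (for example a PDF) is silently skipped. A standalone illustration of that routing logic, with hypothetical file names:

```python
import mimetypes

for file_path in ["chart.png", "notes.txt", "report.pdf"]:  # hypothetical inputs
    mime_type, _ = mimetypes.guess_type(file_path)
    if mime_type and mime_type.startswith("image/"):
        route = "image_contents"
    elif mime_type and mime_type.startswith("text/"):
        route = "file_contents"
    else:
        route = "ignored"  # e.g. application/pdf falls through in the handler above
    print(f"{file_path}: {mime_type} -> {route}")
```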
- src/vllm_mcp/server.py:128-129 (registration): The `_setup_tools` method where the `generate_multimodal_response` tool is registered using the `@self.server.tool()` decorator.

```python
def _setup_tools(self):
    """Setup MCP tools."""
```
- src/vllm_mcp/models.py:50-63 (schema): Pydantic model used internally to structure the multimodal request passed to providers; provides input validation and typing.

```python
class MultimodalRequest(BaseModel):
    """Multimodal request model."""
    model: str = Field(..., description="Model name")
    text_contents: List[TextContent] = Field(default_factory=list, description="Text content list")
    image_contents: List[ImageContent] = Field(default_factory=list, description="Image content list")
    file_contents: List[FileContent] = Field(default_factory=list, description="File content list")
    system_prompt: Optional[str] = Field(None, description="System prompt")
    max_tokens: Optional[int] = Field(1000, description="Maximum tokens to generate")
    temperature: float = Field(0.7, description="Generation temperature")
    top_p: Optional[float] = Field(None, description="Top-p sampling")
    top_k: Optional[int] = Field(None, description="Top-k sampling")
    stream: bool = Field(False, description="Whether to stream response")
    extra_params: Dict[str, Any] = Field(default_factory=dict, description="Extra model parameters")
```
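A minimal sketch of building a request directly against this model. The field values are illustrative, and the import path follows the file reference above; the `TextContent` and `ImageContent` constructors mirror their use in the handler.

```python
from vllm_mcp.models import MultimodalRequest, TextContent, ImageContent

request = MultimodalRequest(
    model="gpt-4o",  # example model name
    text_contents=[TextContent(text="What is shown in this picture?")],
    image_contents=[ImageContent(url="https://example.com/cat.png", mime_type="image/png")],
    max_tokens=500,
    temperature=0.2,
)

# Pydantic validates types and fills defaults (stream=False, extra_params={}, ...).
print(request.model_dump())  # use .dict() on pydantic v1
```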
- Helper method in `OpenAIProvider` that performs the actual API call to generate multimodal responses, used by the main handler.

```python
async def generate_response(
    self,
    request: MultimodalRequest
) -> MultimodalResponse:
    """Generate response from OpenAI multimodal model.

    Args:
        request: Multimodal request containing text, images, and files

    Returns:
        Multimodal response
    """
    try:
        messages = self._build_messages(request)

        response = self.client.chat.completions.create(
            model=request.model,
            messages=messages,
            max_tokens=request.max_tokens,
            temperature=request.temperature,
            stream=False,
        )

        return self._parse_response(response)
    except openai.APIError as e:
        raise Exception(f"OpenAI API error: {e}")
    except Exception as e:
        raise Exception(f"Error generating response: {e}")
```
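For direct use outside the MCP server, this coroutine can be awaited with a `MultimodalRequest`. A sketch, assuming an already constructed `OpenAIProvider` instance (its constructor arguments are not shown in this reference); the response fields match those consumed by the handler above.

```python
async def run(provider, request):
    # provider: an OpenAIProvider instance; request: a MultimodalRequest as built above
    response = await provider.generate_response(request)
    if response.error:
        print(f"Error: {response.error}")
    else:
        print(response.text)
        if response.usage:
            print(f"[Token usage: {response.usage}]")
```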