generate_multimodal_response
Generate AI responses using text prompts combined with images or files from multiple providers to create multimodal content and analysis.
Instructions
Generate response from multimodal model.
Args:
model: Model name to use
prompt: Text prompt
image_urls: Optional list of image URLs
file_paths: Optional list of file paths
system_prompt: Optional system prompt
max_tokens: Maximum tokens to generate
temperature: Generation temperature
provider: Optional provider name (openai, dashscope)
Returns:
Generated response text
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| model | Yes | ||
| prompt | Yes | ||
| image_urls | No | ||
| file_paths | No | ||
| system_prompt | No | ||
| max_tokens | No | ||
| temperature | No | ||
| provider | No |
Input Schema (JSON Schema)
{
"properties": {
"file_paths": {
"anyOf": [
{
"items": {
"type": "string"
},
"type": "array"
},
{
"type": "null"
}
],
"default": null,
"title": "File Paths"
},
"image_urls": {
"anyOf": [
{
"items": {
"type": "string"
},
"type": "array"
},
{
"type": "null"
}
],
"default": null,
"title": "Image Urls"
},
"max_tokens": {
"anyOf": [
{
"type": "integer"
},
{
"type": "null"
}
],
"default": 1000,
"title": "Max Tokens"
},
"model": {
"title": "Model",
"type": "string"
},
"prompt": {
"title": "Prompt",
"type": "string"
},
"provider": {
"anyOf": [
{
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"title": "Provider"
},
"system_prompt": {
"anyOf": [
{
"type": "string"
},
{
"type": "null"
}
],
"default": null,
"title": "System Prompt"
},
"temperature": {
"anyOf": [
{
"type": "number"
},
{
"type": "null"
}
],
"default": 0.7,
"title": "Temperature"
}
},
"required": [
"model",
"prompt"
],
"type": "object"
}