Server Configuration
The following environment variables are required to run the server.
Name | Required | Description | Default |
---|---|---|---|
GEMINI_API_KEY | Yes | Your Google Gemini API key | |
OUTPUT_IMAGE_PATH | Yes | Path to the folder where generated images will be saved | |
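
As a minimal sketch, a launcher script might check both variables before starting the server. The helper name `load_server_config` and the shape of the returned dictionary are illustrative assumptions, not part of the server's own code.

```python
import os
from pathlib import Path

# Names taken from the table above; both are marked as required.
REQUIRED_VARS = ("GEMINI_API_KEY", "OUTPUT_IMAGE_PATH")


def load_server_config(env=None):
    """Hypothetical helper: read and validate the required variables."""
    env = os.environ if env is None else env
    # Treat unset and empty values the same way: both count as missing.
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    return {
        "api_key": env["GEMINI_API_KEY"],
        "output_dir": Path(env["OUTPUT_IMAGE_PATH"]).expanduser(),
    }
```

Passing a plain dictionary instead of relying on `os.environ` keeps the check easy to test; calling `load_server_config()` with no arguments reads the real environment.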
Schema
Prompts
Interactive templates invoked by user choice
Name | Description |
---|---|
No prompts | |
Resources
Contextual data attached and managed by the client
Name | Description |
---|---|
No resources | |
Tools
Functions exposed to the LLM to take actions
Name | Description |
---|---|
generate_image_from_text | Generate an image from a text prompt using Google's Gemini model. Args: prompt (the user's text prompt describing the image to generate). Returns: path to the generated image file. |
transform_image_from_encoded | Transform an existing image based on a text prompt using Google's Gemini model. Args: encoded_image (Base64-encoded image data with a header, in the format "data:image/[format];base64,[data]", where [format] can be png, jpeg, jpg, gif, webp, etc.); prompt (text prompt describing the desired transformation or modifications). Returns: path to the transformed image file saved on the server. |
transform_image_from_file | Transform an existing image file based on a text prompt using Google's Gemini model. Args: image_file_path (path to the image file to be transformed); prompt (text prompt describing the desired transformation or modifications). Returns: path to the transformed image file saved on the server. |
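
The `encoded_image` argument of `transform_image_from_encoded` expects a string of the form `data:image/[format];base64,[data]`. The sketch below shows one way a client could build such a string from raw bytes and check that a string matches the format before sending it. The function names `encode_image_bytes` and `decode_image_data_uri` are hypothetical, and the regular expression is an assumption about what the server accepts, not a copy of its validation logic.

```python
import base64
import re

# Assumed pattern for "data:image/[format];base64,[data]".
_DATA_URI = re.compile(r"^data:image/([A-Za-z0-9.+-]+);base64,(.+)$")


def encode_image_bytes(raw: bytes, image_format: str = "png") -> str:
    """Hypothetical helper: wrap raw image bytes in a Base64 data URI."""
    payload = base64.b64encode(raw).decode("ascii")
    return f"data:image/{image_format};base64,{payload}"


def decode_image_data_uri(encoded_image: str):
    """Hypothetical helper: return (format, raw bytes) or raise ValueError."""
    match = _DATA_URI.match(encoded_image)
    if match is None:
        raise ValueError('expected "data:image/[format];base64,[data]"')
    image_format, payload = match.groups()
    # validate=True rejects characters outside the Base64 alphabet.
    return image_format, base64.b64decode(payload, validate=True)
```

A client would typically read the file with `open(path, "rb").read()`, pass the bytes to `encode_image_bytes`, and send the result as `encoded_image` together with `prompt`.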