extract_image_from_file
Extract and analyze images from local file paths for visual content understanding, OCR text extraction, and object recognition in screenshots, photos, diagrams, and documents.
Instructions
Extract and analyze images from local file paths. Supports visual content understanding, OCR text extraction, and object recognition for screenshots, photos, diagrams, and documents.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
file_path | Yes | Path to the image file to analyze (supports screenshots, photos, diagrams, and documents in PNG, JPG, GIF, WebP formats) | |
max_height | No | For backward compatibility only. Default maximum height is now 512px | |
max_width | No | For backward compatibility only. Default maximum width is now 512px | |
resize | No | For backward compatibility only. Images are always automatically resized to optimal dimensions (max 512x512) for LLM analysis |