list_windows | List all available windows for screenshot capture.
Returns:
JSON string containing list of windows with their IDs, titles, and properties. |
capture_window | Capture screenshot of a specific window.
Args:
window_id: Window ID to capture (from list_windows)
output_path: Optional path to save the screenshot
Returns:
JSON string with capture results and file path. |
capture_full_screen | Capture screenshot of the entire screen.
Args:
output_path: Optional path to save the screenshot
Returns:
JSON string with capture results and file path. |
capture_document_pages | Capture multiple pages from a document window with automatic navigation.
Args:
window_id: Window ID containing the document
page_count: Number of pages to capture
output_dir: Directory to save captured pages
navigation_key: Key to press for navigation (Page_Down, Right, space)
delay_seconds: Delay between navigation and capture
Returns:
JSON string with capture results and list of captured files. |
process_images | Process images in a directory with various operations.
Args:
image_dir: Directory containing images to process
operations: List of operations (enhance, ocr, resize, deduplicate)
ocr_language: Language for OCR processing (default: eng)
Returns:
JSON string with processing results. |
convert_to_pdf | Convert a list of images to a PDF document.
Args:
image_paths: List of image file paths to convert
output_path: Path for the output PDF file
title: Optional title for the PDF document
sort_files: Whether to sort files by name before conversion
Returns:
JSON string with conversion results. |
directory_to_pdf | Convert all images in a directory to a PDF document.
Args:
image_dir: Directory containing images
output_path: Path for the output PDF file
title: Optional title for the PDF document
pattern: File pattern to match (default: all files)
Returns:
JSON string with conversion results. |
full_document_workflow | Complete workflow: capture document pages, optionally process them, and convert to PDF.
Args:
window_id: Window ID containing the document
page_count: Number of pages to capture
output_pdf: Path for the final PDF file
capture_dir: Temporary directory for captures
title: Optional PDF title
navigation_key: Key for page navigation
delay_seconds: Delay between navigation and capture
process_images_flag: Whether to enhance images before PDF conversion
Returns:
JSON string with complete workflow results. |
check_system_dependencies | Check if all required system dependencies are installed.
Returns:
JSON string with dependency check results. |
debug_window_detection | Comprehensive debugging information for window detection issues.
Returns:
JSON string with detailed diagnostics about PowerShell environment,
process enumeration, and window detection capabilities. |