tool_search_logs
Search and filter evaluation logs by task, model, status, date range, and sample count to analyze performance data.
Instructions
Search and filter evaluation logs. Supported filters are task name, model, status, date range, and minimum sample count; all filters are optional.
The task and model filters accept wildcards (e.g., 'mind2web*', 'google/*').
Args:
- log_dir: Directory containing log files
- task: Filter by task name (supports wildcards like 'mind2web*')
- model: Filter by model name (supports wildcards like 'google/*')
- status: Filter by status: 'success', 'error', 'cancelled'
- date_from: Filter logs from this date (ISO format: YYYY-MM-DD)
- date_to: Filter logs until this date (ISO format: YYYY-MM-DD)
- min_samples: Minimum sample count
- limit: Maximum results (default: INSPECT_LOGS_MCP_DEFAULT_LIMIT or 50, max: INSPECT_LOGS_MCP_MAX_LIMIT or 500)
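To make the filter semantics concrete, here is a minimal sketch of how the wildcard, status, date, and sample-count filters could combine. It is not the tool's actual implementation: the `LogEntry` record, the `matches` helper, and the sample entries are hypothetical, shell-style globbing via `fnmatch` is an assumption about how the wildcards behave, and treating `date_to` as inclusive is also an assumption.

```python
from dataclasses import dataclass
from datetime import date
from fnmatch import fnmatchcase
from typing import Optional


@dataclass
class LogEntry:
    # Hypothetical record shape; the real log format is not described here.
    task: str
    model: str
    status: str
    created: date
    samples: int


def matches(
    entry: LogEntry,
    task: Optional[str] = None,
    model: Optional[str] = None,
    status: Optional[str] = None,
    date_from: Optional[str] = None,
    date_to: Optional[str] = None,
    min_samples: Optional[int] = None,
) -> bool:
    """Return True if the entry passes every filter that was supplied."""
    # Assumption: wildcards are shell-style globs, matched case-sensitively.
    if task is not None and not fnmatchcase(entry.task, task):
        return False
    if model is not None and not fnmatchcase(entry.model, model):
        return False
    if status is not None and entry.status != status:
        return False
    # Dates are ISO strings (YYYY-MM-DD); both bounds are assumed inclusive.
    if date_from is not None and entry.created < date.fromisoformat(date_from):
        return False
    if date_to is not None and entry.created > date.fromisoformat(date_to):
        return False
    if min_samples is not None and entry.samples < min_samples:
        return False
    return True


# Made-up entries, for illustration only.
logs = [
    LogEntry("mind2web_test", "google/example-model", "success", date(2024, 5, 2), 100),
    LogEntry("other_task", "vendor/example-model", "error", date(2024, 5, 3), 20),
]

hits = [
    e for e in logs
    if matches(e, task="mind2web*", model="google/*", date_from="2024-05-01")
]
```

In this sketch an omitted filter (`None`) matches everything, so calling `matches(entry)` with no filters always returns `True`.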
Input Schema
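| Name | Required | Description | Default |
|---|---|---|---|
| log_dir | No | Directory containing log files | |
| task | No | Filter by task name (supports wildcards like 'mind2web*') | |
| model | No | Filter by model name (supports wildcards like 'google/*') | |
| status | No | Filter by status: 'success', 'error', 'cancelled' | |
| date_from | No | Filter logs from this date (ISO format: YYYY-MM-DD) | |
| date_to | No | Filter logs until this date (ISO format: YYYY-MM-DD) | |
| min_samples | No | Minimum sample count | |
| limit | No | Maximum results (max: INSPECT_LOGS_MCP_MAX_LIMIT or 500) | INSPECT_LOGS_MCP_DEFAULT_LIMIT or 50 |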
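The `limit` default and cap come from two environment variables with fallbacks of 50 and 500. The sketch below shows one way such a value could be resolved. The `resolve_limit` helper is hypothetical, and clamping an oversized request to the maximum (rather than rejecting it) is an assumption, since the description does not say how out-of-range values are handled.

```python
import os
from typing import Mapping, Optional


def resolve_limit(requested: Optional[int], env: Mapping[str, str] = os.environ) -> int:
    """Pick the effective result limit from the request and the environment."""
    # Fallbacks of 50 and 500 are the values given in the tool description.
    default = int(env.get("INSPECT_LOGS_MCP_DEFAULT_LIMIT", "50"))
    maximum = int(env.get("INSPECT_LOGS_MCP_MAX_LIMIT", "500"))
    limit = default if requested is None else requested
    # Assumption: out-of-range values are clamped into [1, maximum].
    return max(1, min(limit, maximum))
```

Passing `env` explicitly keeps the helper easy to test without touching the real process environment.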