get_latency_analytics
Analyze latency performance over time by retrieving time-series data with average, p50, p90, and p99 percentiles. Filter results by time range, token counts, costs, status codes, and other parameters to identify performance patterns.
Instructions
Retrieve latency analytics as time-series data, showing average, p50, p90, and p99 latency percentiles over time
Input Schema
TableJSON Schema
| Name | Required | Description | Default |
|---|---|---|---|
| time_of_generation_min | Yes | Start time for the analytics period (ISO8601 format, e.g., '2024-01-01T00:00:00Z') | |
| time_of_generation_max | Yes | End time for the analytics period (ISO8601 format, e.g., '2024-02-01T00:00:00Z') | |
| total_units_min | No | Minimum number of total tokens to filter by | |
| total_units_max | No | Maximum number of total tokens to filter by | |
| cost_min | No | Minimum cost in cents to filter by | |
| cost_max | No | Maximum cost in cents to filter by | |
| prompt_token_min | No | Minimum number of prompt tokens | |
| prompt_token_max | No | Maximum number of prompt tokens | |
| completion_token_min | No | Minimum number of completion tokens | |
| completion_token_max | No | Maximum number of completion tokens | |
| status_code | No | Filter by specific HTTP status codes (comma-separated) | |
| weighted_feedback_min | No | Minimum weighted feedback score (-10 to 10) | |
| weighted_feedback_max | No | Maximum weighted feedback score (-10 to 10) | |
| virtual_keys | No | Filter by specific virtual key slugs (comma-separated) | |
| configs | No | Filter by specific config slugs (comma-separated) | |
| workspace_slug | No | Filter by specific workspace | |
| api_key_ids | No | Filter by specific API key UUIDs (comma-separated) | |
| metadata | No | Filter by metadata (stringified JSON object) | |
| ai_org_model | No | Filter by AI provider and model (comma-separated, use __ as separator) | |
| trace_id | No | Filter by trace IDs (comma-separated) | |
| span_id | No | Filter by span IDs (comma-separated) |