Skip to main content
Glama
zilliztech

Zilliz MCP Server

Official
by zilliztech

query_cluster_metrics

Retrieve performance metrics for a Zilliz Cloud cluster, including compute usage, storage, request counts, latency, and success rates over specified time periods.

Instructions

Query the metrics of a specific cluster.

Args:
    cluster_id: ID of the target cluster
    start: Starting date and time in ISO 8601 timestamp format (optional, use with end)
    end: Ending date and time in ISO 8601 timestamp format (optional, use with start)
    period: Duration in ISO 8601 duration format (optional, use when start/end not set)
    granularity: Time interval for metrics reporting in ISO 8601 duration format (minimum PT30S)
    metric_queries: List of metric queries, each containing 'metricName' and 'stat' fields
        - metricName: Name of the metric. Available options:
            * CU_COMPUTATION - Compute unit computation usage
            * CU_CAPACITY - Compute unit capacity
            * STORAGE_USE - Storage usage
            * REQ_INSERT_COUNT - Insert request count
            * REQ_BULK_INSERT_COUNT - Bulk insert request count
            * REQ_UPSERT_COUNT - Upsert request count
            * REQ_DELETE_COUNT - Delete request count
            * REQ_SEARCH_COUNT - Search request count
            * REQ_QUERY_COUNT - Query request count
            * VECTOR_REQ_INSERT_COUNT - Vector insert request count
            * VECTOR_REQ_UPSERT_COUNT - Vector upsert request count
            * VECTOR_REQ_SEARCH_COUNT - Vector search request count
            * REQ_INSERT_LATENCY_P99 - Insert request latency P99
            * REQ_BULK_INSERT_LATENCY_P99 - Bulk insert request latency P99
            * REQ_UPSERT_LATENCY_P99 - Upsert request latency P99
            * REQ_DELETE_LATENCY_P99 - Delete request latency P99
            * REQ_SEARCH_LATENCY_P99 - Search request latency P99
            * REQ_QUERY_LATENCY_P99 - Query request latency P99
            * REQ_SUCCESS_RATE - Request success rate
            * REQ_FAIL_RATE - Request failure rate
            * REQ_FAIL_RATE_INSERT - Insert request failure rate
            * REQ_FAIL_RATE_BULK_INSERT - Bulk insert request failure rate
            * REQ_FAIL_RATE_UPSERT - Upsert request failure rate
            * REQ_FAIL_RATE_DELETE - Delete request failure rate
            * REQ_FAIL_RATE_SEARCH - Search request failure rate
            * REQ_FAIL_RATE_QUERY - Query request failure rate
            * ENTITIES_LOADED - Number of loaded entities
            * ENTITIES_INSERT_RATE - Entity insert rate
            * COLLECTIONS_COUNT - Collection count
            * ENTITIES_COUNT - Total entity count
        - stat: Statistical method (AVG for average, P99 for 99th percentile - P99 only valid for latency metrics)
Returns:
    Dict containing cluster metrics data
    Example:
    {
        "code": 0,
        "data": {
            "results": [
                {
                    "name": "CU_COMPUTATION",
                    "stat": "AVG", 
                    "unit": "percent",
                    "values": [
                        {
                            "timestamp": "2024-06-30T16:09:53Z",
                            "value": "1.00"
                        }
                    ]
                }
            ]
        }
    }
    

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
cluster_idYes
startNo
endNo
periodNo
granularityNoPT30S
metric_queriesNo

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/zilliztech/zilliz-mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server