llama_metrics
Retrieve Prometheus-compatible metrics from llama.cpp, including tokens processed and latency, for monitoring local LLM performance.
Instructions
Get Prometheus-compatible metrics (tokens processed, latency, etc.)
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
No arguments | |||