Skip to main content
Glama

get_inference_usage

Retrieve usage statistics for serverless inference subscriptions, including token counts for chat models, character usage for text-to-speech, monthly allotment details, and overage information.

Instructions

Get usage statistics for a serverless inference subscription.

Args: subscription_id: The inference subscription ID or label

Returns: Detailed usage information including: - chat: Token usage for chat/completion models - audio: Character usage for text-to-speech models - monthly_allotment: Total tokens/characters allocated - overage: Usage exceeding the monthly limit

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
subscription_idYes

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/rsp2k/mcp-vultr'

If you have feedback or need assistance with the MCP directory API, please join our Discord server