Retrieve usage statistics for serverless inference subscriptions, including token counts for chat models, character usage for text-to-speech, monthly allotment details, and overage information.
Discover available AI models on Replicate for image generation and inference tasks. Filter results by owner to find specific models for your project needs.
Enables scraping and comparing pricing information for LLM inference services across multiple providers (CloudRift, DeepInfra, Fireworks, Groq) using Firecrawl API and SQLite storage.
Enables access to 200,000+ machine learning models through the Hugging Face Inference API. Supports text generation, image creation, classification, translation, speech processing, embeddings, and more AI tasks.
Enables integration of local LLM capabilities with MCP-compatible clients like Claude Desktop, Continue.dev, and Cline. Provides tools for processing text prompts through local language models using a customizable inference function.