Fast, Reliable AI Gateway

Connect to 100+ AI models using an OpenAI compatible API.

We're focused on:

Performance

~40ms latency

Security

End-to-end encryption

Reliability

99% uptime

We support every major AI provider:

Additionally, we don't charge any hidden fees. You pay for what you use at listed prices.

Get your API key Documentation

Glama gateway in numbers

+ billion

tokens per day

active users

models

OpenAI compatible API

Connect, load balance, setup fallbacks and seamlessly manage 100+ AI models using an OpenAI compatible API.

import { OpenAI } from 'openai'

const openai = new OpenAI({
  baseURL: 'https://glama.ai/api/gateway/openai/v1',
  apiKey: GLAMA_API_KEY,
});

await openai.chat.completions.create({
  messages: [
    {
      role: 'user',
      content: 'Hello!'
    }
  ],
  model: 'anthropic/claude-2',
});

Benefit from our globally distributed infrastructure, low latency, transparent pricing, no rate limits, built-in caching, logging and consolidated billing.

Real-Time API Cost Insights

Get real-time visibility into your API costs with detailed analytics of token consumption, cache performance, and overall expenditure.

Complete Transparency with Detailed Logs

Every API interaction is logged, allowing you to track spending, audit usage, implement guardrails, and troubleshoot issues. Easily export logs to JSON format.

Join Our Community

Join our community for product updates, feature discussions, and support.

Discord1,754 members

Reddit172 members