LLM Model Prices
Glama chat pricing is calculated using the official rates from our model and service providers. These prices are passed through without any additional fees per token.
If you find a price discrepancy, please let us know at support@glama.ai. We will investigate and correct billing retroactively.
Last updated: 2024-12-03T01:38:03.370Z
Provider | Model | Capabilities | Max Input Tokens | Max Output Tokens | Input USD / Token | Output USD / Token | TPS | Supported |
---|---|---|---|---|---|---|---|---|
alibaba | qwen-max | FC | 128000 | 6000 | 0.000010 | 0.000030 | - | Yes |
alibaba | qwen-plus | FC | 30000 | 30000 | 0.0000030 | 0.0000090 | - | Yes |
alibaba | qwen-turbo | FC | 6000 | 6000 | 0.00000040 | 0.0000012 | - | Yes |
anthropic | claude-2 | | 100000 | 8191 | 0.000008 | 0.000024 | - | No |
anthropic | claude-2.1 | | 200000 | 8191 | 0.000008 | 0.000024 | - | No |
anthropic | claude-3-5-haiku-20241022 | FC | 200000 | 8192 | 0.000001 | 0.000005 | - | No |
anthropic | claude-3-5-sonnet-20240620 | FCVI | 200000 | 8192 | 0.000003 | 0.000015 | 52 | Yes |
anthropic | claude-3-5-sonnet-20241022 | FCVI | 200000 | 8192 | 0.000003 | 0.000015 | - | Yes |
anthropic | claude-3-haiku-20240307 | FCVI | 200000 | 4096 | 0.00000025 | 0.0000013 | 105 | No |
anthropic | claude-3-opus-20240229 | FCVI | 200000 | 4096 | 0.000015 | 0.000075 | 19 | Yes |
anthropic | claude-3-sonnet-20240229 | FCVI | 200000 | 4096 | 0.000003 | 0.000015 | 65 | No |
anthropic | claude-instant-1 | | 100000 | 8191 | 0.0000016 | 0.0000055 | - | No |
anthropic | claude-instant-1.2 | | 100000 | 8191 | 0.00000016 | 0.00000055 | - | No |
azure | command-r-plus | FC | 128000 | 4096 | 0.000003 | 0.000015 | - | No |
azure | global-standard/gpt-4o-2024-08-06 | FCPFCVI | 128000 | 16384 | 0.0000025 | 0.00001 | - | No |
azure | global-standard/gpt-4o-2024-11-20 | FCPFCVI | 128000 | 16384 | 0.0000025 | 0.00001 | - | No |
azure | global-standard/gpt-4o-mini | FCPFCVI | 128000 | 16384 | 0.00000015 | 0.0000006 | - | No |
azure | gpt-35-turbo | FC | 4097 | 4096 | 0.0000005 | 0.0000015 | - | No |
azure | gpt-35-turbo-0125 | FCPFC | 16384 | 4096 | 0.0000005 | 0.0000015 | - | No |
azure | gpt-35-turbo-0301 | FCPFC | 4097 | 4096 | 0.0000002 | 0.000002 | - | No |
azure | gpt-35-turbo-0613 | FCPFC | 4097 | 4096 | 0.0000015 | 0.000002 | - | No |
azure | gpt-35-turbo-1106 | FCPFC | 16384 | 4096 | 0.000001 | 0.000002 | - | No |
azure | gpt-35-turbo-16k | | 16385 | 4096 | 0.000003 | 0.000004 | - | No |
azure | gpt-35-turbo-16k-0613 | FC | 16385 | 4096 | 0.000003 | 0.000004 | - | No |
azure | gpt-4 | FC | 8192 | 4096 | 0.00003 | 0.00006 | - | No |
azure | gpt-4-0125-preview | FCPFC | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
azure | gpt-4-0613 | FC | 8192 | 4096 | 0.00003 | 0.00006 | - | No |
azure | gpt-4-1106-preview | FCPFC | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
azure | gpt-4-32k | | 32768 | 4096 | 0.00006 | 0.00012 | - | No |
azure | gpt-4-32k-0613 | | 32768 | 4096 | 0.00006 | 0.00012 | - | No |
azure | gpt-4-turbo | FCPFC | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
azure | gpt-4-turbo-2024-04-09 | FCPFCVI | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
azure | gpt-4-turbo-vision-preview | VI | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
azure | gpt-4o | FCPFCVI | 128000 | 4096 | 0.000005 | 0.000015 | - | No |
azure | gpt-4o-2024-05-13 | FCPFCVI | 128000 | 4096 | 0.000005 | 0.000015 | - | No |
azure | gpt-4o-2024-08-06 | FCPFCVI | 128000 | 16384 | 0.0000028 | 0.000011 | - | No |
azure | gpt-4o-2024-11-20 | FCPFCVI | 128000 | 16384 | 0.0000028 | 0.000011 | - | No |
azure | gpt-4o-mini | FCPFCVI | 128000 | 16384 | 0.00000017 | 0.00000066 | - | No |
azure | gpt-4o-mini-2024-07-18 | FCPFCVI | 128000 | 16384 | 0.00000017 | 0.00000066 | - | No |
azure | o1-mini | FCPFC | 128000 | 65536 | 0.000003 | 0.000012 | - | No |
azure | o1-mini-2024-09-12 | FCPFC | 128000 | 65536 | 0.000003 | 0.000012 | - | No |
azure | o1-preview | FCPFC | 128000 | 32768 | 0.000015 | 0.00006 | - | No |
azure | o1-preview-2024-09-12 | FCPFC | 128000 | 32768 | 0.000015 | 0.00006 | - | No |
glama | qwq-32b-preview | | 32000 | 32000 | 0.00000017 | 0.00000070 | - | Yes |
| | gemini-1.0-pro | FC | 32760 | 8192 | 0.0000005 | 0.0000015 | - | No |
| | gemini-1.0-pro-001 | FC | 32760 | 8192 | 0.0000005 | 0.0000015 | - | No |
| | gemini-1.0-pro-002 | FC | 32760 | 8192 | 0.0000005 | 0.0000015 | - | No |
| | gemini-1.0-pro-vision | FCVI | 16384 | 2048 | 0.00000025 | 0.0000005 | - | No |
| | gemini-1.0-pro-vision-001 | FCVI | 16384 | 2048 | 0.00000025 | 0.0000005 | - | No |
| | gemini-1.0-ultra | FC | 8192 | 2048 | 0.0000005 | 0.0000015 | - | No |
| | gemini-1.0-ultra-001 | FC | 8192 | 2048 | 0.0000005 | 0.0000015 | - | No |
| | gemini-1.5-flash | FCVI | 1000000 | 8192 | 0.000000075 | 0.0000003 | 2397 | No |
| | gemini-1.5-flash-001 | FCVI | 1000000 | 8192 | 0.000000075 | 0.0000003 | - | No |
| | gemini-1.5-flash-002 | FCVI | 1048576 | 8192 | 0.000000075 | 0.0000003 | - | No |
| | gemini-1.5-flash-exp-0827 | FCVI | 1000000 | 8192 | 0.0000000047 | 0.0000000047 | - | No |
| | gemini-1.5-flash-preview-0514 | FCVI | 1000000 | 8192 | 0.000000075 | 0.0000000047 | - | No |
| | gemini-1.5-pro | FC | 2097152 | 8192 | 0.0000013 | 0.000005 | 225 | No |
| | gemini-1.5-pro-001 | FC | 1000000 | 8192 | 0.0000013 | 0.000005 | - | No |
| | gemini-1.5-pro-002 | FC | 2097152 | 8192 | 0.0000013 | 0.000005 | - | No |
| | gemini-1.5-pro-preview-0215 | FC | 1000000 | 8192 | 0.000000078 | 0.00000031 | - | No |
| | gemini-1.5-pro-preview-0409 | FC | 1000000 | 8192 | 0.000000078 | 0.00000031 | - | No |
| | gemini-1.5-pro-preview-0514 | FC | 1000000 | 8192 | 0.000000078 | 0.00000031 | - | No |
| | gemini-pro | FC | 32760 | 8192 | 0.0000005 | 0.0000015 | 561 | No |
| | gemini-pro-vision | FCVI | 16384 | 2048 | 0.00000025 | 0.0000005 | - | No |
google-vertex | gemini-1.5-flash-001 | VIFC | 1000000 | 8192 | 0.000000075 | 0.00000030 | - | Yes |
google-vertex | gemini-1.5-flash-002 | VIFC | 1048576 | 8192 | 0.000000075 | 0.00000030 | - | Yes |
google-vertex | gemini-1.5-pro-001 | FC | 1000000 | 8192 | 0.0000013 | 0.0000050 | - | Yes |
google-vertex | gemini-1.5-pro-002 | FC | 2097152 | 8192 | 0.0000013 | 0.0000050 | - | Yes |
groq | gemma-7b-it | FC | 8192 | 8192 | 0.00000007 | 0.00000007 | - | No |
groq | gemma2-9b-it | FC | 8192 | 8192 | 0.0000002 | 0.0000002 | - | No |
groq | llama-3.1-405b-reasoning | FC | 8192 | 8192 | 0.00000059 | 0.00000079 | - | No |
groq | llama-3.1-70b-versatile | FC | 8192 | 8192 | 0.00000059 | 0.00000079 | - | No |
groq | llama-3.1-8b-instant | FC | 8192 | 8192 | 0.00000005 | 0.00000008 | - | No |
groq | llama-3.2-11b-text-preview | FC | 8192 | 8192 | 0.00000018 | 0.00000018 | - | No |
groq | llama-3.2-11b-vision-preview | FC | 8192 | 8192 | 0.00000018 | 0.00000018 | - | No |
groq | llama-3.2-1b-preview | FC | 8192 | 8192 | 0.00000004 | 0.00000004 | - | No |
groq | llama-3.2-3b-preview | FC | 8192 | 8192 | 0.00000006 | 0.00000006 | - | No |
groq | llama-3.2-90b-text-preview | FC | 8192 | 8192 | 0.0000009 | 0.0000009 | - | No |
groq | llama-3.2-90b-vision-preview | FC | 8192 | 8192 | 0.0000009 | 0.0000009 | - | No |
groq | llama2-70b-4096 | FC | 4096 | 4096 | 0.0000007 | 0.0000008 | - | No |
groq | llama3-70b-8192 | FC | 8192 | 8192 | 0.00000059 | 0.00000079 | - | No |
groq | llama3-8b-8192 | FC | 8192 | 8192 | 0.00000005 | 0.00000008 | - | No |
groq | llama3-groq-70b-8192-tool-use-preview | FC | 8192 | 8192 | 0.00000089 | 0.00000089 | - | No |
groq | llama3-groq-8b-8192-tool-use-preview | FC | 8192 | 8192 | 0.00000019 | 0.00000019 | - | No |
groq | mixtral-8x7b-32768 | FC | 32768 | 32768 | 0.00000024 | 0.00000024 | - | No |
mistral | codestral-2405 | | 32000 | 8191 | 0.000001 | 0.000003 | - | No |
mistral | codestral-latest | | 32000 | 8191 | 0.000001 | 0.000003 | - | No |
mistral | codestral-mamba-latest | | 256000 | 256000 | 0.00000025 | 0.00000025 | - | No |
mistral | ministral-3b-2410 | FC | 128000 | 4096 | 0.000000040 | 0.000000040 | - | Yes |
mistral | ministral-8b-2410 | FC | 128000 | 4096 | 0.00000010 | 0.00000010 | - | Yes |
mistral | mistral-large-2402 | FC | 32000 | 8191 | 0.000004 | 0.000012 | - | No |
mistral | mistral-large-2407 | FC | 128000 | 128000 | 0.000003 | 0.000009 | - | No |
mistral | mistral-large-2411 | FC | 128000 | 4096 | 0.0000020 | 0.0000060 | - | Yes |
mistral | mistral-large-latest | FC | 128000 | 128000 | 0.000003 | 0.000009 | - | No |
mistral | mistral-medium | | 32000 | 8191 | 0.0000027 | 0.0000081 | - | No |
mistral | mistral-medium-2312 | | 32000 | 8191 | 0.0000027 | 0.0000081 | - | No |
mistral | mistral-medium-latest | | 32000 | 8191 | 0.0000027 | 0.0000081 | - | No |
mistral | mistral-small | FC | 32000 | 8191 | 0.000001 | 0.000003 | - | No |
mistral | mistral-small-latest | FC | 32000 | 8191 | 0.000001 | 0.000003 | - | No |
mistral | mistral-tiny | | 32000 | 8191 | 0.00000025 | 0.00000025 | - | No |
mistral | open-codestral-mamba | | 256000 | 256000 | 0.00000025 | 0.00000025 | - | No |
mistral | open-mistral-7b | | 32000 | 8191 | 0.00000025 | 0.00000025 | - | No |
mistral | open-mistral-nemo | | 128000 | 128000 | 0.0000003 | 0.0000003 | - | No |
mistral | open-mistral-nemo-2407 | | 128000 | 128000 | 0.0000003 | 0.0000003 | - | No |
mistral | open-mixtral-8x22b | FC | 64000 | 8191 | 0.000002 | 0.000006 | - | No |
mistral | open-mixtral-8x7b | FC | 32000 | 8191 | 0.0000007 | 0.0000007 | - | No |
mistral | pixtral-12b-2409 | FCVI | 128000 | 128000 | 0.00000015 | 0.00000015 | - | No |
openai | gpt-3.5-turbo | FC | 16385 | 4096 | 0.0000015 | 0.000002 | 134 | No |
openai | gpt-3.5-turbo-0125 | FCPFC | 16385 | 4096 | 0.0000005 | 0.0000015 | 134 | No |
openai | gpt-3.5-turbo-0301 | | 4097 | 4096 | 0.0000015 | 0.000002 | - | No |
openai | gpt-3.5-turbo-0613 | FC | 4097 | 4096 | 0.0000015 | 0.000002 | - | No |
openai | gpt-3.5-turbo-1106 | FCPFC | 16385 | 4096 | 0.000001 | 0.000002 | 109 | No |
openai | gpt-3.5-turbo-16k | | 16385 | 4096 | 0.000003 | 0.000004 | - | No |
openai | gpt-3.5-turbo-16k-0613 | | 16385 | 4096 | 0.000003 | 0.000004 | - | No |
openai | gpt-4 | FC | 8192 | 4096 | 0.00003 | 0.00006 | 75 | No |
openai | gpt-4-0125-preview | FCPFC | 128000 | 4096 | 0.00001 | 0.00003 | 43 | No |
openai | gpt-4-0314 | | 8192 | 4096 | 0.00003 | 0.00006 | - | No |
openai | gpt-4-0613 | FC | 8192 | 4096 | 0.00003 | 0.00006 | - | No |
openai | gpt-4-1106-preview | FCPFC | 128000 | 4096 | 0.00001 | 0.00003 | 23 | No |
openai | gpt-4-1106-vision-preview | VI | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
openai | gpt-4-32k | | 32768 | 4096 | 0.00006 | 0.00012 | - | No |
openai | gpt-4-32k-0314 | | 32768 | 4096 | 0.00006 | 0.00012 | - | No |
openai | gpt-4-32k-0613 | | 32768 | 4096 | 0.00006 | 0.00012 | - | No |
openai | gpt-4-turbo | FCPFCVI | 128000 | 4096 | 0.00001 | 0.00003 | 52 | No |
openai | gpt-4-turbo-2024-04-09 | FCPFCVI | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
openai | gpt-4-turbo-preview | FCPFC | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
openai | gpt-4-vision-preview | VI | 128000 | 4096 | 0.00001 | 0.00003 | - | No |
openai | gpt-4o | FCPFCVI | 128000 | 16384 | 0.0000025 | 0.00001 | 75 | Yes |
openai | gpt-4o-2024-05-13 | FCPFCVI | 128000 | 4096 | 0.000005 | 0.000015 | - | Yes |
openai | gpt-4o-2024-08-06 | FCPFCVI | 128000 | 16384 | 0.0000025 | 0.00001 | - | Yes |
openai | gpt-4o-2024-11-20 | FCPFCVI | 128000 | 16384 | 0.0000025 | 0.00001 | - | Yes |
openai | gpt-4o-audio-preview | FCPFC | 128000 | 16384 | 0.0000025 | 0.00001 | - | No |
openai | gpt-4o-audio-preview-2024-10-01 | FCPFC | 128000 | 16384 | 0.0000025 | 0.00001 | - | No |
openai | gpt-4o-mini | FCPFCVI | 128000 | 16384 | 0.00000015 | 0.0000006 | 18 | Yes |
openai | gpt-4o-mini-2024-07-18 | FCPFCVI | 128000 | 16384 | 0.00000015 | 0.0000006 | - | Yes |
openai | o1-mini | FCPFC | 128000 | 65536 | 0.000003 | 0.000012 | - | No |
openai | o1-mini-2024-09-12 | FCPFC | 128000 | 65536 | 0.000003 | 0.000012 | - | No |
openai | o1-preview | FCPFC | 128000 | 32768 | 0.000015 | 0.00006 | - | No |
openai | o1-preview-2024-09-12 | FCPFC | 128000 | 32768 | 0.000015 | 0.00006 | - | No |
perplexity | codellama-34b-instruct | | 16384 | 16384 | 0.00000035 | 0.0000014 | - | No |
perplexity | codellama-70b-instruct | | 16384 | 16384 | 0.0000007 | 0.0000028 | - | No |
perplexity | llama-2-70b-chat | | 4096 | 4096 | 0.0000007 | 0.0000028 | - | No |
perplexity | llama-3.1-70b-instruct | | 131072 | 131072 | 0.000001 | 0.000001 | - | No |
perplexity | llama-3.1-8b-instruct | | 131072 | 131072 | 0.0000002 | 0.0000002 | - | No |
perplexity | llama-3.1-sonar-huge-128k-online | | 127072 | 127072 | 0.000005 | 0.000005 | - | No |
perplexity | llama-3.1-sonar-large-128k-chat | | 131072 | 131072 | 0.000001 | 0.000001 | - | No |
perplexity | llama-3.1-sonar-large-128k-online | | 127072 | 127072 | 0.000001 | 0.000001 | - | No |
perplexity | llama-3.1-sonar-small-128k-chat | | 131072 | 131072 | 0.0000002 | 0.0000002 | - | No |
perplexity | llama-3.1-sonar-small-128k-online | | 127072 | 127072 | 0.0000002 | 0.0000002 | - | No |
perplexity | mistral-7b-instruct | | 4096 | 4096 | 0.00000007 | 0.00000028 | - | No |
perplexity | mixtral-8x7b-instruct | | 4096 | 4096 | 0.00000007 | 0.00000028 | - | No |
perplexity | pplx-70b-chat | | 4096 | 4096 | 0.0000007 | 0.0000028 | - | No |
perplexity | pplx-7b-chat | | 8192 | 8192 | 0.00000007 | 0.00000028 | - | No |
perplexity | sonar-medium-chat | | 16384 | 16384 | 0.0000006 | 0.0000018 | - | No |
perplexity | sonar-small-chat | | 16384 | 16384 | 0.00000007 | 0.00000028 | - | No |
replicate | meta/llama-2-13b | | 4096 | 4096 | 0.0000001 | 0.0000005 | - | No |
replicate | meta/llama-2-13b-chat | | 4096 | 4096 | 0.0000001 | 0.0000005 | - | No |
replicate | meta/llama-2-70b | | 4096 | 4096 | 0.00000065 | 0.0000028 | - | No |
replicate | meta/llama-2-70b-chat | | 4096 | 4096 | 0.00000065 | 0.0000028 | - | No |
replicate | meta/llama-2-7b | | 4096 | 4096 | 0.00000005 | 0.00000025 | - | No |
replicate | meta/llama-2-7b-chat | | 4096 | 4096 | 0.00000005 | 0.00000025 | - | No |
replicate | meta/llama-3-70b | | 8192 | 8192 | 0.00000065 | 0.0000028 | - | No |
replicate | meta/llama-3-70b-instruct | | 8192 | 8192 | 0.00000065 | 0.0000028 | - | No |
replicate | meta/llama-3-8b | | 8086 | 8086 | 0.00000005 | 0.00000025 | - | No |
replicate | meta/llama-3-8b-instruct | | 8086 | 8086 | 0.00000005 | 0.00000025 | - | No |
replicate | mistralai/mistral-7b-instruct-v0.2 | | 4096 | 4096 | 0.00000005 | 0.00000025 | - | No |
replicate | mistralai/mistral-7b-v0.1 | | 4096 | 4096 | 0.00000005 | 0.00000025 | - | No |
replicate | mistralai/mixtral-8x7b-instruct-v0.1 | | 4096 | 4096 | 0.0000003 | 0.000001 | - | No |
vertex_ai | claude-3-5-haiku | FC | 200000 | 8192 | 0.000001 | 0.000005 | - | No |
vertex_ai | claude-3-5-haiku@20241022 | FC | 200000 | 8192 | 0.000001 | 0.000005 | - | No |
vertex_ai | claude-3-5-sonnet | FCVI | 200000 | 8192 | 0.000003 | 0.000015 | - | No |
vertex_ai | claude-3-5-sonnet-v2 | FCVI | 200000 | 8192 | 0.000003 | 0.000015 | - | No |
vertex_ai | claude-3-5-sonnet-v2@20241022 | FCVI | 200000 | 8192 | 0.000003 | 0.000015 | - | No |
vertex_ai | claude-3-5-sonnet@20240620 | FCVI | 200000 | 8192 | 0.000003 | 0.000015 | - | No |
vertex_ai | claude-3-haiku | FCVI | 200000 | 4096 | 0.00000025 | 0.0000013 | - | No |
vertex_ai | claude-3-haiku@20240307 | FCVI | 200000 | 4096 | 0.00000025 | 0.0000013 | - | No |
vertex_ai | claude-3-opus | FCVI | 200000 | 4096 | 0.000015 | 0.000075 | - | No |
vertex_ai | claude-3-opus@20240229 | FCVI | 200000 | 4096 | 0.000015 | 0.000075 | - | No |
vertex_ai | claude-3-sonnet | FCVI | 200000 | 4096 | 0.000003 | 0.000015 | - | No |
vertex_ai | claude-3-sonnet@20240229 | FCVI | 200000 | 4096 | 0.000003 | 0.000015 | - | No |
vertex_ai | codestral@2405 | FC | 128000 | 128000 | 0.000001 | 0.000003 | - | No |
vertex_ai | codestral@latest | FC | 128000 | 128000 | 0.000001 | 0.000003 | - | No |
vertex_ai | jamba-1.5 | | 256000 | 256000 | 0.0000002 | 0.0000004 | - | No |
vertex_ai | jamba-1.5-large | | 256000 | 256000 | 0.000002 | 0.000008 | - | No |
vertex_ai | jamba-1.5-large@001 | | 256000 | 256000 | 0.000002 | 0.000008 | - | No |
vertex_ai | jamba-1.5-mini | | 256000 | 256000 | 0.0000002 | 0.0000004 | - | No |
vertex_ai | jamba-1.5-mini@001 | | 256000 | 256000 | 0.0000002 | 0.0000004 | - | No |
vertex_ai | mistral-large@2407 | FC | 128000 | 8191 | 0.000003 | 0.000009 | - | No |
vertex_ai | mistral-large@latest | FC | 128000 | 8191 | 0.000003 | 0.000009 | - | No |
vertex_ai | mistral-nemo@2407 | FC | 128000 | 128000 | 0.000003 | 0.000003 | - | No |
vertex_ai | mistral-nemo@latest | FC | 128000 | 128000 | 0.000003 | 0.000003 | - | No |
xai | grok-beta | FCVI | 131072 | 131072 | 0.000005 | 0.000015 | - | Yes |
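
Because the rates above are quoted in USD per single token, a rough cost estimate for a request is the input token count times the input rate plus the output token count times the output rate. The sketch below illustrates that arithmetic with a few rates copied from the table. The `RATES` dictionary and the `estimate_cost` and `per_million` helpers are hypothetical names used only for illustration, not part of any Glama API, and the sketch assumes no rounding or other adjustments are applied at billing time.

```python
from decimal import Decimal

# (provider, model) -> (input USD/token, output USD/token), copied from the table above.
# Decimal is used instead of float so that very small rates are represented exactly.
RATES = {
    ("openai", "gpt-4o"): (Decimal("0.0000025"), Decimal("0.00001")),
    ("anthropic", "claude-3-5-sonnet-20241022"): (Decimal("0.000003"), Decimal("0.000015")),
    ("xai", "grok-beta"): (Decimal("0.000005"), Decimal("0.000015")),
}


def estimate_cost(provider: str, model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper, pass-through rates only)."""
    input_rate, output_rate = RATES[(provider, model)]
    return input_tokens * input_rate + output_tokens * output_rate


def per_million(rate: Decimal) -> Decimal:
    """Convert a USD-per-token rate to the more familiar USD per 1M tokens."""
    return rate * 1_000_000


if __name__ == "__main__":
    cost = estimate_cost("openai", "gpt-4o", input_tokens=1000, output_tokens=500)
    print(f"Estimated cost: {cost.normalize()} USD")
    print(f"gpt-4o input rate: {per_million(RATES[('openai', 'gpt-4o')][0]).normalize()} USD per 1M tokens")
```

For example, a request to openai gpt-4o with 1,000 input tokens and 500 output tokens works out to 1000 × 0.0000025 + 500 × 0.00001 = 0.0075 USD under these assumptions.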