Skip to main content
Glama

command-r-08-2024 vs phi-3.5-mini-128k-instruct

Pricing, Performance & Features Comparison

Price unit:
Flag of Canada (Pantone colours)
command-r-08-2024
Authorcohere
Context Length128K
Reasoning
-
Providers1
ReleasedAug 2024
Knowledge CutoffOct 2023
License-

cohere/command-r-08-2024 is a 32-billion parameter generative model tailored for multilingual reasoning, summarization, and question answering. It supports advanced tool use features (function calling, agents) and retrieval-augmented generation, while maintaining robust performance across diverse tasks. The model excels at long context interactions, with improved decision-making and structured data analysis.

Input$0.14
Output$0.57
Latency (p50)960ms
Output Limit4K
Function Calling
JSON Mode
-
InputText
OutputText
Flag of Canada (Pantone colours)
cohere
in$0.14out$0.57--
Latency (24h)
Success Rate (24h)
Authormicrosoft
Context Length128K
Reasoning
-
Providers1
ReleasedAug 2024
Knowledge CutoffOct 2023
LicenseMIT License

Phi-3.5-mini-128k-instruct is a compact, advanced language model that can handle up to 128K tokens in context, enabling tasks such as lengthy document summarization, multi-turn conversation, and complex reasoning. It is optimized for logic, code, and math, and provides robust multi-lingual capabilities. With 3.8 billion parameters, it uses a dense decoder-only transformer architecture that balances performance and efficiency.

Input$0.1
Output$0.1
Latency (p50)-
Output Limit128K
Function Calling
-
JSON Mode
-
InputText
OutputText
in$0.1out$0.1--