Glama

gemini-flash-1.5 vs phi-3-medium-128k-instruct

Pricing, Performance & Features Comparison

gemini-flash-1.5
Author: google
Context Length: 1M
Reasoning: -
Providers: 1
Released: May 2024
Knowledge Cutoff: Sep 2024
License: -

Gemini 1.5 Flash is a multimodal large language model from Google designed for high-volume, low-latency tasks. It accepts text, audio, image, and video inputs and can generate up to 8,192 output tokens. Its long context window and reasoning capabilities make it suited to summarization, chat applications, and data extraction.

Input Price: $0.075
Output Price: $0.30
Latency (p50): -
Output Limit: 8K
Function Calling: supported
JSON Mode: supported
Input Modalities: Text, Image, Audio, Video
Output Modalities: Text
phi-3-medium-128k-instruct
Author: microsoft
Context Length: 128K
Reasoning: -
Providers: 1
Released: May 2024
Knowledge Cutoff: Oct 2023
License: MIT License

microsoft/phi-3-medium-128k-instruct is a 14B-parameter language model optimized for reasoning-intensive tasks in math, code, and logic. It offers a 128,000-token context window for large inputs and is intended for memory-constrained environments. The model is fine-tuned for instruction following and safety, and performs well across a range of natural language benchmarks.

Input Price: $0.00
Output Price: $0.00
Latency (p50): -
Output Limit: 4K
Function Calling: -
JSON Mode: -
Input Modalities: Text
Output Modalities: Text
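
The listed prices and limits can be turned into a rough per-request comparison. The sketch below is illustrative only and rests on several assumptions not stated on this page: that prices are in USD per 1M tokens, that "1M" and "128K" mean 1,000,000 and 128,000 tokens, that "4K" means 4,096 tokens, and that input and output tokens share the context window. The names `ModelSpec`, `estimate_cost`, and `fits` are hypothetical helpers, not part of any provider API.

```python
from dataclasses import dataclass

# Assumption: the listed prices are USD per 1,000,000 tokens.
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class ModelSpec:
    """Figures copied from the comparison; token counts are interpreted (see lead-in)."""
    name: str
    input_price: float    # USD per price unit of input tokens
    output_price: float   # USD per price unit of output tokens
    context_length: int   # tokens
    output_limit: int     # tokens


GEMINI_FLASH = ModelSpec("gemini-flash-1.5", 0.075, 0.30, 1_000_000, 8_192)
PHI3_MEDIUM = ModelSpec("phi-3-medium-128k-instruct", 0.00, 0.00, 128_000, 4_096)


def estimate_cost(spec: ModelSpec, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed prices."""
    total = input_tokens * spec.input_price + output_tokens * spec.output_price
    return total / TOKENS_PER_PRICE_UNIT


def fits(spec: ModelSpec, input_tokens: int, output_tokens: int) -> bool:
    """True if the request respects the output limit and the context window.

    Assumption: input and output tokens both count against the context window.
    """
    within_output = output_tokens <= spec.output_limit
    within_context = input_tokens + output_tokens <= spec.context_length
    return within_output and within_context


if __name__ == "__main__":
    # Example workload: a 200,000-token document summarized into 2,000 tokens.
    for spec in (GEMINI_FLASH, PHI3_MEDIUM):
        cost = estimate_cost(spec, 200_000, 2_000)
        print(f"{spec.name}: fits={fits(spec, 200_000, 2_000)}, cost=${cost:.4f}")
```

Under these assumptions, a 200,000-token input exceeds the 128K window of phi-3-medium-128k-instruct but fits in gemini-flash-1.5, so the zero listed price only matters for requests that fit the smaller window.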