Glama

phi-3-medium-128k-instruct vs gemini-1.5-flash-001

Pricing, Performance & Features Comparison

Author: microsoft
Context Length: 128K
Reasoning: -
Providers: 1
Released: May 2024
Knowledge Cutoff: Oct 2023
License: MIT License

microsoft/phi-3-medium-128k-instruct is a 14B-parameter language model optimized for high-quality, reasoning-intensive tasks across math, code, and logic. It offers a long context window of 128,000 tokens, making it suitable for handling large inputs in memory-constrained environments. The model is fine-tuned for instruction-following and safety, delivering robust performance across various natural language benchmarks.

Input Price: $0.00
Output Price: $0.00
Latency (p50): -
Output Limit: 4K
Function Calling: -
JSON Mode: -
Input Modalities: Text
Output Modalities: Text

Author: google
Context Length: 1M
Reasoning: -
Providers: 1
Released: May 2024
Knowledge Cutoff: May 2024
License: -
Superseded By: gemini-1.5-flash

Gemini 1.5 Flash-001 is designed for high-volume, cost-effective multimodal applications spanning text, images, audio, video, and PDFs. It offers a 1M-token context window, supports function calling, and can output structured data such as JSON. The model balances speed and quality, which suits it to fast, lower-cost deployments.

Input Price: $0.075
Output Price: $0.3
Latency (p50): -
Output Limit: 8K
Function Calling: Yes
JSON Mode: Yes
Input Modalities: Text, Image, Audio, Video
Output Modalities: Text
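
To make the price difference concrete, the listed input and output prices can be turned into a per-request cost estimate. The following is a minimal sketch, not an official calculator. It assumes the prices are quoted per 1,000,000 tokens, which this page does not state, so the unit is exposed as the `unit_tokens` parameter. The names `ModelPricing` and `estimate_cost` are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPricing:
    """Listed prices for one model (hypothetical helper type)."""
    name: str
    input_price: float   # USD per `unit_tokens` input tokens
    output_price: float  # USD per `unit_tokens` output tokens


def estimate_cost(pricing: ModelPricing, input_tokens: int, output_tokens: int,
                  unit_tokens: int = 1_000_000) -> float:
    """Estimate the USD cost of one request.

    ASSUMPTION: prices are quoted per `unit_tokens` tokens. The default of
    1,000,000 is a guess; the price unit is not given on the page.
    """
    total = (input_tokens * pricing.input_price
             + output_tokens * pricing.output_price)
    return total / unit_tokens


# Prices copied from the listing above.
PHI3 = ModelPricing("phi-3-medium-128k-instruct", 0.00, 0.00)
GEMINI = ModelPricing("gemini-1.5-flash-001", 0.075, 0.3)

if __name__ == "__main__":
    # Example request: 10,000 input tokens and 2,000 output tokens.
    for model in (PHI3, GEMINI):
        cost = estimate_cost(model, input_tokens=10_000, output_tokens=2_000)
        print(f"{model.name}: ${cost:.6f}")
```

If the prices turn out to be quoted in a different unit, passing another `unit_tokens` value rescales the estimate without changing anything else.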