Skip to main content
Glama

gemini-flash-1.5 vs grok-3-mini-beta

Pricing, Performance & Features Comparison

Price unit:
Authorgoogle
Context Length1M
Reasoning
-
Providers1
ReleasedMay 2024
Knowledge CutoffSep 2024
License-

Gemini 1.5 Flash is a multimodal large language model from Google designed for high-volume tasks with low latency. It supports text, audio, image, and video inputs, and can output up to 8,192 tokens while maintaining efficient performance. Its long context window and advanced reasoning capabilities make it versatile for summarization, chat applications, and data extraction tasks.

Input$0.075
Output$0.3
Latency (p50)-
Output Limit8K
Function Calling
JSON Mode
InputText, Image, Audio, Video
OutputText
in$0.075out$0.3--
Authorxai
Context Length131K
Reasoning
-
Providers1
ReleasedMay 2024
Knowledge CutoffNov 2024
License-

A lightweight model that thinks before responding. Fast, smart, and great for logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible.

Input$0.3
Output$0.5
Latency (p50)2.3s
Output Limit131K
Function Calling
JSON Mode
-
InputText
OutputText
in$0.3out$0.5--
Latency (24h)
Success Rate (24h)