Skip to main content
Glama

gemini-flash-1.5 vs gpt-4o-2024-05-13

Pricing, Performance & Features Comparison

Price unit:
Authorgoogle
Context Length1M
Reasoning
-
Providers1
ReleasedMay 2024
Knowledge CutoffSep 2024
License-

Gemini 1.5 Flash is a multimodal large language model from Google designed for high-volume tasks with low latency. It supports text, audio, image, and video inputs, and can output up to 8,192 tokens while maintaining efficient performance. Its long context window and advanced reasoning capabilities make it versatile for summarization, chat applications, and data extraction tasks.

Input$0.075
Output$0.3
Latency (p50)-
Output Limit8K
Function Calling
JSON Mode
InputText, Image, Audio, Video
OutputText
in$0.075out$0.3--
Authoropenai
Context Length128K
Reasoning
-
Providers1
ReleasedMay 2024
Knowledge CutoffOct 2023
License-
Superseded Bygpt-4o-2024-08-06

openai/gpt-4o-2024-05-13 is a high-capacity multimodal language model suited for both lightweight and complex tasks. It offers a large input context window and supports advanced features like multi-tool calls, making it capable of handling diverse scenarios. This model is designed for chat-based interactions, with configurable parameters for controlled outputs.

Input$5
Output$15
Latency (p50)1s
Output Limit4K
Function Calling
JSON Mode
InputText, Image
OutputText
in$5out$15cache$1.3-
Latency (24h)
Success Rate (24h)