gemini-flash-1.5 vs gpt-4o-2024-05-13

Pricing, Performance & Features Comparison

gemini-flash-1.5

Authorgoogle

Context Length1M

Reasoning

Providers1

ReleasedMay 2024

Knowledge CutoffSep 2024

License-

Gemini 1.5 Flash is a multimodal large language model from Google designed for high-volume tasks with low latency. It supports text, audio, image, and video inputs, and can output up to 8,192 tokens while maintaining efficient performance. Its long context window and advanced reasoning capabilities make it versatile for summarization, chat applications, and data extraction tasks.

Input$0.075

Output$0.3

Latency (p50)-

Output Limit8K

Function Calling

JSON Mode

InputText, Image, Audio, Video

OutputText

google-vertex

in$0.075out$0.3--

gpt-4o-2024-05-13

Authoropenai

Context Length128K

Reasoning

Providers1

ReleasedMay 2024

Knowledge CutoffOct 2023

License-

Superseded Bygpt-4o-2024-08-06

openai/gpt-4o-2024-05-13 is a high-capacity multimodal language model suited for both lightweight and complex tasks. It offers a large input context window and supports advanced features like multi-tool calls, making it capable of handling diverse scenarios. This model is designed for chat-based interactions, with configurable parameters for controlled outputs.

Input$5

Output$15

Latency (p50)1.8s

Output Limit4K

Function Calling

JSON Mode

InputText, Image

OutputText

openai

in$5out$15cache$1.3-

gemini-flash-1.5 vs gpt-4o-2024-05-13

Latency (24h)

Success Rate (24h)