gemini-1.5-pro-001 vs gpt-4o-2024-05-13

Pricing, Performance & Features Comparison

gemini-1.5-pro-001

Authorgoogle

Context Length1M

Reasoning

Providers1

ReleasedMay 2024

Knowledge CutoffMay 2024

License-

Superseded Bygemini-1.5-pro

Gemini 1.5 Pro is Google’s multimodal large language model designed for advanced text generation, summarization, and complex reasoning. It supports inputs such as images, audio, video, and PDFs, as well as system instructions and JSON-formatted outputs. The model excels at handling large context windows and offers features like grounding, tuning, and provisioned throughput.

Input$1.3

Output$5

Latency (p50)-

Output Limit8K

Function Calling

JSON Mode

InputText, Image, Audio, Video

OutputText

google-vertex

in$1.3out$5--

gpt-4o-2024-05-13

Authoropenai

Context Length128K

Reasoning

Providers1

ReleasedMay 2024

Knowledge CutoffOct 2023

License-

Superseded Bygpt-4o-2024-08-06

openai/gpt-4o-2024-05-13 is a high-capacity multimodal language model suited for both lightweight and complex tasks. It offers a large input context window and supports advanced features like multi-tool calls, making it capable of handling diverse scenarios. This model is designed for chat-based interactions, with configurable parameters for controlled outputs.

Input$5

Output$15

Latency (p50)2.2s

Output Limit4K

Function Calling

JSON Mode

InputText, Image

OutputText

openai

in$5out$15cache$1.3-

gemini-1.5-pro-001 vs gpt-4o-2024-05-13

Latency (24h)

Success Rate (24h)