Skip to main content
Glama

gemini-1.5-pro-001 vs gpt-4o-2024-05-13

Pricing, Performance & Features Comparison

Price unit:
Authorgoogle
Context Length1M
Reasoning
-
Providers1
ReleasedMay 2024
Knowledge CutoffMay 2024
License-
Superseded Bygemini-1.5-pro

Gemini 1.5 Pro is Google’s multimodal large language model designed for advanced text generation, summarization, and complex reasoning. It supports inputs such as images, audio, video, and PDFs, as well as system instructions and JSON-formatted outputs. The model excels at handling large context windows and offers features like grounding, tuning, and provisioned throughput.

Input$1.3
Output$5
Latency (p50)-
Output Limit8K
Function Calling
JSON Mode
InputText, Image, Audio, Video
OutputText
in$1.3out$5--
Authoropenai
Context Length128K
Reasoning
-
Providers1
ReleasedMay 2024
Knowledge CutoffOct 2023
License-
Superseded Bygpt-4o-2024-08-06

openai/gpt-4o-2024-05-13 is a high-capacity multimodal language model suited for both lightweight and complex tasks. It offers a large input context window and supports advanced features like multi-tool calls, making it capable of handling diverse scenarios. This model is designed for chat-based interactions, with configurable parameters for controlled outputs.

Input$5
Output$15
Latency (p50)1s
Output Limit4K
Function Calling
JSON Mode
InputText, Image
OutputText
in$5out$15cache$1.3-
Latency (24h)
Success Rate (24h)