Skip to main content
Glama

grok-vision-beta vs nova-lite-v1

Pricing, Performance & Features Comparison

Price unit:
Authorxai
Context Length8K
Reasoning
-
Providers1
ReleasedNov 2024
Knowledge CutoffOct 2023
License-

xAI’s Grok Vision Beta is an experimental large language model with integrated vision capabilities, supporting both text and image inputs. It can generate text-based responses in multiple languages and offers a context window of up to 8,192 tokens. The model is currently offered as a beta release and does not support fine-tuning on custom datasets.

Input$5
Output$15
Latency (p50)-
Output Limit8K
Function Calling
JSON Mode
-
InputText, Image
OutputText
in$5out$15--
Authoramazon
Context Length300K
Reasoning
-
Providers1
ReleasedNov 2024
Knowledge Cutoff-
License-

Amazon Nova Lite v1 is a multimodal model designed for high-speed, cost-effective processing of text, image, and video inputs. It supports a large context window of 300k tokens and can produce up to 5k tokens in output, making it suitable for complex interactive scenarios. The model also offers fine-tuning capabilities on Amazon Bedrock to tailor performance to specific use cases.

Input$0.06
Output$0.24
Latency (p50)-
Output Limit5K
Function Calling
-
JSON Mode
-
InputText, Image, Video
OutputText
in$0.06out$0.24--