xAI Grok 2

xAI Grok 2: $2/1M input tokens, 350 tok/s throughput, 800ms latency. Compare on BenchNode.io.

per 1M input tokens

$10 / 1M output

Specifications

model id: grok-2
model family: Grok
parameters: undisclosed
context window tokens: 131,072
modality: text + vision
reasoning: No

Performance

Latency (TTFT) 800 ms

lower is better

Uptime SLA 99.9%

Throughput 350 tok/s

800

ms latency

99.9%

uptime

Pricing Detail

input per 1m tokens usd: 2
output per 1m tokens usd: 10
free tier available: No

AI Analysis · gpt-4o-mini

Technical Verdict

With an input cost of $2.0 / 1M tokens and output cost of $10.0 / 1M tokens, this pricing is premium for the model class, while the throughput of 350 tok/s is relatively fast. The first-token latency of 800 ms and undisclosed rate limits suggest suitability for low-volume prototyping rather than high-volume production, with latency favoring batch use over real-time applications.