xAI Grok 2 Mini

xAI Grok 2 Mini: $0.2/1M input tokens, 650 tok/s throughput, 300ms latency. Compare on BenchNode.io.

$0.2

per 1M input tokens

$0.4 / 1M output

Specifications

model id: grok-2-mini
model family: Grok
parameters: undisclosed
context window tokens: 131,072
modality: text
reasoning: No

Performance

Latency (TTFT) 300 ms

lower is better

Uptime SLA 99.9%

Throughput 650 tok/s

300

ms latency

99.9%

uptime

Pricing Detail

input per 1m tokens usd: 0.2
output per 1m tokens usd: 0.4
free tier available: No

AI Analysis · gpt-4o-mini

Technical Verdict

With an input cost of $0.2 / 1M tokens and an output cost of $0.4 / 1M tokens, this pricing is mid-tier for the model class, while the throughput of 650 tok/s is fast. The undisclosed rate limits suggest suitability for low-volume prototyping, and the first-token latency of 300 ms favors real-time applications like chat interfaces rather than batch processing.