Together AI DeepSeek V3

Together AI DeepSeek V3: $1.25/1M input tokens, 450 tok/s throughput, 400ms latency. Compare on BenchNode.io.

$1.25

per 1M input tokens

$1.25 / 1M output

Free tier available

Specifications

model id: deepseek-v3
model family: DeepSeek V3
parameters: 685B MoE
context window tokens: 128,000
modality: text
reasoning: No

Performance

Latency (TTFT) 400 ms

lower is better

Uptime SLA 99.9%

Throughput 450 tok/s

400

ms latency

99.9%

uptime

Pricing Detail

input per 1m tokens usd: 1.25
output per 1m tokens usd: 1.25
free tier available: Yes
rate limit tpm: 10,000
rate limit rpm: 60

AI Analysis · gpt-4o-mini

Technical Verdict

The input and output costs are both $1.25 per 1M tokens, which positions this API tier in the mid-range for its model class, while the throughput of 450 tok/s is relatively fast. With rate limits of 10,000 TPM and 60 RPM, this setup is more suited for low-volume prototyping rather than high-volume production, and the first-token latency of 400 ms favors batch use over real-time applications.