DeepInfra raises $107M Series B to scale the inference cloud — read the announcement
deepseek-ai/
$0.10
in
$0.20
out
$0.02
cached
/ 1M tokens
| Tier | Input | Output | Cached input |
|---|---|---|---|
Priority (1.5×)Learn More | $0.15 | $0.30 | $0.03 |
per 1M tokens
DeepSeek V4 Flash is an efficiency-focused MoE model with 284B total parameters (13B active) and a 1M-token context window. It's tuned for fast inference and high-throughput use cases while still holding up on reasoning and coding tasks.

1IioDTus
2026-04-24T06:16:17+00:00
© 2026 DeepInfra. All rights reserved.