DeepInfra raises $107M Series B to scale the inference cloud — read the announcement
Qwen/
$0.18
in
$1.00
out
/ 1M tokens
Qwen3.5-35B-A3B is an efficient Mixture-of-Experts model from Alibaba's Qwen3.5 series with 35B total parameters and only 3B activated per token. It features a 262K token context window (extensible to 1M with YaRN), thinking/reasoning mode, tool calling, and support for 201 languages. Delivers strong performance on reasoning, coding, and vision-language tasks at a fraction of the compute cost.

EAfyQupe
2026-03-24T00:53:31+00:00
© 2026 DeepInfra. All rights reserved.