FLUX.2 [klein]

FLUX.2 [klein] is our fastest model family, generating and editing (multiple) images in under a second without sacrificing quality. It is built for real-time applications, creative iteration, and deployment on consumer hardware.
| Model | Best For |
|---|---|
| [klein] 4B | Maximum speed |
| [klein] 9B | Best quality-to-latency ratio, production apps |
Licensing: 4B models are Apache 2.0. 9B models use the FLUX.2-dev Non-Commercial License.
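The table and licensing note above can be combined into a small selection rule. The following is a minimal sketch, not part of any official SDK: `KleinVariant`, `VARIANTS`, and `pick_variant` are hypothetical names introduced for illustration. It assumes that the 9B models are not usable commercially under the license named here; check the actual license terms before relying on that.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KleinVariant:
    """Hypothetical record mirroring one row of the model table."""
    name: str
    best_for: str
    license: str
    commercial_use: bool  # assumption derived from the license name


# Values copied from the table and licensing note; the commercial_use
# flags are an interpretation, not a statement from the source.
VARIANTS = [
    KleinVariant("[klein] 4B", "Maximum speed", "Apache 2.0", True),
    KleinVariant(
        "[klein] 9B",
        "Best quality-to-latency ratio, production apps",
        "FLUX.2-dev Non-Commercial License",
        False,
    ),
]


def pick_variant(commercial: bool, prefer_quality: bool) -> KleinVariant:
    """Return a variant that fits the licensing need, then the preference."""
    # Drop variants whose license does not fit a commercial project.
    candidates = [v for v in VARIANTS if v.commercial_use or not commercial]
    if prefer_quality:
        for v in candidates:
            if v.name.endswith("9B"):
                return v
    # Fall back to the first remaining candidate (4B, the fastest).
    return candidates[0]


if __name__ == "__main__":
    choice = pick_variant(commercial=True, prefer_quality=True)
    print(choice.name, "-", choice.license)
```

Under these assumptions, a commercial project always resolves to the 4B variant, while a non-commercial project that prefers quality resolves to 9B.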
Example focused on realism

Example focused on output diversity


© 2026 DeepInfra. All rights reserved.