DeepInfra raises $107M Series B to scale the inference cloud — read the announcement

At DeepInfra, we care about one thing above all: making cutting-edge AI models accessible. Today, we're excited to release the most downloaded model to our platform.
Whether you're a visual artist, developer, or building an app that relies on high-fidelity outputs, this is the model series you need.
With over 12 million downloads across platforms like HuggingFace and Civitai, the Juggernaut FLUX Series has earned its place as the most trusted name in photorealistic AI image generation. This series delivers results. From lightning-fast inference speeds to pro-grade detail rendering, these models are for creators who expect more from their tools.
Prompt: A Brazilian street dancer with caramel skin and curly hair wearing a cropped graphic tee and loose cargo pants mid-movement in an expressive hip-hop pose, a vibrant graffiti-covered wall behind them. Golden hour lighting.
Num inference steps: 4
Seed: 42
Prompt: A Brazilian street dancer with caramel skin and curly hair wearing a cropped graphic tee and loose cargo pants mid-movement in an expressive hip-hop pose, a vibrant graffiti-covered wall behind them. Golden hour lighting.
Num inference steps: 33
Seed: 42
Do not forget to follow us on Linkedin and on X (formerly Twitter).
Lzlv model for roleplaying and creative workRecently an interesting new model got released.
It is called Lzlv, and it is basically
a merge of few existing models. This model is using the Vicuna prompt format, so keep this
in mind if you are using our raw [API](/lizpreciatior/lzlv_70b...
Accelerating Reasoning Workflows with Nemotron 3 Nano on DeepInfraDeepInfra is an official launch partner for NVIDIA Nemotron 3 Nano, the newest open reasoning model in the Nemotron family. Our goal is to give developers, researchers, and teams the fastest and simplest path to using Nemotron 3 Nano from day one.
Search That Actually Works: A Guide to LLM RerankersSearch relevance isn’t a nice-to-have feature for your site or app. It can make or break the entire user experience.
When a customer searches "best laptop for video editing" and gets results for gaming laptops or budget models, they leave empty-handed.
Embeddings help you find similar content, bu...© 2026 DeepInfra. All rights reserved.