We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

Qwen3-Max-Thinking state-of-the-art reasoning model at your fingertips!

Browse deepinfra models:

All categories and models you can try out and directly use in deepinfra:

text-generation

automatic-speech-recognition

zero-shot-image-classification

Pixverse-T2V-HD

Pixverse/Pixverse-T2V-HD cover image

The 1080p high-fidelity mode in PixVerse renders videos with significantly enhanced sharpness and visual clarity, capturing intricate details and providing a crisp, professional-grade quality suitable for more polished projects.

PrunaAI/p-video cover image

Real-time AI video generation from text, images, and audio. Supports up to 1080p at 48 FPS with built-in audio generation, draft mode for 4x faster previews, and prompt upsampling.

Qwen-Image-Edit

Qwen/Qwen-Image-Edit cover image

Qwen-Image-Edit is a next-generation image editing model built on top of Qwen-Image, designed for both semantic and appearance-level edits. It excels at tasks like precise text modifications, style transfers, viewpoint transformations, and element adjustments while preserving overall visual consistency.

$0.025 x (width / 1024) x (height / 1024) x (iters / 25)

Qwen-Image-Edit-Max

Qwen/Qwen-Image-Edit-Max cover image

Enhanced industrial design and geometric reasoning, improved character consistency, reduced offset issues, and integrated LoRA capabilities

Qwen/Qwen-Image-Max cover image

Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.

text-generation

Qwen2.5-72B-Instruct

Qwen/Qwen2.5-72B-Instruct cover image

Qwen2.5 is a model pretrained on a large-scale dataset of up to 18 trillion tokens, offering significant improvements in knowledge, coding, mathematics, and instruction following compared to its predecessor Qwen2. The model also features enhanced capabilities in generating long texts, understanding structured data, and generating structured outputs, while supporting multilingual capabilities for over 29 languages.

$0.12 in, $0.39 out / 1M

text-generation

Qwen2.5-VL-32B-Instruct

Qwen/Qwen2.5-VL-32B-Instruct cover image

$0.20 in, $0.60 out / 1M

text-generation

Qwen/Qwen3-14B cover image

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support.

$0.12 in, $0.24 out / 1M

text-generation

Qwen3-235B-A22B-Instruct-2507

Qwen/Qwen3-235B-A22B-Instruct-2507 cover image

Qwen3-235B-A22B-Instruct-2507 is the updated version of the Qwen3-235B-A22B non-thinking mode, featuring Significant improvements in general capabilities, including instruction following, logical reasoning, text comprehension, mathematics, science, coding and tool usage.

$0.071 in, $0.10 out / 1M

text-generation

Qwen3-235B-A22B-Thinking-2507

Qwen/Qwen3-235B-A22B-Thinking-2507 cover image

Qwen3-235B-A22B-Thinking-2507 is the Qwen3's new model with scaling the thinking capability of Qwen3-235B-A22B, improving both the quality and depth of reasoning.

$0.20 cached, $0.23 in, $2.30 out / 1M

text-generation

Qwen/Qwen3-30B-A3B cover image

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support

$0.08 in, $0.28 out / 1M

text-generation

Qwen/Qwen3-32B cover image

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support

$0.08 in, $0.28 out / 1M

text-generation

Qwen3-Coder-480B-A35B-Instruct

Qwen/Qwen3-Coder-480B-A35B-Instruct cover image

Qwen3-Coder-480B-A35B-Instruct is the Qwen3's most agentic code model, featuring Significant Performance on Agentic Coding, Agentic Browser-Use and other foundational coding tasks, achieving results comparable to Claude Sonnet.

$0.40 in, $1.60 out / 1M

text-generation

Qwen3-Coder-480B-A35B-Instruct-Turbo

Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo cover image

Qwen3-Coder-480B-A35B-Instruct is the Qwen3's most agentic code model, featuring Significant Performance on Agentic Coding, Agentic Browser-Use and other foundational coding tasks, achieving results comparable to Claude Sonnet.

$0.022 cached, $0.22 in, $1.00 out / 1M

Qwen3-Embedding-0.6B

Qwen/Qwen3-Embedding-0.6B cover image

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B).

$0.010 / 1M tokens

Qwen3-Embedding-0.6B-batch

Qwen/Qwen3-Embedding-0.6B-batch cover image

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B).

$0.005 / 1M tokens

Qwen3-Embedding-4B

Qwen/Qwen3-Embedding-4B cover image

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B).

$0.020 / 1M tokens

Qwen3-Embedding-4B-batch

Qwen/Qwen3-Embedding-4B-batch cover image

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B).

$0.010 / 1M tokens

Qwen3-Embedding-8B

Qwen/Qwen3-Embedding-8B cover image

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B).

$0.050 / 1M tokens

Qwen3-Embedding-8B-batch

Qwen/Qwen3-Embedding-8B-batch cover image

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B).

$0.040 / 1M tokens

text-generation

Qwen/Qwen3-Max cover image

The latest flagship model in the Qwen family. State-of-the-art results across a comprehensive suite of benchmarks — including knowledge, reasoning, coding, instruction following, human preference alignment, agent tasks, and multilingual understanding.

$1.20 in $6.00 out $0.24 cached / 1M tokens

text-generation

Qwen3-Max-Thinking

Qwen/Qwen3-Max-Thinking cover image

The latest flagship reasoning model in the Qwen3 family. Further enhanced by multiple innovations like adaptive tool-use and advanced test-time scaling techniques

$1.20 in $6.00 out $0.24 cached / 1M tokens

text-generation

Qwen3-Next-80B-A3B-Instruct

Qwen/Qwen3-Next-80B-A3B-Instruct cover image

Over the past few months, we have observed increasingly clear trends toward scaling both total parameters and context lengths in the pursuit of more powerful and agentic artificial intelligence (AI). We are excited to share our latest advancements in addressing these demands, centered on improving scaling efficiency through innovative model architecture. We call this next-generation foundation models Qwen3-Next.

$0.09 in, $1.10 out / 1M

Qwen3-Reranker-0.6B

Qwen/Qwen3-Reranker-0.6B cover image

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B)

$0.010 / 1M tokens

SOC 2 Certified

ISO 27001 Certified

Have questions or need a custom solution?

Company

Latest Models

moonshotai/Kimi-K2-Instruct-0905 zai-org/GLM-4.6 anthropic/claude-3-7-sonnet-latest deepseek-ai/DeepSeek-V3.2-Exp deepseek-ai/DeepSeek-V3.1

Featured Models

Bria/video_remove_background deepseek-ai/DeepSeek-V3.2 black-forest-labs/FLUX-2-klein-9b ResembleAI/chatterbox-turbo black-forest-labs/FLUX-2-klein-4b

Built With Love in Palo Alto

© 2026 Deep Infra. All rights reserved.

Privacy Policy Terms of Service