The Wan2.1 1.3B model is a lightweight, efficient text-to-video generator. Despite its compact size, it delivers impressive performance across benchmarks and generates high-quality 480P videos.
The Wan2.1 1.3B model is a lightweight, efficient text-to-video generator. Despite its compact size, it delivers impressive performance across benchmarks and generates high-quality 480P videos.
You can use cURL or any other http client to run inferences:
curl -X POST \
-d '{"prompt": "A hand with delicate fingers picks up a bright yellow lemon from a wooden bowl filled with lemons and sprigs of mint against a peach-colored background. The hand gently tosses the lemon up and catches it, showcasing its smooth texture. A beige string bag sits beside the bowl, adding a rustic touch to the scene. Additional lemons, one halved, are scattered around the base of the bowl. The even lighting enhances the vibrant colors and creates a fresh, inviting atmosphere."}' \
-H "Authorization: bearer $DEEPINFRA_TOKEN" \
-H 'Content-Type: application/json' \
'https://api.deepinfra.com/v1/inference/Wan-AI/Wan2.1-T2V-1.3B'
which will give you back something similar to:
{
"video_url": "/model/inference/pyramid_sample.mp4",
"seed": "12345",
"request_id": null,
"inference_status": {
"status": "unknown",
"runtime_ms": 0,
"cost": 0.0,
"tokens_generated": 0,
"tokens_input": 0
}
}
guidance_scale
numberControls how closely the generated video follows the text prompt. Higher values (>1.0) produce content more closely aligned with the prompt but may reduce overall quality. A value of 1.0 disables guidance.
Default value: 5
webhook
fileThe webhook to call when inference is done, by default you will get the output in the response of your inference request