DeepInfra raises $107M Series B to scale the inference cloud — read the announcement
You can use cURL or any other http client to run inferences:
curl -X POST \
-d '{"prompt": "A woman in her twenties dressed as a friendly party clown \u2014 light, natural face makeup, a round red nose, bright red curly clown wig, and a colorful costume \u2014 performs a magic trick on the green lawn of a cozy backyard birthday party: with a graceful flourish of her hands, a bright bouquet of colorful flowers suddenly appears, and a small group of excited kids gasp, clap, and laugh. Natural human face with realistic skin texture and a warm genuine smile. Behind them: a wooden fence, a picnic table with a frosted birthday cake, and colorful decorations strung between two trees. Bright natural afternoon sunlight, candid documentary style, photorealistic, shot on a DSLR, vivid natural colors, lively gentle motion, sharp focus."}' \
-H "Authorization: bearer $DEEPINFRA_TOKEN" \
-H 'Content-Type: application/json' \
'https://api.deepinfra.com/v1/inference/FastVideo/LTX-2.3-Distilled-Diffusers'
which will give you back something similar to:
{
"video_url": "/model/inference/pyramid_sample.mp4",
"seed": "12345",
"request_id": null,
"inference_status": {
"status": "unknown",
"runtime_ms": 0,
"cost": 0.0,
"tokens_generated": 0,
"tokens_input": 0,
"output_length": 0
}
}
© 2026 DeepInfra. All rights reserved.