openai/
$0.00045 / minute
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
Task: the task to perform.
Initial Prompt: optional text to provide as a prompt for the first window (Default: empty).
Temperature: temperature to use for sampling (Default: 0).
Language: language of the audio, given as a two-letter ISO 639-1 code (e.g. en, de, ja); if None, the detected language is used.
Chunk Level: chunk level, either 'segment' or 'word'.
Chunk Length S: chunk length in seconds used to split the audio (Default: 30, 1 ≤ chunk_length_s ≤ 30).
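The constraints listed above can be checked on the client before a request is sent. The following is a minimal sketch, not the provider's API: the helper `build_whisper_params` is hypothetical, and the snake_case keys `task`, `initial_prompt`, `temperature`, `language`, and `chunk_level` are assumed from the form labels (only `chunk_length_s` appears verbatim on the page). Accepted values for `task` are not listed here, so the sketch passes it through unchecked.

```python
import re

# Allowed chunk levels, as stated in the parameter description.
VALID_CHUNK_LEVELS = {"segment", "word"}


def build_whisper_params(task=None, initial_prompt="", temperature=0.0,
                         language=None, chunk_level=None, chunk_length_s=30):
    """Validate the documented constraints and return a parameter dict.

    Hypothetical helper: key names are assumed from the form labels.
    """
    # Language must be a two-letter code (ISO 639-1 style) or None for auto-detect.
    if language is not None and not re.fullmatch(r"[a-z]{2}", language):
        raise ValueError("language must be a two-letter code such as 'en'")
    # Chunk level is either 'segment' or 'word'.
    if chunk_level is not None and chunk_level not in VALID_CHUNK_LEVELS:
        raise ValueError("chunk_level must be 'segment' or 'word'")
    # Chunk length is bounded: 1 <= chunk_length_s <= 30.
    if not 1 <= chunk_length_s <= 30:
        raise ValueError("chunk_length_s must be between 1 and 30")

    # Temperature and chunk length always carry their documented defaults.
    params = {"temperature": temperature, "chunk_length_s": chunk_length_s}
    # Optional fields are included only when the caller sets them.
    if task is not None:
        params["task"] = task  # accepted values are not listed on the page
    if initial_prompt:
        params["initial_prompt"] = initial_prompt
    if language is not None:
        params["language"] = language
    if chunk_level is not None:
        params["chunk_level"] = chunk_level
    return params
```

For example, `build_whisper_params(language="de", chunk_level="word", chunk_length_s=15)` returns a dict that leaves `task` and `initial_prompt` unset, while `chunk_length_s=45` raises a `ValueError` because it exceeds the documented upper bound of 30.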
© 2025 Deep Infra. All rights reserved.