We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

DeepInfra raises $107M Series B to scale the inference cloud — read the announcement

openai logo

openai/

whisper-large-v3

$0.00045

/ minute

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

openai/whisper-large-v3 cover image

Input

Please upload an audio file

You need to log in to use this model

Log In

Settings

Task

task to perform

Initial Prompt

optional text to provide as a prompt for the first window.. (Default: empty)

Temperature

temperature to use for sampling (Default: 0)

Language

language that the audio is in; uses detected language if None; use two letter language code (ISO 639-1) (e.g. en, de, ja)

Chunk Level

chunk level, either 'segment' or 'word'

Chunk Length S

chunk length in seconds to split audio (Default: 30, 1 ≤ chunk_length_s ≤ 30)

Output