Orpheus TTS is a state-of-the-art, Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been finetuned to deliver human-level speech synthesis, achieving exceptional clarity, expressiveness, and real-time streaming performances.
Orpheus TTS is a state-of-the-art, Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been finetuned to deliver human-level speech synthesis, achieving exceptional clarity, expressiveness, and real-time streaming performances.
Text to convert to speech
Select the desired voice for the speech output. 7
Select the desired format for the speech output. Supported formats include mp3, opus, flac, wav, and pcm. 5
Temperature
Temperature of the generation (Default: 0.4, 0 ≤ temperature ≤ 2)
Top P
Top p value for the generation (Default: 0.9, 0 ≤ top_p ≤ 1)
Maximum number of tokens for the generation (Default: 2000, 0 < max_tokens ≤ 4096)
Repetition penalty for the generation (Default: 1.1, 0 ≤ repetition_penalty)
Whether to stream audio bytes in chunks 2
Waiting for audio data... Submit request to start streaming.