The world's most realistic voice AI, in real-time

Prompt the first LLM for text-to-speech to create new voices, instruct emotions, and more

Describe the desired AI voice's identity, voice qualities, and more

To help shape the voice we generate, input something distinctive this AI voice would say

A text-to-speech system that understands what it's saying

Octave (Omni-capable text and voice engine) isn't a traditional TTS model. It’s a voice-based LLM. That means it understands what words mean in context, so it can predict emotions, cadence, and more.

Create any voice you can imagine with Octave Voice Design

Create any AI voice you can imagine, like a "sarcastic medieval peasant," with a brief prompt or evocative script

Sarcastic medieval peasant

Full prompt: The speaker is a medieval peasant with a cockney accent, raspy voice, dripping with sarcasm.

00:00
00:00

Literature professor

Full prompt: A retired Black female literature professor who analyzes poetry with precise academic language and references to her own published criticism.

00:00
00:00

Charming cowboy

Full prompt: The speaker is a grizzled old cowboy with a folksy Texan drawl Southern accent, speaking in a charismatic tone with a deep but relaxed vibe.

00:00
00:00

Sitcom inner monologue

Full prompt: The star of a popular sitcom, with frequent inner monologues about her life.

00:00
00:00

Dungeon master

Full prompt: A know-it-all dungeons and dragons dungeon master speaking excitedly with a lisp.

00:00
00:00

Warm English narrator

Full prompt: The speaker is a sophisticated British female narrator with a gentle, warm voice, recounting the ending of a classic romance novel.

00:00
00:00

Unserious movie trailer guy

Full prompt: The speaker is an American, deep middle-aged male film trailer narrator for a film about chickens.

00:00
00:00

Raspy evil vampire

Full prompt: A villainous undead vampire, with a horrifying raspy voice, and a slight Transylvanian accent.

00:00
00:00

Reminiscing man

Full prompt: A middle-aged African American man, reminiscing with a slightly gravelly voice and a tone of hard-earned wisdom.

00:00
00:00

Nature documentary narrator

Full prompt: The speaker is a distinguished British narrator, whose voice carries a deep sense of wisdom and curiosity.

00:00
00:00

Texan fishing guru

Prompt: The speaker has a booming, charismatic radio voice, like a Texan fishing guru with a hint of gravel and an infectious laugh, perfect for reeling in listeners to 'Big Dicky's live fishing frenzy.'

00:00
00:00

Any emotion or speaking style, on command

Octave is the first TTS system that can take natural language instructions to change emotional delivery and speaking style. Give directions like "sound sarcastic" or "whisper fearfully." For the first time, creators have total control.

For creators and developers alike

Octave was built to generate the most expressive AI voices for any content: podcasts, voiceovers, audiobooks, and more. With our API, you can bring it to any application.

TTS Projects
Empathic Voice Interface (EVI)
The world's most realistic and instructible speech-to-speech model

As a speech-language model, where the same intelligence handles transcription, language, and speech, EVI 3 brings more expressiveness, realism, and emotional understanding to voice AI. 

Text-to-Speech
Octave Text to Speech

Hume's Text-to-Speech model, Octave, is available today for content creators and developers. Octave understands what words mean in context, so it can predict emotions, cadence, and more. It can also take natural language instructions to change emotional delivery and speaking style. Give directions like "sound sarcastic" or "whisper fearfully." For the first time, creators have total control.

TTS Projects
Expression Measurement Models
Emotional intelligence for any application

Measure emotional expression with unmatched precision. One API, four modalities, hundreds of dimensions of emotional expression. 

Trusted By

Design Works Logo
Lge Logo
Woven Logo
Softbank Logo
Humana Logo
Aecho AI Logo
Betteryou Logo
Nestwork Logo
Innovax Systems Logo
Jammy Chat Logo
Aura Health Logo
Wonsulting Logo
Memorang Logo
Flourish Logo
Climb Together Logo
Design Works Logo
Lge Logo
Woven Logo
Softbank Logo
Humana Logo
Aecho AI Logo
Betteryou Logo
Nestwork Logo
Innovax Systems Logo
Jammy Chat Logo
Aura Health Logo
Wonsulting Logo
Memorang Logo
Flourish Logo
Climb Together Logo
Design Works Logo
Lge Logo
Woven Logo
Softbank Logo
Humana Logo
Aecho AI Logo
Betteryou Logo
Nestwork Logo
Innovax Systems Logo
Jammy Chat Logo
Aura Health Logo
Wonsulting Logo
Memorang Logo
Flourish Logo
Climb Together Logo
Sentra Logo
Althea Logo
Study Fetch Logo
Tone AI Logo
Thumos Logo
Ream Logo
New Computer Logo
Everfriends AI Logo
Mynd Logo
Pressmaster AI Logo
Nancy AI Logo
Parrot Prep Logo
Stimuler Logo
Quantanite Logo
University of Zurich
Sentra Logo
Althea Logo
Study Fetch Logo
Tone AI Logo
Thumos Logo
Ream Logo
New Computer Logo
Everfriends AI Logo
Mynd Logo
Pressmaster AI Logo
Nancy AI Logo
Parrot Prep Logo
Stimuler Logo
Quantanite Logo
University of Zurich
Sentra Logo
Althea Logo
Study Fetch Logo
Tone AI Logo
Thumos Logo
Ream Logo
New Computer Logo
Everfriends AI Logo
Mynd Logo
Pressmaster AI Logo
Nancy AI Logo
Parrot Prep Logo
Stimuler Logo
Quantanite Logo
University of Zurich

Developer Resources

Developer Platform

Platform

Create your Hume account, get your API keys, monitor your usage, and explore our products in the interactive platform.

Visit the platform
Documentation New

Documentation

Explore our documentation with concise guides, hands-on tutorials, and an in-depth API reference—crafted to support your integration.

Explore the docs
Developer Community

Community

Join our community of developers and researchers working with Hume APIs—your go-to hub for collaboration, support, and knowledge sharing.

Join our community

00/00