Training Data

Voice Training Data Built by Researchers, for Researchers

Datasets for creating realistic voices across global languages—powering our own state-of-the-art models, and now available to power yours.

50+ Languages
48+ Emotions
600+ Voice Descriptors

Datasets

Covering the Full Spectrum of Voice

Conversational Audio

Turn-taking, interruptions, and multi-speaker dynamics.

Emotional Reproduction

Fine-grained annotations across a wide range of expressive speech.

Multilingual Audio

Native speaker recordings across global languages and dialects.

Voice Realism

Prosody, intonation, pacing, and expressive range.

Domain-Specific

Industry contexts like healthcare, education, and customer service.

Task-Specific

Conversations for assistants, support, tutoring, and research.

How It Works

From Research Question to Production-Ready Data

Hume operates a research-grade data pipeline purpose-built for voice.

1. Request Samples

Start with curated speech datasets from our library.

2. Create Your Own

Launch custom collections with defined speakers and recording conditions.

3. License Access

Datasets include rich metadata: demographics, acoustics, and labels.

4. API Access

Programmatically refresh or generate new training data.

Ready to explore our training data?

Talk to our research team about how Hume's datasets can accelerate your voice AI development.
