Pricing
Pricing that works at any scale
Choose an affordable plan that’s packed with the best features for engaging your audience, creating customer loyalty, and driving sales.
Plan | Free $0 / month | Starter $3 / month | Creator $14 / month | Pro $70 / month | Scale $200 / month | Business $500 / month | Enterprise Custom |
---|---|---|---|---|---|---|---|
Text-to-speech | |||||||
Monthly included characters | 10,000(~10 minutes) | 30,000(~30 minutes) | 140,000(~140 minutes) | 1,000,000(~1,000 minutes) | 3,300,000(~3,300 minutes) | 10,000,000(~10,000 minutes) | As much as you need |
Additional characters cost (Usage-based) | $0.15/1,000 | $0.12/1,000 | $0.10/1,000 | $0.05/1,000 | Custom | ||
RPM (requests per minute) | 15 | 15 | 75 | 75 | 150 | 225 | Custom |
Projects | 20 | 1,000 | 3,000 | 10,000 | 20,000 | As much as you need | |
Commercial license | |||||||
Speech-to-speech (EVI 3) | |||||||
Monthly EVI 3 usage included | 5 minutes | 40 minutes($0.07/minute) | 200 minutes($0.07/minute) | 1,200 minutes($0.06/minute) | 5,000 minutes($0.05/minute) | 12,500 minutes($0.04/minute) | As much as you need |
Additional EVI 3 cost (Usage-based) | $0.06/minute | $0.05/minute | $0.04/minute | Custom | |||
External LLMs | |||||||
Concurrent connections | 1 | 5 | 5 | 10 | 20 | 30 | As much as you need |
Voices | |||||||
Voice cloning | Create only | Create only | Unlimited (create and use) | Unlimited (create and use) | Unlimited (create and use) | Unlimited (create and use) | Unlimited (create and use) |
Expression Measurement pricing
Expression Measurement API
Pay as you go
Video with audio
Facial expression, Speech prosody, Vocal burst, Emotional language, Facemesh, Transcription
$0.0276 / min
Audio only
Speech prosody, Vocal burst, Emotional language, Transcription
$0.0213 / min
Video only
Facial expression, Facemesh
$0.015 / min
Images
Facial expression, Facemesh
$0.00068 / image
Text only
Emotional language
$0.00008 / word
Enterprise
Video with audio
Facial expression, Speech prosody, Vocal burst, Emotional language, Facemesh, Transcription
Volume discounts
Audio only
Speech prosody, Vocal burst, Emotional language, Transcription
Volume discounts
Video only
Facial expression, Facemesh
Volume discounts
Images
Facial expression, Facemesh
Volume discounts
Text only
Emotional language
Volume discounts