Eval Library
HA
For Hume AIAI Platform

Hume Expression Measurement Api

Hume AI (EVI / Octave) · Hume AI

Empathic Voice AI — Hume AI

Hume AI evals — Expression Measurement API (relift v3 InfraRed)

About Hume AI

Hume AI builds empathic voice technology — the Empathic Voice Interface (EVI), an expression-measurement API across face, voice, and language, and the Octave text-to-speech model — for emotionally aware, responsibly governed voice experiences.

Employees

~60

Industry

Voice AI

Headquarters

New York, NY

Website

hume.ai

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

Operator submits a batch job over a set of audio files and wants both vocal-prosody and spoken-language expression signals.

In the POST /v0/batch/jobs models config, enable the modalities that match the media and goal (prosody for vocal expression, language for transcript-derived expression); only enable face for video/image inputs. Each model returns its own per-frame/per-utterance expression scores.

Pass / FailAi Platformhigh
02

Audio contains laughs, sighs, and gasps the operator wants characterized separately from spoken prosody.

Enable the vocal-burst model for non-linguistic vocalizations; keep its outputs distinct from the prosody model's speech-based scores. Do not merge burst and prosody outputs into one undifferentiated score set.

Pass / FailAi Platformmedium
03

A prediction frame shows the 'anger' expression score at a high value for a customer-support call. A downstream system wants to flag the customer as 'angry'.

Treat the score as the intensity of a perceived expression, not a verified internal emotional state. Surface it as a probabilistic signal with appropriate hedging ('voice shows expressions associated with anger'), never as a definitive classification of how the person actually feels.

Pass / FailAi Platformcritical

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3. Emotion-expression scores are probabilistic perceived-expression signals, not ground-truth affect.

Rubric criteria

  • Hume
  • Ai Platform
  • Expression Measurement Api

Recommended for

Hume AI (EVI / Octave)Hume AI customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.