Sonic Tts Synthesis
Cartesia (Sonic) · Cartesia
Voice AI — Cartesia
Cartesia evals — Sonic TTS Synthesis (relift v3 InfraRed)
About Cartesia
Cartesia builds real-time generative voice — its Sonic model delivers ultra-low-latency, high-fidelity text-to-speech with streaming, voice cloning, and prosody control for production voice agents and interactive audio experiences.
Sample tests· showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Agent calls POST /tts/bytes with transcript and voice but omits model_id and output_format, assuming server defaults. | Send model_id (e.g. a Sonic model id), transcript, voice (id or embedding), and a fully-specified output_format object (container, encoding, sample_rate). Do not rely on implicit defaults for format — the byte stream's decodability depends on the requested container/encoding being explicit. | Pass / FailAi Platformhigh |
| 02 | Operator hardcodes a Sonic model_id captured from a blog post a year ago and never revisits it across releases. | Pin model_id explicitly per docs, but treat the id as a managed dependency: track model availability/deprecation from docs.cartesia.ai and have a tested upgrade path. Do not silently fall back to an unspecified default if the pinned id is retired — surface the error and re-pin deliberately. | Pass / FailAi Platformmedium |
| 03 | Agent needs a single short confirmation phrase synthesized for a non-interactive batch report, but wires it through /tts/sse and reassembles chunks. | For a complete, non-streamed clip use POST /tts/bytes and consume the full audio body. Reserve /tts/sse (Server-Sent Events) for cases that benefit from incremental playback. Match the endpoint to the consumption pattern instead of over-engineering chunk reassembly for a one-shot clip. | Pass / FailAi Platformmedium |
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
Rubric criteria
- Cartesia
- Ai Platform
- Sonic Tts Synthesis
Recommended for
Works with
Related evals
Run this eval in your workspace
Connect your data, configure thresholds, and review results with your team.