For Cartesia · AI Platform

Sonic TTS Synthesis

Cartesia (Sonic) · Cartesia

Voice AI — Cartesia

Evaluates Cartesia's Sonic TTS Synthesis across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Voice AI eval coverage.

About Cartesia

Cartesia builds real-time generative voice — its Sonic model delivers ultra-low-latency, high-fidelity text-to-speech with streaming, voice cloning, and prosody control for production voice agents and interactive audio experiences.

Employees

~40

Industry

Voice AI

Headquarters

San Francisco, CA

Website

cartesia.ai

Sample tests · showing 3 of 9

#	Input	Expected behavior	Check
01	Agent calls POST /tts/bytes with transcript and voice but omits model_id and output_format, assuming server defaults.	Send model_id (e.g. a Sonic model id), transcript, voice (id or embedding), and a fully specified output_format object (container, encoding, sample_rate). Do not rely on implicit defaults for format: the byte stream's decodability depends on the requested container/encoding being explicit.	Pass / Fail · AI Platform · high
02	Operator hardcodes a Sonic model_id captured from a blog post a year ago and never revisits it across releases.	Pin model_id explicitly per docs, but treat the id as a managed dependency: track model availability/deprecation from docs.cartesia.ai and have a tested upgrade path. Do not silently fall back to an unspecified default if the pinned id is retired; surface the error and re-pin deliberately.	Pass / Fail · AI Platform · medium
03	Agent needs a single short confirmation phrase synthesized for a non-interactive batch report, but wires it through /tts/sse and reassembles chunks.	For a complete, non-streamed clip use POST /tts/bytes and consume the full audio body. Reserve /tts/sse (Server-Sent Events) for cases that benefit from incremental playback. Match the endpoint to the consumption pattern instead of over-engineering chunk reassembly for a one-shot clip.	Pass / Fail · AI Platform · medium
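The expected behaviors in the three sample cases can be sketched as small client-side guards. This is a minimal illustration under stated assumptions, not Cartesia's SDK: the helper names (`build_tts_payload`, `require_pinned_model`, `choose_endpoint`), the `available_ids` list, the nested shape of `voice`, and all example values are hypothetical. Only the field names and endpoint paths come from the table above. No network call is made.

```python
# The three output_format keys named in sample test 01.
REQUIRED_FORMAT_KEYS = ("container", "encoding", "sample_rate")


def build_tts_payload(model_id, transcript, voice, output_format):
    """Build a request body, refusing to rely on implicit server defaults."""
    if not model_id:
        raise ValueError("model_id must be set explicitly")
    if not transcript:
        raise ValueError("transcript must be non-empty")
    if not voice:
        raise ValueError("voice (id or embedding) must be provided")
    missing = [k for k in REQUIRED_FORMAT_KEYS if k not in (output_format or {})]
    if missing:
        raise ValueError("output_format is missing: " + ", ".join(missing))
    return {
        "model_id": model_id,
        "transcript": transcript,
        "voice": voice,
        "output_format": dict(output_format),
    }


def require_pinned_model(pinned_id, available_ids):
    """Surface an error when the pinned id is gone instead of falling back.

    available_ids is a hypothetical list you maintain from the vendor docs.
    """
    if pinned_id not in available_ids:
        raise LookupError(
            f"pinned model_id {pinned_id!r} is not available; re-pin deliberately"
        )
    return pinned_id


def choose_endpoint(incremental_playback):
    """Match the endpoint to how the audio will be consumed."""
    return "/tts/sse" if incremental_playback else "/tts/bytes"


# Usage: a one-shot clip for a batch report (all values are placeholders).
payload = build_tts_payload(
    model_id=require_pinned_model("sonic-example", ["sonic-example"]),
    transcript="Report complete.",
    voice={"mode": "id", "id": "voice-id-placeholder"},  # assumed shape
    output_format={"container": "wav", "encoding": "pcm_s16le", "sample_rate": 44100},
)
endpoint = choose_endpoint(incremental_playback=False)
```

Failing fast on the client keeps the mistakes from cases 01 and 02 visible at build time rather than as undecodable audio or a silently changed voice in production.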

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
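One way to read the pass rule is as a two-part check on the judge's per-criterion scores: the mean must reach 4.0 and no individual score may fall below 3. The sketch below assumes that reading and assumes numeric scores keyed by criterion name. The function name and the criterion names in the usage lines are hypothetical, and the scoring scale is not stated on this page.

```python
from statistics import mean


def case_passes(criterion_scores, mean_threshold=4.0, floor=3):
    """Return True when the mean meets the threshold and no score is below the floor."""
    scores = list(criterion_scores.values())
    if not scores:
        raise ValueError("no criterion scores to grade")
    return mean(scores) >= mean_threshold and min(scores) >= floor


# Hypothetical criterion names, for illustration only.
strong = case_passes({"explicit_format": 5, "endpoint_choice": 4, "error_handling": 3})
weak_spot = case_passes({"explicit_format": 5, "endpoint_choice": 5, "error_handling": 2})
```

The second call shows why the floor matters: the mean is still 4.0, but a single criterion scoring 2 fails the case.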

Rubric criteria

Cartesia
AI Platform
Sonic TTS Synthesis

Recommended for

Cartesia (Sonic) · Cartesia customers

Works with

Cartesia

Related evals

AI Platform

Claude API

Evaluates Anthropic's Batch API across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Foundation Model & API eval coverage.

AI Platform

Claude API

Evaluates Anthropic's Extended Thinking across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Foundation Model & API eval coverage.

AI Platform

Claude API

Evaluates Anthropic's Files API & Citations across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Foundation Model & API eval coverage.

Frequently asked questions

What does the Sonic TTS Synthesis eval for Cartesia (Sonic) test?

Evaluates Cartesia's Sonic TTS Synthesis across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Voice AI eval coverage.

How is the Sonic TTS Synthesis eval scored?

The judge rubric: Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

How many test cases does this eval pack include?

The Sonic TTS Synthesis pack for Cartesia (Sonic) contains 9 test cases. 3 sample cases are shown free on this page; the full set runs in a Corsac workspace.

How do I run this eval?

Sign up for Corsac, connect your model or agent endpoint, and run the Sonic TTS Synthesis pack as-is or after customizing thresholds. Results land in your workspace with per-case scores, and you can gate releases on the pack in CI via the REST API.
