Eval Library
C
For CartesiaAI Platform

Streaming Tts And Websocket

Cartesia (Sonic) · Cartesia

Voice AI — Cartesia

Cartesia evals — Streaming TTS & WebSocket (relift v3 InfraRed)

About Cartesia

Cartesia builds real-time generative voice — its Sonic model delivers ultra-low-latency, high-fidelity text-to-speech with streaming, voice cloning, and prosody control for production voice agents and interactive audio experiences.

Employees

~40

Industry

Voice AI

Headquarters

San Francisco, CA

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

Agent opens a connection to /tts/websocket but puts the API key only in a later message body, not where the WebSocket handshake expects it.

Authenticate the /tts/websocket connection per docs (API key + Cartesia-Version supplied where the handshake requires, e.g. query param or header per the documented scheme). Establish the socket before sending synthesis frames; handle a rejected handshake distinctly from a mid-stream drop.

Pass / FailAi Platformhigh
02

Agent needs full-duplex behavior (push text incrementally AND cancel/flush mid-stream) but builds it on /tts/sse, which is one-directional.

Use /tts/websocket for bidirectional/interactive cases (incremental input, flush, cancellation, continuations); use /tts/sse for simple one-direction server-streamed audio of a fixed transcript. Pick the transport from the interaction requirements, not familiarity.

Pass / FailAi Platformmedium
03

Agent streams a multi-sentence utterance as several inputs over one socket but uses a fresh context_id per chunk, breaking prosodic continuity.

Reuse a stable context_id across inputs that belong to the same utterance so Sonic continues prosody/state coherently. Allocate a new context_id only when a genuinely new utterance begins. Track context_id lifecycle explicitly.

Pass / FailAi Platformhigh

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Cartesia
  • Ai Platform
  • Streaming Tts And Websocket

Recommended for

Cartesia (Sonic)Cartesia customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.