Streaming TTS and WebSocket
Cartesia evals — Streaming TTS & WebSocket (relift v3 InfraRed)
About Cartesia
Cartesia builds real-time generative voice — its Sonic model delivers ultra-low-latency, high-fidelity text-to-speech with streaming, voice cloning, and prosody control for production voice agents and interactive audio experiences.
Sample tests · showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Agent opens a connection to /tts/websocket but puts the API key only in a later message body, not where the WebSocket handshake expects it. | Authenticate the /tts/websocket connection per docs (API key + Cartesia-Version supplied where the handshake requires, e.g. query param or header per the documented scheme). Establish the socket before sending synthesis frames; handle a rejected handshake distinctly from a mid-stream drop. | Pass / Fail · AI Platform · high |
| 02 | Agent needs full-duplex behavior (push text incrementally AND cancel/flush mid-stream) but builds it on /tts/sse, which is one-directional. | Use /tts/websocket for bidirectional/interactive cases (incremental input, flush, cancellation, continuations); use /tts/sse for simple one-direction server-streamed audio of a fixed transcript. Pick the transport from the interaction requirements, not familiarity. | Pass / Fail · AI Platform · medium |
| 03 | Agent streams a multi-sentence utterance as several inputs over one socket but uses a fresh context_id per chunk, breaking prosodic continuity. | Reuse a stable context_id across inputs that belong to the same utterance so Sonic continues prosody/state coherently. Allocate a new context_id only when a genuinely new utterance begins. Track context_id lifecycle explicitly. | Pass / Fail · AI Platform · high |
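
The expected behaviors in the three sample tests can be illustrated with a small, network-free Python sketch. It is only an illustration under stated assumptions: the host name, the query parameter names (`api_key`, `cartesia_version`), and the helper names `build_handshake_url`, `choose_transport`, and `ContextTracker` are hypothetical and not taken from this page, so check the Cartesia documentation for the real authentication scheme before relying on any of them.

```python
import uuid
from urllib.parse import urlencode

# ASSUMPTION: host and query parameter names are illustrative placeholders,
# not confirmed by this page. Consult the Cartesia docs for the real scheme.
ASSUMED_HOST = "wss://api.cartesia.ai"


def build_handshake_url(api_key: str, version: str) -> str:
    """Test 01: supply credentials at handshake time, not in a later message."""
    if not api_key:
        raise ValueError("API key must be available before the socket is opened")
    query = urlencode({"api_key": api_key, "cartesia_version": version})
    return f"{ASSUMED_HOST}/tts/websocket?{query}"


def choose_transport(incremental_input: bool, needs_cancel_or_flush: bool) -> str:
    """Test 02: pick the endpoint from the interaction requirements."""
    if incremental_input or needs_cancel_or_flush:
        return "/tts/websocket"  # bidirectional, interactive cases
    return "/tts/sse"  # fixed transcript, server-to-client streaming only


class ContextTracker:
    """Test 03: one context_id per utterance, reused for every chunk of it."""

    def __init__(self) -> None:
        self._active = {}  # utterance key -> context_id

    def context_for(self, utterance_key: str) -> str:
        # Allocate only when a genuinely new utterance begins.
        if utterance_key not in self._active:
            self._active[utterance_key] = str(uuid.uuid4())
        return self._active[utterance_key]

    def finish(self, utterance_key: str) -> None:
        # End of utterance: the next use of this key gets a fresh context_id.
        self._active.pop(utterance_key, None)


if __name__ == "__main__":
    tracker = ContextTracker()
    first = tracker.context_for("greeting")
    second = tracker.context_for("greeting")
    print(first == second)  # both chunks share one context_id
    print(choose_transport(incremental_input=True, needs_cancel_or_flush=False))
```

Keeping the `context_id` bookkeeping in one object makes the lifecycle explicit: a chunk can never get a fresh identifier by accident, and ending an utterance is a deliberate call rather than a side effect.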
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
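
One possible reading of the pass rule, sketched in Python. It assumes each rubric criterion receives a single numeric score and that the mean is taken across criteria; the page does not state the scoring scale or how scores are aggregated, and the function name `passes` and its parameters are hypothetical.

```python
from statistics import mean


def passes(scores: dict, min_mean: float = 4.0, floor: float = 3.0) -> bool:
    """Return True when the mean score is >= min_mean and no score is below floor.

    ASSUMPTION: `scores` maps criterion name -> one numeric score, and the
    mean is computed across criteria. The source does not confirm this.
    """
    if not scores:
        raise ValueError("at least one criterion score is required")
    values = list(scores.values())
    return mean(values) >= min_mean and min(values) >= floor


if __name__ == "__main__":
    # Mean is exactly 4.0 and the lowest score is 3, so both conditions hold.
    print(passes({"auth": 5, "transport": 3}))
    # Mean is 4.0 but one criterion is below 3, so the floor condition fails.
    print(passes({"auth": 5, "transport": 5, "context": 2}))
```

The two conditions are checked independently, so a strong average cannot mask a single weak criterion.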
Rubric criteria
- Cartesia
- AI Platform
- Streaming TTS and WebSocket