Batch Stt Rest
Deepgram Speech AI Platform · Deepgram
Speech AI Platform — Deepgram (Nova STT, Aura TTS, Voice Agent)
Deepgram evals — Batch STT (REST) (relift v3 InfraRed)
About Deepgram
Deepgram is a speech-AI platform offering streaming and batch speech-to-text (Nova), Aura text-to-speech, speaker diarization, redaction, and smart formatting across 30+ languages — used by voice-agent platforms, contact centers, and media teams.
Sample tests· showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Operator submits POST https://api.deepgram.com/v1/listen?model=nova-3 with Content-Type: application/json and body {"url":"https://media.example.com/call.wav"}. | Use JSON body with {"url":...} for URL ingestion. Deepgram fetches the media server-side; the URL must be publicly fetchable (or signed). Verify the media server returns audio/* Content-Type and a reasonable Content-Length. Treat non-2xx fetch errors as a Deepgram 400 with details in body. | Pass / FailAi Platformhigh |
| 02 | Operator transcribes a 45-minute clinical interview and picks model=nova-3-medical. | Pick model=nova-3-medical for medical-domain audio (clinical vocabulary boost); pin the version. For general business audio use nova-3-general. Do not switch model per retry; benchmark on a held-out fixture before changing the default model in production. | Pass / FailAi Platformmedium |
| 03 | Operator submits batch with callback=https://ops.example.com/dg-hook to avoid long-polling. | Request returns immediately with a request_id; transcript is POSTed to the callback URL when ready. Implement the receiver as idempotent on request_id — Deepgram retries the callback on non-2xx responses [REQUIRES-VERIFICATION on retry count and backoff]. Sign or scope the receiver to defend agains… | Pass / FailAi Platformcritical |
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
Rubric criteria
- Deepgram
- Ai Platform
- Batch Stt Rest
Recommended for
Works with
Related evals
Run this eval in your workspace
Connect your data, configure thresholds, and review results with your team.