For Retell AIAI Platform

Retell Webhooks And Post Call

Retell AI · Retell AI

Voice AI Agents — Retell AI

Evaluates Retell AI's Webhooks & Post-call across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Voice AI Agents eval coverage.

About Retell AI

Retell AI is a platform for building production phone-call voice agents — pairing a conversation engine with telephony, low-latency turn-taking, interruption handling, mid-call functions, post-call analysis, and batch outbound dialing.

Employees

~40

Industry

Voice AI Agents

Headquarters

San Francisco, CA

Website

retellai.com

Sample tests· showing 3 of 9

#	Input	Expected behavior	Check
01	The operator's webhook endpoint processes call_ended payloads without checking the x-retell-signature header, so anyone can POST forged events.	Verify the x-retell-signature header on every inbound webhook (Retell provides a verify helper in its SDKs) before trusting the payload; reject unsigned/invalid requests. Treat the endpoint as public and the signature as the trust boundary.	Pass / FailAi Platformcritical
02	The webhook handler reads the call object but ignores the top-level event field, treating call_started and call_ended the same way.	Branch on the top-level event ('call_started' \| 'call_ended' \| 'call_analyzed' \| 'transcript_updated') and only read fields valid for that event — e.g., post-call analysis is present on call_analyzed, not call_started. Do not assume every payload carries every field.	Pass / FailAi Platformhigh
03	The integrator expects all data on call_ended and finalizes the record there, but post-call analysis only arrives later on call_analyzed.	Expect call_analyzed after call_ended (analysis is computed post-call) and update the record when it arrives — do not finalize/close before call_analyzed if you need sentiment/disposition/summary. Key both events on call_id.	Pass / FailAi Platformhigh
Unlock full benchmark 6 more test cases Use this benchmark

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

Retell
Ai Platform
Webhooks And Post Call

Recommended for

Retell AIRetell AI customers

Works with

Retell AI

Related evals

AI Platform

Claude API

Evaluates Anthropic's Batch API across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Foundation Model & API eval coverage.

View AI Platform

Claude API

Evaluates Anthropic's Extended Thinking across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Foundation Model & API eval coverage.

View AI Platform

Claude API

Evaluates Anthropic's Files API & Citations across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Foundation Model & API eval coverage.

View

Frequently asked questions

What does the Retell Webhooks And Post Call eval for Retell AI Retell AI test?+

Evaluates Retell AI's Webhooks & Post-call across 9 scenario-based test cases, each graded against an expected-behavior rubric by an LLM judge, from Corsac's Voice AI Agents eval coverage.

How is the Retell Webhooks And Post Call eval scored?+

The judge rubric: Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

How many test cases does this eval pack include?+

The Retell Webhooks And Post Call pack for Retell AI Retell AI contains 9 test cases. 3 sample cases are shown free on this page; the full set runs in a Corsac workspace.

How do I run this eval?+

Sign up for Corsac, connect your model or agent endpoint, and run the Retell Webhooks And Post Call pack as-is or after customizing thresholds. Results land in your workspace with per-case scores, and you can gate releases on the pack in CI via the REST API.

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.