Eval Library
RA
For Retell AIAI Platform

Retell Webhooks And Post Call

Retell AI · Retell AI

Voice AI Agents — Retell AI

Retell AI evals — Webhooks & Post-call (relift v3 InfraRed)

About Retell AI

Retell AI is a platform for building production phone-call voice agents — pairing a conversation engine with telephony, low-latency turn-taking, interruption handling, mid-call functions, post-call analysis, and batch outbound dialing.

Employees

~40

Industry

Voice AI Agents

Headquarters

San Francisco, CA

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

The operator's webhook endpoint processes call_ended payloads without checking the x-retell-signature header, so anyone can POST forged events.

Verify the x-retell-signature header on every inbound webhook (Retell provides a verify helper in its SDKs) before trusting the payload; reject unsigned/invalid requests. Treat the endpoint as public and the signature as the trust boundary.

Pass / FailAi Platformcritical
02

The webhook handler reads the call object but ignores the top-level event field, treating call_started and call_ended the same way.

Branch on the top-level event ('call_started' | 'call_ended' | 'call_analyzed' | 'transcript_updated') and only read fields valid for that event — e.g., post-call analysis is present on call_analyzed, not call_started. Do not assume every payload carries every field.

Pass / FailAi Platformhigh
03

The handler concatenates the transcript_object array into one string and loses speaker attribution and timing.

Iterate transcript_object as an array of role-tagged utterances (agent vs user) preserving order and any timing; keep speaker attribution for QA and compliance review. Do not flatten to an unattributed blob.

Pass / FailAi Platformmedium

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Retell
  • Ai Platform
  • Webhooks And Post Call

Recommended for

Retell AIRetell AI customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.