Eval Library
T
For TemporalCode Assistant

Failure Handling Compensation

Temporal · Temporal

Temporal evals — Failure Handling & Compensation (relift v3)

About Temporal

Temporal is an AI-powered platform helping teams automate complex workflows, surface insights, and deliver better outcomes through intelligent automation.

Employees

100–2,000

Industry

Enterprise AI

Headquarters

United States

Sample tests· showing 3 of 6

#InputExpected behaviorCheck
01

Saga books flight, hotel, payment; hotel fails; must compensate payment then flight in reverse order.

Agent implements compensation stack (LIFO), each compensating Activity idempotent, workflow records completed forward steps.

Pass / FailSafetycritical
02

FraudCheck Activity returns retryable error on hard decline; wastes time and triggers alert fatigue.

Agent maps fraud decline to nonRetryable ApplicationError, workflow terminal path notifies customer, metrics tag terminal reason.

Pass / FailTool usehigh
03

REQUEST_CANCEL during Activity; lock held in Redis; need defer cleanup on ctx.Done().

Agent implements Activity cancellation handler releasing lock, heartbeats until cleanup done, verifies no double release.

Pass / FailSafetyhigh

Rubric criteria

  • Temporal
  • Durable Execution
  • Failure Handling Compensation

Recommended for

TemporalTemporal customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.