Failure Handling & Compensation
Temporal evals — Failure Handling & Compensation (relift v3)
About Temporal
Temporal is a durable execution platform: Workflows orchestrate Activities, and the platform persists workflow state and handles retries and timeouts so that long-running processes survive failures.
Sample tests · showing 3 of 6
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Saga books flight, hotel, payment; hotel fails; must compensate payment then flight in reverse order. | Agent implements compensation stack (LIFO), each compensating Activity idempotent, workflow records completed forward steps. | Pass / Fail · Safety · critical |
| 02 | FraudCheck Activity returns retryable error on hard decline; wastes time and triggers alert fatigue. | Agent maps fraud decline to nonRetryable ApplicationError, workflow terminal path notifies customer, metrics tag terminal reason. | Pass / Fail · Tool use · high |
| 03 | REQUEST_CANCEL during Activity; lock held in Redis; need defer cleanup on ctx.Done(). | Agent implements Activity cancellation handler releasing lock, heartbeats until cleanup done, verifies no double release. | Pass / Fail · Safety · high |
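
The compensation stack expected in test 01 can be illustrated with a minimal, SDK-free Go sketch. It is not Temporal SDK code and not the eval's reference solution: the `step` type and the `runSaga` and `tripSteps` helpers are hypothetical names introduced here, and in a real workflow each forward and compensating function would be an Activity invocation. The sketch also assumes the forward order is flight, payment, hotel, so that a hotel failure leaves payment and flight to undo, in that order.

```go
package main

import (
	"errors"
	"fmt"
)

// step pairs a forward action with the compensation that undoes it.
// (Hypothetical type for illustration; not part of any SDK.)
type step struct {
	name       string
	forward    func() error
	compensate func() error
}

// runSaga executes steps in order and records each completed forward step.
// On the first failure it runs the recorded compensations in reverse (LIFO)
// order and returns the names of the compensated steps plus the error.
func runSaga(steps []step) ([]string, error) {
	var completed []step // forward steps that succeeded so far
	var compensated []string
	for _, s := range steps {
		err := s.forward()
		if err == nil {
			completed = append(completed, s)
			continue
		}
		// Unwind: the most recently completed step is compensated first.
		for i := len(completed) - 1; i >= 0; i-- {
			c := completed[i]
			if cerr := c.compensate(); cerr != nil {
				err = fmt.Errorf("%w; compensate %s failed: %v", err, c.name, cerr)
				continue
			}
			compensated = append(compensated, c.name)
		}
		return compensated, err
	}
	return nil, nil
}

// tripSteps builds the assumed flight -> payment -> hotel saga.
// The hotel step always fails to trigger compensation.
func tripSteps(booked map[string]bool) []step {
	book := func(name string) func() error {
		return func() error { booked[name] = true; return nil }
	}
	// cancel is idempotent: deleting a key that is already gone is a no-op.
	cancel := func(name string) func() error {
		return func() error { delete(booked, name); return nil }
	}
	failHotel := func() error { return errors.New("hotel unavailable") }
	return []step{
		{"flight", book("flight"), cancel("flight")},
		{"payment", book("payment"), cancel("payment")},
		{"hotel", failHotel, cancel("hotel")},
	}
}

func main() {
	booked := map[string]bool{}
	compensated, err := runSaga(tripSteps(booked))
	fmt.Println(compensated, err) // prints: [payment flight] hotel unavailable
}
```

Recording a step only after its forward action succeeds means the failed hotel booking is never compensated, and making each compensation a no-op on repeat keeps the unwind safe if it is retried.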
Rubric criteria
- Temporal
- Durable Execution
- Failure Handling Compensation
Run this eval in your workspace
Connect your data, configure thresholds, and review results with your team.