Workflow Determinism Replay
Temporal evals — Workflow Determinism & Replay (relift v3)
About Temporal
Temporal is a durable execution platform: Workflow code is replayed from its recorded Event History after a Worker failure or restart, so that code must be deterministic.
Sample tests · showing 3 of 6
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | TypeScript OrderWorkflow calls `Date.now()` to pick shipping tier; Worker logs NondeterminismError on replay after pod restart. Event History already recorded a timer branch. Docs require workflow.Now() or SideEffect. | Agent removes raw Date.now(), uses workflow.Now() (or SideEffect for one-time external clock read), adds replay test in CI, and redeploys Worker with same Task Queue. | Pass / Fail · Tool use · critical |
| 02 | Go BillingWorkflow reads a LaunchDarkly flag at start through a SideEffect wrapper; the flag must not be re-fetched on replay, since a changed value would change the branch. History shows an Activity scheduled later, so the flag choice must be pinned. | Agent wraps the external read in workflow.SideEffect (Go) or equivalent, stores the result in a workflow variable, and documents that Activities are for mutating side effects, not one-off reads. | Pass / Fail · Tool use · high |
| 03 | Java SLAWorkflow compares Instant.now() in the workflow method with an Activity-returned deadline; mismatch after DST jump. Agent should use Workflow.currentTimeMillis() for workflow-side comparisons. | Agent uses Workflow.currentTimeMillis() (Java; workflow.Now() in Go) on the workflow side, keeps wall-clock reads inside Activities only, and documents timer vs. Activity time boundaries. | Pass / Fail · Tool use · medium |
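
The replay behavior these tests exercise can be illustrated with a small toy model. The sketch below does not use any Temporal SDK; `ToyContext`, `side_effect`, `start_timer`, `schedule_activity`, and `run_then_replay` are hypothetical names invented for illustration, and the real SDK calls are the ones named in the table (for example `workflow.SideEffect` in Go). It assumes a simplified history in which each command is recorded in order and replay must reproduce the same sequence.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List, Tuple


class NondeterminismError(Exception):
    """Raised when replayed commands diverge from the recorded history."""


@dataclass
class ToyContext:
    """Hypothetical stand-in for a workflow context (not a Temporal SDK type)."""
    history: List[Tuple[str, Any]] = field(default_factory=list)
    replaying: bool = False
    _cursor: int = 0

    def _record_or_check(self, kind: str, value: Any = None) -> Any:
        if not self.replaying:
            # First execution: append the command to the history.
            self.history.append((kind, value))
            return value
        # Replay: the next recorded event must be of the same kind.
        if self._cursor >= len(self.history):
            raise NondeterminismError(f"no recorded event for {kind}")
        recorded_kind, recorded_value = self.history[self._cursor]
        self._cursor += 1
        if recorded_kind != kind:
            raise NondeterminismError(f"expected {recorded_kind}, got {kind}")
        return recorded_value

    def side_effect(self, fn: Callable[[], Any]) -> Any:
        # On replay, return the recorded value and never call fn again.
        if self.replaying:
            return self._record_or_check("side_effect")
        return self._record_or_check("side_effect", fn())

    def start_timer(self) -> None:
        self._record_or_check("timer")

    def schedule_activity(self, name: str) -> None:
        self._record_or_check("activity:" + name)


def unsafe_workflow(ctx: ToyContext, clock: Callable[[], int]) -> None:
    # Reads the clock directly, so the branch can differ between runs.
    if clock() < 100:
        ctx.start_timer()
    else:
        ctx.schedule_activity("ship_express")


def safe_workflow(ctx: ToyContext, clock: Callable[[], int]) -> None:
    # Pins the clock reading in history, so replay takes the same branch.
    now = ctx.side_effect(clock)
    if now < 100:
        ctx.start_timer()
    else:
        ctx.schedule_activity("ship_express")


def run_then_replay(workflow, first_clock, replay_clock) -> List[Tuple[str, Any]]:
    """Execute once, then replay against the recorded history."""
    first = ToyContext()
    workflow(first, first_clock)
    replay = ToyContext(history=list(first.history), replaying=True)
    workflow(replay, replay_clock)
    return first.history
```

Calling `run_then_replay(safe_workflow, lambda: 50, lambda: 500)` replays cleanly because the recorded reading is reused, while the same call with `unsafe_workflow` raises `NondeterminismError`: the replay tries to schedule an activity where the history recorded a timer. A replay test in CI, as test 01 expects, follows the same idea against real recorded histories.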
Rubric criteria
- Temporal
- Durable Execution
- Workflow Determinism Replay