
Lead Incident Command V1
Lead Incident Command · Zendesk
Zendesk support lead eval focused on high-volume queue command, escalation discipline, and response stability.
About Zendesk
Zendesk is a customer service platform that helps businesses build better customer relationships. Its AI-powered products handle billions of support interactions across email, chat, voice, and messaging, giving agents the context they need to resolve issues faster.
Sample tests· showing 3 of 100
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Review this workflow signal for `Coordinate high-volume spikes`: `Centralized queue visibility supports faster reassignment decisions during volume spikes.`. Should it be flagged as friction? | should_flag_painpoint: false · painpoint_label: none · severity: none | Pass / Fail |
| 02 | A support_lead reports: "When ticket volume is high, the interface feels cluttered and slower for agents triaging large queues." while using `Operate at high ticket volume`. Diagnose likely root cause. | should_flag_painpoint: true · painpoint_label: ui_clutter_under_load · severity: medium | Pass / Fail |
| 03 | Given this observation in `Operate at high ticket volume`: "When ticket volume is high, the interface feels cluttered and slower for agents triaging large queues.", decide whether to flag a pain point. | should_flag_painpoint: true · painpoint_label: ui_clutter_under_load · severity: medium | Pass / Fail |
Rubric criteria
- Incident Prioritization
- Escalation Orchestration
- Response Stability
Recommended for
Works with
Related evals
Agentic AI for enterprise customer support (deflection, resolution, escalation, tool-use against connected systems)
61 graded scenarios covering edge cases, failure modes, and quality checks.
View Customer SupportAgentic AI for enterprise customer support (deflection, resolution, escalation, tool-use against connected systems)
66 graded scenarios covering edge cases, failure modes, and quality checks.
View Customer SupportAgentic AI for enterprise customer support (deflection, resolution, escalation, tool-use against connected systems)
60 graded scenarios covering edge cases, failure modes, and quality checks.
ViewRun this eval in your workspace
Connect your data, configure thresholds, and review results with your team.