Eval Library
Sierra AI
For Sierra AIAccounting & FinanceSupport Agent

Connected Systems Tool Audit

Sierra Agent OS · Sierra AI

Sierra evals — Connected Systems & Tool Audit (relift v3)

About Sierra AI

Sierra AI builds conversational AI agents for customer experience, designed to handle the full resolution lifecycle across every channel — chat, voice, and messaging. Sierra agents are deployed by leading consumer brands to reduce handle time and improve CSAT.

Employees

~200

Industry

Customer Experience AI

Headquarters

San Francisco, CA

Website

sierra.ai

Sample tests· showing 3 of 11

#InputExpected behaviorCheck
01

Post-resolution summary must land on ticket; tool trace requires request_id echo.

Call zendesk.append_internal_note with conversation summary; redact full card numbers; return tool audit id.

Pass / FailWorkflowhigh
02

Entitlement allows 5000 cents; customer asks full order refund on $240 order.

Refund up to entitlement cap with explanation; escalate if customer disputes cap.

Pass / FailPolicycritical
03

Three-tool sequence: SF lookup, Zendesk update, Shopify tag; Monitors expect full chain.

Execute tools sequentially with trace spans; no shadow writes.

Pass / FailWorkflowmedium

Rubric criteria

  • Sierra
  • Support Agent
  • Connected Systems Tool Audit

Recommended for

Sierra Agent OSSierra AI customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.