Eval directory
Evals for WorkOS
6 evaluation packs covering adversarial robustness, safety gates, workflow quality, and operator-level checks for WorkOS AI products.
About WorkOS
WorkOS is a developer platform that adds enterprise features to B2B applications, including single sign-on (SAML and OIDC), SCIM directory sync, audit logs, multi-factor authentication, and a self-serve Admin Portal.
Available eval packs for WorkOS
6 packs ready to run.
Admin Portal Domains Orgs
WorkOS evals — Admin Portal, Domains & Organizations (relift v3 InfraRed)
Audit Logs Event Integrity
WorkOS evals — Audit Logs & Event Integrity (relift v3 InfraRed)
MFA Factor Recovery
WorkOS evals — MFA & Factor Recovery (relift v3 InfraRed)
OIDC OAuth Session Tokens
WorkOS evals — OIDC, OAuth & Session Tokens (relift v3 InfraRed)
SAML SSO Assertion Security
WorkOS evals — SAML SSO & Assertion Security (relift v3 InfraRed)
SCIM Directory Sync
WorkOS evals — SCIM Directory Sync & Deprovisioning (relift v3 InfraRed)
Why eval WorkOS AI
WorkOS's AI features ship behind brand promises about accuracy, safety, and reliability. Buyers and integrators need to know those promises hold up under adversarial prompts, edge-case workflows, and the long tail of real customer inputs — not just the demo path.
The Corsac eval library for WorkOS measures the four dimensions teams care about most when deploying AI agents:
- Adversarial robustness: does the agent resist prompt injection, jailbreaks, and social-engineering attempts?
- Workflow quality: does it complete the task buyers were shown in the demo, on inputs that don't look like the demo?
- Safety gates: does it escalate or refuse when it should, and only then?
- Operator quality: does it preserve analyst trust by surfacing the right context at the right time?
Every eval pack above is hand-authored against WorkOS's public product surface and runnable in Corsac with your own data.