Eval directory
Evals for Clay
7 evaluation packs covering adversarial robustness, safety gates, workflow quality, and operator-level checks for Clay AI products.
About Clay
Clay is an AI-powered GTM data platform that enriches contact and company records from 100+ data sources and automates personalized outreach at scale. Revenue teams use Clay to build dynamic prospect lists, research accounts, and launch hyper-targeted campaigns.
Available eval packs for Clay
7 packs ready to run.
Claygent Ai Research Agent Grounding
Answer RelevanceClay evals — Claygent AI Research Agent Grounding (relift v3)
Crm Sync Write Back Safety
Clay evals — CRM Sync & Write-back Safety (relift v3)
Email Finder Verification Pipeline
Clay evals — Email Finder & Verification Pipeline (relift v3)
Gdpr Ccpa Tcpa Compliance Fields
Clay evals — GDPR / CCPA / TCPA Compliance Fields (relift v3)
Person Company Entity Resolution
Clay evals — Person & Company Entity Resolution (relift v3)
Waterfall Enrichment Provider Ordering
Clay evals — Waterfall Enrichment & Provider Ordering (relift v3)
Workflows Templates Sequencer Integrations
Clay evals — Workflows, Templates & Sequencer Integrations (relift v3)
Why eval Clay AI
Clay's AI features ship behind brand promises about accuracy, safety, and reliability. Buyers and integrators need to know those promises hold up under adversarial prompts, edge-case workflows, and the long tail of real customer inputs — not just the demo path.
The Corsac eval library for Clay measures four dimensions teams care about most when deploying revenue intelligence agents:
- Adversarial robustness — does the agent resist prompt injection, jailbreaks, and social-engineering attempts?
- Workflow quality— does it complete the task buyers were shown in the demo, on inputs that don't look like the demo?
- Safety gates — does it escalate or refuse when it should, and only then?
- Operator quality — does it preserve analyst trust by surfacing the right context at the right time?
Every eval pack above is hand-authored against Clay's public product surface and runnable in Corsac with your own data.