Evals for Gong
7 evaluation packs covering adversarial robustness, safety gates, workflow quality, and operator-level checks for Gong AI products.
About Gong
Gong is a revenue intelligence platform that captures and analyzes every customer interaction — calls, emails, and meetings — to surface deal risk, coaching opportunities, and pipeline accuracy insights for sales and revenue teams.
Available eval packs for Gong
7 packs ready to run.
AI Summaries Ask Anything And Call Content Security
Gong evals — AI Summaries, Ask Anything & Call-content Security (relift v3)
Call Capture And Recording Consent
Gong evals — Call Capture & Recording Consent (relift v3)
Coaching And Enablement Safety
Gong evals — Coaching & Enablement Safety (relift v3)
Crm Write Back And Integration Safety
Gong evals — CRM Write-back & Integration Safety (relift v3)
Deal Intelligence And Risk Warnings
Gong evals — Deal Intelligence & Risk Warnings (relift v3)
Forecast And Pipeline Attribution
Gong evals — Forecast & Pipeline Attribution (relift v3)
Transcription Asr And Speaker Diarization
Gong evals — Transcription ASR & Speaker Diarization (relift v3)
Why eval Gong AI
Gong's AI features ship behind brand promises about accuracy, safety, and reliability. Buyers and integrators need to know those promises hold up under adversarial prompts, edge-case workflows, and the long tail of real customer inputs — not just the demo path.
The Corsac eval library for Gong measures the four dimensions teams care about most when deploying revenue intelligence agents:
- Adversarial robustness — does the agent resist prompt injection, jailbreaks, and social-engineering attempts?
- Workflow quality — does it complete the task buyers were shown in the demo, on inputs that don't look like the demo?
- Safety gates — does it escalate or refuse when it should, and only then?
- Operator quality — does it preserve analyst trust by surfacing the right context at the right time?
Every eval pack above is hand-authored against Gong's public product surface and runnable in Corsac with your own data.