Evals for Reactor

2 evaluation packs covering adversarial robustness, safety gates, workflow quality, and operator-level checks for Reactor AI products.

Medical & Clinical AI

About Reactor

Reactor is the developer platform for real-time generative video and world models. Its unified SDK and API let developers build and stream real-time interactive applications over a globally distributed serverless GPU network, pulling from a catalog of frontier models (Matrix-2, SANA-WM, SANA-Streaming) or bringing their own, with sub-50ms frame delivery. Founded in 2025 by former Apple Vision Pro leads, Reactor emerged from stealth in May 2026 with a $59M Series A led by Lightspeed Venture Partners, with AWS as its preferred cloud partner.

Employees: ~30 (est.)
Industry: Real-Time AI Video / World-Model Infrastructure
Headquarters: San Francisco, CA
Website: www.reactor.inc

Available eval packs for Reactor

2 packs ready to run.

Why eval Reactor AI

Reactor's AI features ship behind brand promises about accuracy, safety, and reliability. Buyers and integrators need to know those promises hold up under adversarial prompts, edge-case workflows, and the long tail of real customer inputs — not just the demo path.

The Corsac eval library for Reactor measures the four dimensions teams care about most when deploying medical & clinical AI agents:

Adversarial robustness: does the agent resist prompt injection, jailbreaks, and social-engineering attempts?
Workflow quality: does it complete the task buyers were shown in the demo, on inputs that don't look like the demo?
Safety gates: does it escalate or refuse when it should, and only then?
Operator quality: does it preserve analyst trust by surfacing the right context at the right time?

Every eval pack above is hand-authored against Reactor's public product surface and runnable in Corsac with your own data.