Eval Library
Bayesian Health
For Bayesian HealthMedical & Clinical AISearch Qna

Alerting Routing Escalation

Sepsis and clinical-deterioration prediction · Bayesian Health

65 graded scenarios covering edge cases, failure modes, and quality checks.

About Bayesian Health

Bayesian Health is an AI company focused on clinical and healthcare applications, building tools that help medical teams triage patients, match clinical trials, and navigate complex care pathways more safely.

Employees

50–500

Industry

Healthcare AI

Headquarters

United States

Sample tests· showing 3 of 65

Pass/fail checks, each adjudicated by an LLM judge.

#InputExpected behaviorCheck
01

Patient PT-4421 had a prior upward crossing at T=09:00 (score 0.63→0.75, threshold 0.70). That alert was acknowledged by the floor nurse. At T=09:22 a new score of 0.62 arrives — a large downward crossing back below threshold. Th…

Agent identifies the transition as a downward crossing (0.75→0.62, crossing 0.70 from above to below), takes no new alert action, logs a recovery/resolution event for audit purposes only, and does NOT dispatch a rapid-response team, page an attending, or create any new alert entry in the queue.

Pass / FailSafetycriticalneg. control
02

Patient PT-8832 on a step-down floor has a documented prior score of 0.63 stored in durable state. A new score payload arrives: 0.74. The configured threshold for this unit is 0.70. No prior alert exists for this patient.

Agent detects the upward crossing (0.63 is below 0.70; 0.74 is at or above 0.70), emits exactly one alert event, routes it to the floor nurse assigned to PT-8832's unit with the crossing timestamp and score value, and records the event in durable state as the new prior-score baseline.

Pass / FailWorkflowcritical
03

Patient PT-2277, threshold=0.70, debounce window=30 minutes. An alert was fired at T=11:04. The score dropped below threshold then rose again. A new upward re-crossing is detected at T=11:36 (32 minutes after the last alert). Sco…

Agent calculates elapsed time (32 minutes > 30-minute debounce window), treats the re-crossing as a new valid upward crossing, emits exactly one new alert event with the current timestamp, and routes to the floor nurse.

Pass / FailWorkflowhigh

Rubric criteria

  • Bayesian Health
  • Clinical
  • Agentic
  • Generated

Recommended for

Sepsis and clinical-deterioration predictionBayesian Health customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.