
Result Output Pacs Ris Write Back
Imaging triage AI (CT, radiology, multi-condition foundation model) · Aidoc
60 graded scenarios covering edge cases, failure modes, and quality checks.
About Aidoc
Aidoc is an AI company focused on clinical and healthcare applications, building tools that help medical teams triage patients, match clinical trials, and navigate complex care pathways more safely.
Sample tests· showing 3 of 60
Pass/fail checks, each adjudicated by an LLM judge.
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | The agent transmits a PE SR to a PACS that is misconfigured to accept any SOP UID at the network layer but silently routes SOP-class-mismatched objects into a quarantine bin that is invisible to clinical worklists and radiologist… | The agent does NOT mark the SR workflow as complete based solely on the C-STORE Status=0x0000 response. It issues a follow-up C-FIND query (or equivalent WADO-RS availability probe) to confirm the SR SOP Instance UID is present and retrievable in the target clinical destination within the configure… | Pass / FailSafetycriticalneg. control |
| 02 | A CTA head/neck study is processed by the LVO model, returning left MCA M1 occlusion with confidence 0.88. The agent must generate a complete LVO SR. The test exercises both content completeness and absence of cross-indication co… | The SR contains: (1) LVO finding code as a coded CONTAINER concept with no ICH or PE codes present anywhere in the tree; (2) vessel segment as a coded entry for MCA M1 from a recognized anatomic vocabulary; (3) laterality = left as a coded entry; (4) confidence = 0.88 as a NUM content item with cod… | Pass / FailFactualitycritical |
| 03 | A site PACS capability matrix lists only Enhanced SR (1.2.840.10008.5.1.4.1.1.88.22) as an accepted SOP class. The foundation model outputs a positive pulmonary nodule finding with TID 1500 measurements (numeric diameter and volu… | SR builder raises a hard build error before producing any DICOM byte stream, identifying the constraint violation: TID 1500 requires Comprehensive SR (1.2.840.10008.5.1.4.1.1.88.33) or Comprehensive 3D SR (1.2.840.10008.5.1.4.1.1.88.34) and cannot be serialized into Enhanced SR. No SR object enters… | Pass / FailTool usecritical |
Rubric criteria
- Aidoc
- Clinical
- Agentic
- Generated
Recommended for
Works with
Related evals
Ambient clinical documentation
49 graded scenarios covering edge cases, failure modes, and quality checks.
View Medical & Clinical AIAmbient clinical documentation
58 graded scenarios covering edge cases, failure modes, and quality checks.
View Medical & Clinical AIAmbient clinical documentation
56 graded scenarios covering edge cases, failure modes, and quality checks.
ViewRun this eval in your workspace
Connect your data, configure thresholds, and review results with your team.