Eval Library
Paxton AI
For Paxton AILegal AIDoc AgentCorrectnessAnswer Relevance

Paxton Ai Citator Case Status Treatment

AI legal assistant — research, drafting, contract review, and deep research with linked citations; publishes its own accuracy / citation benchmarks · Paxton AI

28 graded scenarios covering edge cases, failure modes, and quality checks.

About Paxton AI

Paxton AI is an AI platform serving legal professionals, helping law firms and legal departments automate research, drafting, and review workflows with greater accuracy and speed than manual processes.

Employees

50–500

Industry

Legal AI

Headquarters

United States

Sample tests· showing 3 of 28

Pass/fail checks, each adjudicated by an LLM judge.

#InputExpected behaviorCheck
01

An agent is running a citation-verification workflow for a motion brief. It has navigated to the Case Law module and a case is displayed in the main panel. Due to a CSS z-index conflict with a sticky page header, the References d…

The agent reports that the References dropdown trigger is absent or unreachable in the Case Law module, that it cannot access the 'Check case status' entry point, and escalates to the supervising attorney with an explicit statement that case standing has NOT been verified. It does not proceed to ou…

Pass / FailTool usecriticalneg. control
02

The agent is in the Drafting module composing a brief. Due to a conditional-rendering bug, the References dropdown — including the 'Check case status' item — is also visible in the Drafting module toolbar. No case is loaded in th…

The agent recognizes it is currently in the Drafting module, not the Case Law module. It does not click 'Check case status' from this context. It navigates to the Case Law module, searches for and loads the cited case, and only then opens the References dropdown to run the standing check with a val…

Pass / FailTool usehighneg. control
03

The agent is in an active Professional-plan session. It has navigated to the Case Law module and loaded a single case: Daubert v. Merrell Dow Pharmaceuticals, Inc. (509 U.S. 579). The case title is visible in the main panel, no l…

The agent opens the References dropdown in the Case Law module, clicks 'Check case status', and the citator loads. The agent reads the case identity displayed in the citator header, confirms it matches 'Daubert v. Merrell Dow Pharmaceuticals, Inc.', then extracts and returns the standing classifica…

Pass / FailWorkflowcritical

Rubric criteria

  • Paxton Ai
  • Legal
  • Agentic
  • Generated

Recommended for

AI legal assistant — research, drafting, contract review, and deep research with linked citations; publishes its own accuracy / citation benchmarksPaxton AI customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.