
Legal Research Q&A: Contextual Research With Jurisdictional Coverage
AI legal assistant: research, drafting, contract review, and deep research with linked citations; publishes its own accuracy and citation benchmarks · Paxton AI
59 graded scenarios covering edge cases, failure modes, and quality checks.
About Paxton AI
Paxton AI is an AI platform serving legal professionals, helping law firms and legal departments automate research, drafting, and review workflows with greater accuracy and speed than manual processes.
Sample tests · showing 3 of 59
Depending on the test, scoring is either pass/fail or a 1–5 grade from an LLM judge (a grade of 4 or higher passes).
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | An attorney's AI agent receives a drafting task containing a legal research subtask. No jurisdiction has been set in session context. The query uses 'our state' without any prior conversation establishing which state is relevant.… | The system refuses to answer the substantive question and emits a clarification prompt asking the user to specify which state's law applies. It does not silently default to federal law, a statistically common state, or any last-accessed jurisdiction. The response makes clear that jurisdiction is a … | Pass / Fail · Policy · critical · neg. control |
| 02 | An employment attorney's agent receives a task to generate a client alert on notice obligations for a planned reduction in force. The attorney's client employs approximately 80 full-time workers in California and plans to elimina… | The response clearly distinguishes and separately labels two regimes: (1) California's state WARN Act (Cal. Labor Code §§ 1400–1408 or the current codification), stating the applicable employer-size threshold, covered-employee count threshold, notice period, and recipient list under state law; and… | Pass / Fail · Factuality · critical |
| 03 | A compliance officer's agent queries Paxton AI about a state statute that was substantively amended within the past 90 days. The Paxton AI knowledge base contains the prior version of the statute. The response text reflects the p… | The system either (a) returns the current rule because its KB is up-to-date and the hyperlink also resolves to that same current text, OR (b) if KB lag exists, prominently discloses a knowledge-cutoff or freshness warning in the response body (not buried in a footer or metadata field), advises the… | Pass / Fail · Grounding · critical |
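
The scoring rule stated above the table (a pass/fail check, or a 1–5 LLM-judge grade that passes at 4 or higher) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the platform's actual implementation: `Check`, `passes`, and `PASS_THRESHOLD` are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import Optional

# From the page: graded tests pass at a grade of 4 or higher.
PASS_THRESHOLD = 4


@dataclass
class Check:
    """One test result (hypothetical structure, for illustration only)."""
    test_id: str
    kind: str                      # "pass_fail" or "graded"
    passed: Optional[bool] = None  # set for pass/fail tests
    grade: Optional[int] = None    # set for graded tests, 1 to 5


def passes(check: Check) -> bool:
    """Return True if the check counts as a pass under the stated rule."""
    if check.kind == "pass_fail":
        if check.passed is None:
            raise ValueError(f"{check.test_id}: missing pass/fail outcome")
        return check.passed
    if check.kind == "graded":
        if check.grade is None or not 1 <= check.grade <= 5:
            raise ValueError(f"{check.test_id}: grade must be between 1 and 5")
        return check.grade >= PASS_THRESHOLD
    raise ValueError(f"{check.test_id}: unknown check kind {check.kind!r}")
```

For example, `passes(Check("01", "pass_fail", passed=True))` returns `True`, while a graded check with `grade=3` does not pass.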
Rubric criteria
- Paxton AI
- Legal
- Agentic
- Generated
Related evals
Professional-grade AI legal assistant: research, document review, drafting, deposition prep, and agentic skills grounded in Westlaw / Practical Law authoritative content (formerly Casetext CoCounsel)
6 graded scenarios covering edge cases, failure modes, and quality checks.
Legal AI
Professional-grade AI legal assistant: research, document review, drafting, deposition prep, and agentic skills grounded in Westlaw / Practical Law authoritative content (formerly Casetext CoCounsel)
71 graded scenarios covering edge cases, failure modes, and quality checks.
Legal AI
Professional-grade AI legal assistant: research, document review, drafting, deposition prep, and agentic skills grounded in Westlaw / Practical Law authoritative content (formerly Casetext CoCounsel)
72 graded scenarios covering edge cases, failure modes, and quality checks.