For Manifest OS · Legal AI · Doc Agent

AI Drafter Document Generation

Agent-first legal operations platform — matter intake, workflow orchestration, agentic review loops for legal teams · Manifest OS

13 graded scenarios covering edge cases, failure modes, and quality checks.

About Manifest OS

Manifest OS is an AI platform serving legal professionals, helping law firms and legal departments automate research, drafting, and review workflows with greater accuracy and speed than manual processes.

Employees: 50–500
Industry: Legal AI
Headquarters: United States

Sample tests · showing 3 of 13

Pass/fail checks, each adjudicated by an LLM judge.

Columns: # · Input · Expected behavior · Check

01
Input: The internal template library contains an I-129 form whose edition-date metadata field reads '01/17/17'. USCIS has released at least one newer edition and has explicitly withdrawn acceptance of prior editions within its published…
Expected behavior: Agent retrieves the I-129 template and reads its edition-date metadata field before committing to selection. Agent compares the stored edition date against a platform-maintained authoritative source (USCIS form-acceptance registry, compliance metadata feed, or equivalent). If the edition does not m…
Check: Pass / Fail · Policy · critical

02
Input: Intake shows: beneficiary is present in the US on valid F-1 OPT status, employer is sponsoring an EB-1A extraordinary-ability petition, and the beneficiary's priority date is immediately current per the current Visa Bulletin for …
Expected behavior: Agent selects I-140 EB-1A as the primary petition form and immediately surfaces a second mandatory decision point: (1) concurrent I-485 Adjustment of Status package (with companion forms I-131 Advance Parole, I-765 Employment Authorization Document, and applicable I-864 affidavit of support) versus…
Check: Pass / Fail · Workflow · critical

03
Input: A new matter is opened for a beneficiary changing employers and requesting H-1B work authorization. The current intake form records 'H-1B — new employer' with no further context on prior petition history. The matter registry cont…
Expected behavior: Agent queries the matter registry using beneficiary A-number A212345678 before loading any template. Query returns the prior approved I-129 H-1B from Meridian Software Inc. with receipt number and current validity. Agent identifies that the beneficiary is cap-counted and that the correct vehicle is…
Check: Pass / Fail · Grounding · critical
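
The edition-date check described in scenario 01 (read the stored edition date, then compare it against an authoritative source before selecting the template) could be sketched roughly as below. This is a minimal illustration, not Manifest OS's implementation: `EditionCheck`, `parse_edition`, `check_edition`, and the `registry` dictionary are hypothetical names, and the accepted edition date in the registry is a placeholder, not real USCIS data. What the agent should do on a mismatch is truncated in the listing above, so the sketch only reports whether the stored edition is accepted.

```python
from dataclasses import dataclass
from datetime import date, datetime


@dataclass(frozen=True)
class EditionCheck:
    """Outcome of comparing a template's stored edition date to a registry."""
    form: str
    stored_edition: date
    accepted: bool


def parse_edition(raw: str) -> date:
    """Parse an MM/DD/YY edition-date string such as '01/17/17'."""
    return datetime.strptime(raw, "%m/%d/%y").date()


def check_edition(
    form: str,
    raw_edition: str,
    accepted_editions: dict[str, set[date]],
) -> EditionCheck:
    """Report whether the stored edition is in the accepted set for the form.

    A form missing from the registry is treated as not accepted, so the
    check fails closed rather than silently passing.
    """
    stored = parse_edition(raw_edition)
    accepted = stored in accepted_editions.get(form, set())
    return EditionCheck(form=form, stored_edition=stored, accepted=accepted)


# Hypothetical registry snapshot; the date is a placeholder for illustration.
registry = {"I-129": {date(2030, 1, 1)}}

result = check_edition("I-129", "01/17/17", registry)
print(result.accepted)
```

Running the check before template selection keeps a stale edition from reaching the drafting step; an LLM judge grading this scenario would then look for evidence that the comparison happened before the agent committed to the form.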

Rubric criteria

  • Manifest OS
  • Legal
  • Agentic
  • Generated

Recommended for

Agent-first legal operations platform (matter intake, workflow orchestration, agentic review loops for legal teams) · Manifest OS customers
