R
For ReductoAI Platform

Split Classification And Segmentation

Reducto · Reducto

Document Ingestion & Parsing for AI — Reducto

Reducto evals — Split (Classification & Segmentation) (relift v3 InfraRed)

About Reducto

Reducto is a document ingestion platform for AI pipelines that turns complex documents (PDFs, scans, spreadsheets) into clean, structured, layout-aware data. Its API parses documents into Markdown and typed content blocks, extracts structured fields against a user-defined schema with source citations, and splits bundled files into their constituent documents — feeding retrieval-augmented generation and document-automation workflows.

Employees

~50 (approx — verify)

Industry

Document AI / Data Ingestion

Headquarters

San Francisco, CA (verify)

Website

reducto.ai

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

A single uploaded PDF is actually a bundle: a cover letter, an invoice, and a W-9 concatenated. The integrator calls /split to segment it into constituent documents but assumes a fixed three-way split for every bundle.

Use /split to detect document boundaries within the bundle and return per-segment page ranges + classifications, treating the segment count as data-driven (not a fixed assumption). Carry each segment's page range forward so downstream /parse or /extract operates on the correct sub-document. Verify …

Pass / FailAi Platformhigh
02

Integrator configures /split with a set of category labels for classification but provides vague, overlapping labels ('document', 'paperwork', 'other').

Provide a mutually-exclusive, well-described label set aligned to downstream routing (e.g., 'invoice', 'contract', 'tax_form'), with an explicit 'unknown' bucket for low-confidence cases. Overlapping labels make classification non-deterministic and unroutable. Measure per-label precision/recall on …

Pass / FailAi Platformmedium
03

A scanned bundle includes some pages rotated 90/180 degrees (fed sideways through a scanner). The integrator assumes all pages are upright and segmentation/classification degrades.

Rely on Reducto's orientation handling (de-skew / rotation detection) rather than assuming upright input, and spot-check segmentation on rotated samples. If orientation handling is not documented as automatic, pre-normalize orientation before submit. Treat automatic de-skew/rotation as [REQUIRES-VE…

Pass / FailAi Platformlow

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Reducto
  • Ai Platform
  • Split Classification And Segmentation

Recommended for

ReductoReducto customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.