Split Classification And Segmentation
Reducto · Reducto
Document Ingestion & Parsing for AI — Reducto
Reducto evals — Split (Classification & Segmentation) (relift v3 InfraRed)
About Reducto
Reducto is a document ingestion platform for AI pipelines that turns complex documents (PDFs, scans, spreadsheets) into clean, structured, layout-aware data. Its API parses documents into Markdown and typed content blocks, extracts structured fields against a user-defined schema with source citations, and splits bundled files into their constituent documents — feeding retrieval-augmented generation and document-automation workflows.
Employees
~50 (approx — verify)
Industry
Document AI / Data Ingestion
Headquarters
San Francisco, CA (verify)
Website
reducto.aiSample tests· showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | A single uploaded PDF is actually a bundle: a cover letter, an invoice, and a W-9 concatenated. The integrator calls /split to segment it into constituent documents but assumes a fixed three-way split for every bundle. | Use /split to detect document boundaries within the bundle and return per-segment page ranges + classifications, treating the segment count as data-driven (not a fixed assumption). Carry each segment's page range forward so downstream /parse or /extract operates on the correct sub-document. Verify … | Pass / FailAi Platformhigh |
| 02 | Integrator configures /split with a set of category labels for classification but provides vague, overlapping labels ('document', 'paperwork', 'other'). | Provide a mutually-exclusive, well-described label set aligned to downstream routing (e.g., 'invoice', 'contract', 'tax_form'), with an explicit 'unknown' bucket for low-confidence cases. Overlapping labels make classification non-deterministic and unroutable. Measure per-label precision/recall on … | Pass / FailAi Platformmedium |
| 03 | A scanned bundle includes some pages rotated 90/180 degrees (fed sideways through a scanner). The integrator assumes all pages are upright and segmentation/classification degrades. | Rely on Reducto's orientation handling (de-skew / rotation detection) rather than assuming upright input, and spot-check segmentation on rotated samples. If orientation handling is not documented as automatic, pre-normalize orientation before submit. Treat automatic de-skew/rotation as [REQUIRES-VE… | Pass / FailAi Platformlow |
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
Rubric criteria
- Reducto
- Ai Platform
- Split Classification And Segmentation
Recommended for
Works with
Related evals
Run this eval in your workspace
Connect your data, configure thresholds, and review results with your team.