Eval Library
C
For CohereAI Platform

Embed

Cohere API · Cohere

Foundation Model & API — Cohere

Cohere evals — Embed (relift v3 InfraRed)

About Cohere

Cohere builds enterprise foundation models and the tools around them — the Command model family, best-in-class Rerank and Embed endpoints, and grounded retrieval-augmented generation with inline citations — deployable across major clouds and private VPCs.

Employees

~400

Industry

Foundation Model

Headquarters

Toronto, Canada

Website

cohere.com

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

Agent embeds corpus passages and user queries with the same input_type, then wonders why retrieval recall is poor.

Set input_type='search_document' when embedding corpus passages for indexing and input_type='search_query' when embedding the query at search time. The asymmetric input_type is required for the retrieval embedding space to align.

Pass / FailAi Platformcritical
02

A downstream classifier is trained on embeddings produced with input_type='search_document', then serves with input_type='classification'.

Use input_type='classification' for classification features and input_type='clustering' for clustering, and keep the same input_type at train and inference time. Mixing input_types between training and serving shifts the embedding distribution and degrades the model.

Pass / FailAi Platformhigh
03

Documents longer than the embed model's max token length are sent whole; the agent assumes the full document is represented in the vector.

Chunk documents to the embed model's documented max token length and embed chunks (storing chunk→parent mapping) rather than relying on silent truncation that drops the tail. Exact max length is [REQUIRES-VERIFICATION] per model version.

Pass / FailAi Platformmedium

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Cohere
  • Ai Platform
  • Embed

Recommended for

Cohere APICohere customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.