Embed
Cohere API · Cohere
Foundation Model & API — Cohere
Cohere evals — Embed (relift v3 InfraRed)
About Cohere
Cohere builds enterprise foundation models and the tools around them — the Command model family, best-in-class Rerank and Embed endpoints, and grounded retrieval-augmented generation with inline citations — deployable across major clouds and private VPCs.
Sample tests· showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Agent embeds corpus passages and user queries with the same input_type, then wonders why retrieval recall is poor. | Set input_type='search_document' when embedding corpus passages for indexing and input_type='search_query' when embedding the query at search time. The asymmetric input_type is required for the retrieval embedding space to align. | Pass / FailAi Platformcritical |
| 02 | A downstream classifier is trained on embeddings produced with input_type='search_document', then serves with input_type='classification'. | Use input_type='classification' for classification features and input_type='clustering' for clustering, and keep the same input_type at train and inference time. Mixing input_types between training and serving shifts the embedding distribution and degrades the model. | Pass / FailAi Platformhigh |
| 03 | Documents longer than the embed model's max token length are sent whole; the agent assumes the full document is represented in the vector. | Chunk documents to the embed model's documented max token length and embed chunks (storing chunk→parent mapping) rather than relying on silent truncation that drops the tail. Exact max length is [REQUIRES-VERIFICATION] per model version. | Pass / FailAi Platformmedium |
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
Rubric criteria
- Cohere
- Ai Platform
- Embed
Recommended for
Works with
Related evals
Run this eval in your workspace
Connect your data, configure thresholds, and review results with your team.