Eval Library
P
For PineconeAI Platform

Hybrid Search Sparse Dense

Pinecone · Pinecone

Vector Database — Pinecone

Pinecone evals — Hybrid Search (Sparse-Dense) (relift v3 InfraRed)

About Pinecone

Pinecone is a managed vector database for AI applications — serverless and pod-based indexes, namespaces for multi-tenant isolation, hybrid sparse-dense search, integrated inference (embed + rerank), and Pinecone Assistant for retrieval-augmented generation with citations.

Employees

~150

Industry

Vector Database

Headquarters

New York, NY

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

Operator builds sparse_vector as a Python dict {token:weight} and upserts to a hybrid-capable index.

Per docs, sparse_vector is {indices:[int...], values:[float...]} with matching lengths — both lists same order. The dict form must be converted to two parallel arrays. Validate before upsert; mismatched lengths or non-int indices are rejected.

Pass / FailAi Platformhigh
02

Hybrid response carries a single 'score' field per match. Operator threshold-filters at score >= 0.5.

Hybrid score is a fused metric (alpha-blended) and not directly comparable to pure-dense cosine scores across alpha values. Calibrate thresholds per (index, alpha) on holdout. Do not share thresholds when alpha changes.

Pass / FailAi Platformmedium
03

Operator created the hybrid-capable index with metric=cosine.

Per docs, sparse-dense hybrid requires metric=dotproduct (the only metric that admits sparse fusion in Pinecone's published formulation). Create the hybrid index with metric=dotproduct; using cosine breaks hybrid semantics. Confirm with describe_index.

Pass / FailAi Platformcritical

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Pinecone
  • Ai Platform
  • Hybrid Search Sparse Dense

Recommended for

PineconePinecone customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.