Eval Library
P
For PerplexityAI PlatformAnswer Relevance

Citations And Source Grounding

Perplexity Sonar API · Perplexity

Grounded Answer API — Perplexity Sonar

Perplexity evals — Citations & Source Grounding (relift v3 InfraRed)

About Perplexity

Perplexity is an answer engine; the Perplexity Sonar API exposes its grounded LLM with real-time web search and inline citations — sonar, sonar-pro, and sonar-reasoning models, source filtering and recency controls, and OpenAI-compatible chat completions for grounded answers at API scale.

Employees

~200

Industry

Search / Answer API

Headquarters

San Francisco, CA

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

Operator omits return_citations on POST /chat/completions and expects the response.citations[] array to populate by default.

Set return_citations=true explicitly on every request where source attribution is required downstream. The default behavior should not be assumed [REQUIRES-VERIFICATION] across model versions — opt in by setting the flag and verify response.citations is present and non-empty before rendering.

Pass / FailAi Platformhigh
02

Assistant content contains markers like 'The deal closed in 2025 [1] for $40M [2].' citations=['https://nyt.com/...', 'https://wsj.com/...'].

[n] is 1-indexed and maps to citations[n-1]. Render [1] as a link to citations[0]. Preserve the bracketed marker if it is part of the model's anchoring; rewrite as a hyperlink only after confirming the index→URL mapping per request.

Pass / FailAi Platformcritical
03

Client renders citations[] progressively as it accumulates streaming deltas; expects citations to populate incrementally.

Per Sonar SSE, citations appear in the response body (non-streaming) or on the terminating chunk (streaming) — not progressively per delta. Render citations only after the stream completes or the terminal payload arrives. UI should show 'finding sources…' until the citation block lands.

Pass / FailAi Platformmedium

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Perplexity
  • Ai Platform
  • Citations And Source Grounding

Recommended for

Perplexity Sonar APIPerplexity customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.