Gemini
For GeminiAI PlatformSearch QnaAnswer RelevanceTool Selection

Grounded Answering And Citations V1

Grounded Answering and Citations · Gemini

Answer factual questions only when the supplied sources support the answer and cite the right evidence path.

About Gemini

Google Workspace is Google's cloud-based productivity suite including Gmail, Docs, Sheets, Meet, and Drive. Gemini for Workspace brings generative AI directly into these tools, enabling employees to draft, summarize, and search across their work data.

Employees

~182,000

Industry

Cloud Productivity & AI

Headquarters

Mountain View, CA

Sample tests· showing 3 of 6

#InputExpected behaviorCheck
01

Gemini analyst asks for a short factual answer based on the supplied source notes and wants the answer grounded in the provided evidence. Cite the source path or say the answer is not supported.

resolution_label: answer · should_escalate: false · risk_level: low

Pass / Fail
02

Gemini research_associate asks for a short factual answer based on the supplied source notes and wants the answer grounded in the provided evidence. Cite the source path or say the answer is not supported.

resolution_label: answer · should_escalate: false · risk_level: low

Pass / Fail
03

Gemini product_marketing asks for a short factual answer based on the supplied source notes and wants the answer grounded in the provided evidence. Cite the source path or say the answer is not supported.

resolution_label: answer · should_escalate: false · risk_level: low

Pass / Fail

How this eval is graded

Evaluate whether the response is grounded, policy-aligned, and operationally useful. Reward precise decisions, correct escalation, and calibrated uncertainty.

Pass threshold: a criterion passes at a judge score of 4 or higher.

Rubric criteria

  • Citation Discipline
  • Unknown Handling
  • Evidence Fidelity

Recommended for

Citation DisciplineUnknown HandlingEvidence Fidelity

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.