Google Workspace
For Google Workspace · Document Agents

Calendar Conflicts V1

Calendar Conflict Resolver · Google Workspace

A Google Workspace calendar eval covering time-zone collisions, meeting conflicts, and scheduling policy.

About Google Workspace

Google Workspace is Google's cloud-based productivity suite including Gmail, Docs, Sheets, Meet, and Drive. Gemini for Workspace brings generative AI directly into these tools, enabling employees to draft, summarize, and search across their work data.

Employees: ~182,000
Industry: Cloud Productivity & AI
Headquarters: Mountain View, CA

Sample tests · showing 3 of 6

01 · multi timezone conflict
Input: An executive assistant needs to schedule a leadership sync across three time zones. Choose a slot that minimizes conflict and call out the local time for each attendee.
Expected behavior: resolution_label: schedule · conflict_free: true · timezone_callouts: true
Check: Pass / Fail

02 · focus block protection
Input: An engineering manager wants to protect a recurring focus block and asks the assistant to move lower-priority meetings around it. Preserve the recurring block and do not delete it.
Expected behavior: resolution_label: protect_focus_block · should_delete_recurring_block: false · priority: focus_block_first
Check: Pass / Fail

03 · reschedule with buffer
Input: A sales assistant has a customer call that overlaps another call by 10 minutes. Reschedule with a buffer and keep the customer-facing meeting as the priority.
Expected behavior: resolution_label: reschedule_with_buffer · should_overlap: false · priority: customer_call
Check: Pass / Fail
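
To make the expected behaviors in the sample tests concrete, here is a minimal Python sketch of two of the underlying calendar operations: per-attendee local-time callouts (test 01) and rescheduling a lower-priority meeting with a buffer (test 03). It is not how the eval or Google Calendar implements these. The function names, attendee names, IANA zones, and buffer length are all hypothetical, chosen only for illustration.

```python
from datetime import datetime, timedelta, timezone
from zoneinfo import ZoneInfo


def local_callouts(start_utc, attendee_zones):
    """Return each attendee's local start time, labeled with their IANA zone.

    `attendee_zones` maps a (hypothetical) attendee name to a zone name.
    """
    callouts = {}
    for name, zone in attendee_zones.items():
        local = start_utc.astimezone(ZoneInfo(zone))
        callouts[name] = f"{local:%Y-%m-%d %H:%M} ({zone})"
    return callouts


def overlaps(a_start, a_end, b_start, b_end, buffer=timedelta(0)):
    """True if the two meetings overlap or sit closer together than `buffer`."""
    return a_start < b_end + buffer and b_start < a_end + buffer


def reschedule_with_buffer(fixed_start, fixed_end, movable_start, movable_end, buffer):
    """Keep the priority meeting fixed and move the other one after it.

    Assumption: the lower-priority meeting is simply pushed to start
    `buffer` after the priority meeting ends, keeping its duration.
    A fuller resolver would also check the new slot against other events.
    """
    if not overlaps(fixed_start, fixed_end, movable_start, movable_end, buffer):
        return movable_start, movable_end  # already far enough apart
    duration = movable_end - movable_start
    new_start = fixed_end + buffer
    return new_start, new_start + duration
```

For example, a sync starting at 16:00 UTC on 2024-03-05 falls at 08:00 in America/Los_Angeles, 16:00 in Europe/London, and 01:00 the next day in Asia/Tokyo, which is exactly the kind of date rollover a time-zone callout should surface. With a customer call at 10:00 to 10:30 and a second call at 10:20 to 11:00, a 15-minute buffer moves the second call to 10:45 to 11:25.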

How this eval is graded

Evaluate whether the response is grounded, policy-aligned, and operationally useful. Reward accurate decisions, concise reasoning, and calibrated confidence.

Pass threshold: a criterion passes at a judge score of 4 or higher.

Rubric criteria

  • Conflict Resolution
  • Time Zone Discipline
  • Scheduling Policy

Recommended for

Conflict Resolution · Time Zone Discipline · Scheduling Policy
