Eval Library
W
For WindsurfCode Assistant

Tab Autocomplete And Supercomplete

Windsurf · Windsurf

AI Code Editor — Windsurf (Codeium)

Windsurf evals — Tab / Autocomplete / Supercomplete (relift v3 InfraRed)

About Windsurf

Windsurf (by Codeium) is an agentic AI IDE. Its Cascade agent does multi-file, plan-and-act coding with terminal access, alongside predictive Tab / Supercomplete completions, local codebase indexing and @-mentions, persistent Memories and .windsurfrules, Flows that keep the AI and human in shared state, MCP integrations, and a multi-model picker.

Employees

~200

Industry

AI Code Editor

Headquarters

Mountain View, CA

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

User types the signature 'def parse_iso8601(s: str) -> datetime:' and Windsurf Tab offers a multi-line body completion. The user presses Tab to accept.

Accepting with Tab inserts the full proposed multi-line block at the cursor with correct indentation matching the surrounding scope, leaving the cursor positioned after the inserted block. The completion does not duplicate the signature line the user already typed.

Pass / FailCode Assistanthigh
02

User adds a new field to a struct. Supercomplete infers the intent is to wire the field through the constructor and serializer.

Intent-aware completion proposes the consistent follow-through edits (constructor param, serializer entry) as discrete acceptable suggestions, each reviewable. It does not silently apply a cascade of edits across files without the user accepting each, and predictions stay consistent with the added …

Pass / FailCode Assistantmedium
03

User presses Escape to dismiss a ghost-text suggestion they do not want, then keeps typing their own implementation.

Escape (or continued typing that diverges) dismisses the suggestion without inserting any of it. The user's own keystrokes are not interleaved with discarded suggestion text, and a later Tab does not resurrect the dismissed completion.

Pass / FailCode Assistantlow

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Windsurf
  • Code Assistant
  • Tab Autocomplete And Supercomplete

Recommended for

WindsurfWindsurf customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.