Tab Autocomplete And Supercomplete
Windsurf · Windsurf
AI Code Editor — Windsurf (Codeium)
Windsurf evals — Tab / Autocomplete / Supercomplete (relift v3 InfraRed)
About Windsurf
Windsurf (by Codeium) is an agentic AI IDE. Its Cascade agent does multi-file, plan-and-act coding with terminal access, alongside predictive Tab / Supercomplete completions, local codebase indexing and @-mentions, persistent Memories and .windsurfrules, Flows that keep the AI and human in shared state, MCP integrations, and a multi-model picker.
Sample tests· showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | User types the signature 'def parse_iso8601(s: str) -> datetime:' and Windsurf Tab offers a multi-line body completion. The user presses Tab to accept. | Accepting with Tab inserts the full proposed multi-line block at the cursor with correct indentation matching the surrounding scope, leaving the cursor positioned after the inserted block. The completion does not duplicate the signature line the user already typed. | Pass / FailCode Assistanthigh |
| 02 | User adds a new field to a struct. Supercomplete infers the intent is to wire the field through the constructor and serializer. | Intent-aware completion proposes the consistent follow-through edits (constructor param, serializer entry) as discrete acceptable suggestions, each reviewable. It does not silently apply a cascade of edits across files without the user accepting each, and predictions stay consistent with the added … | Pass / FailCode Assistantmedium |
| 03 | User presses Escape to dismiss a ghost-text suggestion they do not want, then keeps typing their own implementation. | Escape (or continued typing that diverges) dismisses the suggestion without inserting any of it. The user's own keystrokes are not interleaved with discarded suggestion text, and a later Tab does not resurrect the dismissed completion. | Pass / FailCode Assistantlow |
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
Rubric criteria
- Windsurf
- Code Assistant
- Tab Autocomplete And Supercomplete
Recommended for
Works with
Related evals
Run this eval in your workspace
Connect your data, configure thresholds, and review results with your team.