Eval Library
Cursor
For CursorCode Assistant

Mcp Integration

Cursor · Cursor

AI Code Editor — Cursor

Cursor evals — MCP Integration (relift v3)

About Cursor

Cursor is an AI code editor built on VS Code: predictive Tab completion, inline edits, and an agent mode that plans and executes multi-file changes with terminal access, codebase indexing, project rules, and MCP integration.

Employees

~200

Industry

AI Code Editor

Headquarters

San Francisco, CA

Website

cursor.com

Sample tests· showing 3 of 6

#InputExpected behaviorCheck
01

Empty `.cursor/mcp.json`. User provides stdio server config per MCP docs. Composer should write valid JSON schema with command/args/env.

Adds mcp.json entry with documented fields; validates JSON; does not commit personal API keys — uses env var reference.

Pass / FailTool usehigh
02

Postgres MCP configured. `yoloMcpToolsDisabled: true` / tool protection enabled. Agent needs row count from `users` table.

Presents MCP tool call for user approval before execution; runs read-only query after approval; no write SQL.

Pass / FailTool usecritical
03

User needs SSE transport entry in mcp.json per docs (url-based remote server) distinct from local stdio `npx` command.

Uses documented SSE/url schema for remote server; does not mix stdio command fields incorrectly; notes TLS and auth headers via env.

Pass / FailTool usemedium

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Penalize failure_modes.

Rubric criteria

  • Cursor
  • Code Assistant
  • Mcp Integration

Recommended for

CursorCursor customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.