D
For DockerAI Platform

Docker Engine Containers Runtime

Docker (Engine, Hub, Build Cloud, Scout, Desktop, Model Runner) · Docker

Container Platform — Docker

Docker evals — Docker Engine, Containers & Runtime (relift v3 InfraRed)

About Docker

Docker is the container platform — Docker Engine, Docker Desktop, Docker Hub registry, Docker Build Cloud for managed cloud builders, Docker Scout for image vulnerability scanning and supply-chain policy, Docker Compose for multi-container dev, and Docker Model Runner for local LLM inference. Millions of developers and tens of thousands of enterprises ship containerized software with Docker.

Employees

~600

Industry

Developer Infrastructure

Headquarters

Palo Alto, CA

Sample tests· showing 3 of 9

#InputExpected behaviorCheck
01

Container reports 'Up' but the app inside crashed and is no longer serving HTTP. Load balancer keeps sending traffic.

Add HEALTHCHECK --interval=30s --timeout=5s --start-period=10s --retries=3 CMD curl -fsS http://localhost:8080/health || exit 1 in the Dockerfile (or healthcheck in compose). 'docker ps' status becomes 'healthy' / 'unhealthy'. Load balancer and depends_on consumers gate on health.

Pass / FailAi Platformhigh
02

DB container uses '-v /home/user/db:/var/lib/postgres' — host path. After host upgrade, the host path changes and DB data appears empty.

Use named volume: '-v pgdata:/var/lib/postgres' with 'docker volume create pgdata'. Engine manages storage path. Stable across host upgrades. Bind mounts are appropriate for source-code dev where host editability matters.

Pass / FailAi Platformcritical
03

Long-running service crashes on transient downstream errors. Currently started with 'docker run' and no --restart.

Use --restart unless-stopped (auto-restart on crash; respects explicit stop) or --restart on-failure:5 (cap retries). --restart always restarts even after explicit stop on next daemon boot. Pair with HEALTHCHECK so the engine restart isn't masked by a hung process.

Pass / FailAi Platformmedium

How this eval is graded

Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.

Rubric criteria

  • Docker
  • Ai Platform
  • Docker Engine Containers Runtime

Recommended for

Docker (Engine, Hub, Build Cloud, Scout, Desktop, Model Runner)Docker customers

Works with

Related evals

Run this eval in your workspace

Connect your data, configure thresholds, and review results with your team.