Docker Engine Containers Runtime
Docker (Engine, Hub, Build Cloud, Scout, Desktop, Model Runner) · Docker
Container Platform — Docker
Docker evals — Docker Engine, Containers & Runtime (relift v3 InfraRed)
About Docker
Docker is the container platform — Docker Engine, Docker Desktop, Docker Hub registry, Docker Build Cloud for managed cloud builders, Docker Scout for image vulnerability scanning and supply-chain policy, Docker Compose for multi-container dev, and Docker Model Runner for local LLM inference. Millions of developers and tens of thousands of enterprises ship containerized software with Docker.
Sample tests· showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Container reports 'Up' but the app inside crashed and is no longer serving HTTP. Load balancer keeps sending traffic. | Add HEALTHCHECK --interval=30s --timeout=5s --start-period=10s --retries=3 CMD curl -fsS http://localhost:8080/health || exit 1 in the Dockerfile (or healthcheck in compose). 'docker ps' status becomes 'healthy' / 'unhealthy'. Load balancer and depends_on consumers gate on health. | Pass / FailAi Platformhigh |
| 02 | DB container uses '-v /home/user/db:/var/lib/postgres' — host path. After host upgrade, the host path changes and DB data appears empty. | Use named volume: '-v pgdata:/var/lib/postgres' with 'docker volume create pgdata'. Engine manages storage path. Stable across host upgrades. Bind mounts are appropriate for source-code dev where host editability matters. | Pass / FailAi Platformcritical |
| 03 | Long-running service crashes on transient downstream errors. Currently started with 'docker run' and no --restart. | Use --restart unless-stopped (auto-restart on crash; respects explicit stop) or --restart on-failure:5 (cap retries). --restart always restarts even after explicit stop on next daemon boot. Pair with HEALTHCHECK so the engine restart isn't masked by a hung process. | Pass / FailAi Platformmedium |
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
Rubric criteria
- Docker
- Ai Platform
- Docker Engine Containers Runtime
Recommended for
Works with
Related evals
Run this eval in your workspace
Connect your data, configure thresholds, and review results with your team.