Docker Hub And Registry
Docker evals — Docker Hub & Registry (relift v3 InfraRed)
About Docker
Docker is the container platform. Its products include Docker Engine, Docker Desktop, the Docker Hub registry, Docker Build Cloud for managed cloud builders, Docker Scout for image vulnerability scanning and supply-chain policy, Docker Compose for multi-container development, and Docker Model Runner for local LLM inference. Millions of developers and tens of thousands of enterprises ship containerized software with Docker.
Sample tests · showing 3 of 9
| # | Input | Expected behavior | Check |
|---|---|---|---|
| 01 | Production deployment references myorg/api:latest. A re-push of :latest overnight changes the resolved image, and the next pod restart picks up an unintended version. | Production deploys pin by digest: myorg/api@sha256:<digest>. Resolve the digest once at release time via 'docker buildx imagetools inspect' or 'docker manifest inspect' and store it in the deployment manifest. Tags are mutable references; digests are immutable content addresses. | Pass / Fail · AI Platform · critical |
| 02 | CI cluster pulls public Docker Hub images on every job. Builds fail sporadically with 'toomanyrequests: You have reached your pull rate limit'. | Authenticate pulls with a Hub account or org so that quota is counted per user rather than per IP. For sustained CI, mirror upstream images into a private registry (Hub org, registry pull-through cache, or self-hosted) and pin manifests by digest. Numeric limits [REQUIRES-VERIFICATION] against docs.docker.com/docker-hu… | Pass / Fail · AI Platform · high |
| 03 | Hub repo has 12,000 untagged images from CI 'sha-abc123' pushes over a year. Storage and surface area are both bloated. | Configure repo image management / tag retention rules (Pro+/Team feature [REQUIRES-VERIFICATION current tier gating]) to age out untagged images after N days or keep the N most recent tagged images. For private registries, run the registry binary's 'registry garbage-collect' command against the registry config file after deletion (there is no 'docker registry' subcommand). Always keep production-pinned dig… | Pass / Fail · AI Platform · medium |
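The digest-pinning expectation in test 01 can be checked mechanically. The sketch below is illustrative only and is not part of the eval: the helper names `is_digest_pinned` and `unpinned_images` are hypothetical, and it assumes sha256 digests (64 lowercase hex characters), which is the form shown in the table.

```python
import re

# Assumption: only sha256 digests are in use, i.e. "sha256:" followed by
# exactly 64 lowercase hex characters.
_DIGEST_RE = re.compile(r"^sha256:[0-9a-f]{64}$")


def is_digest_pinned(image_ref: str) -> bool:
    """Return True if the reference ends in a well-formed @sha256:<digest>."""
    # Split on the last "@": everything before it is the repository
    # (optionally with a tag), everything after it should be the digest.
    name, sep, digest = image_ref.rpartition("@")
    return bool(sep) and bool(name) and _DIGEST_RE.match(digest) is not None


def unpinned_images(image_refs):
    """Return the references that rely on a mutable tag instead of a digest."""
    return [ref for ref in image_refs if not is_digest_pinned(ref)]
```

A release check could run `unpinned_images` over the image references found in a deployment manifest and fail if the result is non-empty. The function only validates the shape of the reference; it does not confirm that the digest exists in the registry.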
How this eval is graded
Grade against expected.ideal_behavior and expected.rubric. Per-criterion pass requires mean >= 4.0 and no criterion below 3.
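One possible reading of the pass rule, sketched in Python. This is an interpretation, not the grader's actual implementation: it assumes each rubric criterion receives a single numeric score, that the mean is taken across criteria, and that the function name `eval_passes` and its parameters are hypothetical.

```python
from statistics import mean


def eval_passes(scores, min_mean=4.0, min_score=3):
    """Apply the stated thresholds to a mapping of criterion name -> score.

    Assumption: the mean is computed across criteria, and the floor applies
    to each individual criterion score.
    """
    if not scores:
        return False  # nothing was graded, so treat it as a fail
    values = list(scores.values())
    return mean(values) >= min_mean and min(values) >= min_score
```

Under this reading, scores of 5, 4, and 3 pass (mean 4.0, floor 3), while 5, 5, and 2 fail on the floor even though the mean is 4.0.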
Rubric criteria
- Docker
- AI Platform
- Docker Hub And Registry