Zero-Vulnerability Container Scanning¶

Container Vulnerability Scanning

Zero-vulnerability pipeline implementation. Start with local validation before enabling CI enforcement.

# Build stage
FROM golang:1.21 AS builder
WORKDIR /build
COPY . .
RUN go build -o app .

# Runtime stage
FROM gcr.io/distroless/base-debian12
COPY --from=builder /build/app /app
ENTRYPOINT ["/app"]

Builder stage: 200+ packages, build tools, headers. Runtime stage: Just the binary.

Trivy scan targets the runtime stage only.

Exception Management¶

Not every HIGH vulnerability is exploitable in your context:

# .trivyignore
# CVE-2023-12345: DoS in HTTP library
# We don't expose HTTP directly, behind Envoy proxy
# Re-evaluate when library updates
CVE-2023-12345

# CVE-2023-67890: Privilege escalation in sudo
# Distroless image has no sudo
CVE-2023-67890

Document WHY each exception exists. Review quarterly.

SBOM Generation¶

Software Bill of Materials proves what's in your images:

- name: Generate SBOM
  uses: anchore/sbom-action@v0
  with:
    image: app:${{ github.sha }}
    format: cyclonedx-json
    output-file: sbom.json

- name: Upload SBOM
  uses: actions/upload-artifact@v4
  with:
    name: sbom-${{ github.sha }}
    path: sbom.json

SBOM lists every package, version, license. Critical for:

Audit compliance
Supply chain security
Vulnerability response (which images have log4j?)

See SDLC Hardening for audit integration.

Continuous Scanning¶

Images safe today might be vulnerable tomorrow when new CVEs drop:

# .github/workflows/scan-registry.yml
name: Continuous Scan

on:
  schedule:
    - cron: '0 6 * * *'  # Daily at 6am

jobs:
  scan-registry:
    runs-on: ubuntu-latest
    steps:
      - name: Scan all production images
        run: |
          for image in $(gcloud container images list --repository=gcr.io/project); do
            trivy image \
              --severity CRITICAL,HIGH \
              --exit-code 0 \
              --format json \
              --output scan-results.json \
              $image

            # Parse results, file issues for vulnerabilities
            if jq -e '.Results[].Vulnerabilities | length > 0' scan-results.json; then
              gh issue create \
                --title "Vulnerabilities in $image" \
                --body "$(cat scan-results.json)"
            fi
          done

New CVE drops? Issue created automatically. Team knows which images need rebuilds.

Grype as Alternative¶

Grype offers similar functionality with different tradeoffs:

- name: Scan with Grype
  uses: anchore/scan-action@v3
  with:
    image: app:${{ github.sha }}
    fail-build: true
    severity-cutoff: high

Trivy vs Grype:

Feature	Trivy	Grype
Speed	Faster	Slower
Database	Multiple sources	Anchore feeds
License scanning	Yes	Yes
Config scanning	Yes (K8s, Terraform)	No
Maintainer	Aqua Security	Anchore

Both work. Trivy has broader scanning (not just containers).

Policy Enforcement with Kyverno¶

Verify scanned images in admission control:

apiVersion: kyverno.io/v1
kind: ClusterPolicy
metadata:
  name: require-scanned-images
spec:
  validationFailureAction: Enforce
  rules:
    - name: check-image-signature
      match:
        any:
          - resources:
              kinds:
                - Pod
      verifyImages:
        - imageReferences:
            - "gcr.io/project/*"
          attestors:
            - entries:
                - keys:
                    publicKeys: |-
                      -----BEGIN PUBLIC KEY-----
                      ...
                      -----END PUBLIC KEY-----

Only signed, scanned images can deploy.

See Policy-as-Code with Kyverno for admission control patterns.

Breaking Builds Without Breaking Teams¶

Developers hate blocked builds. Make it easy to fix:

1. Show exact vulnerability¶

- name: Scan and report
  run: |
    trivy image \
      --format table \
      --severity CRITICAL,HIGH \
      app:${{ github.sha }} \
      | tee scan-results.txt

    if [ $? -ne 0 ]; then
      echo "::error::Vulnerabilities found. See scan results above."
      echo "::notice::Fix by updating base image or pinning versions."
      exit 1
    fi

2. Suggest fix¶

❌ Build failed: CRITICAL vulnerability CVE-2023-12345 in libssl

Fix options:
1. Update base image: python:3.11-slim → python:3.11.7-slim
2. Pin package version in requirements.txt
3. Add to .trivyignore with justification

Documentation: https://...

3. Document exception process¶

Make it clear when .trivyignore is acceptable and when it's not.

Metrics and Dashboards¶

Track vulnerability prevention:

# Export to Prometheus
- name: Export metrics
  run: |
    trivy image \
      --format json \
      app:${{ github.sha }} \
      > results.json

    CRITICAL=$(jq '[.Results[].Vulnerabilities[] | select(.Severity=="CRITICAL")] | length' results.json)
    HIGH=$(jq '[.Results[].Vulnerabilities[] | select(.Severity=="HIGH")] | length' results.json)

    curl -X POST metrics-endpoint \
      -d "vulnerabilities{severity=\"critical\"} $CRITICAL"
    curl -X POST metrics-endpoint \
      -d "vulnerabilities{severity=\"high\"} $HIGH"

Dashboard shows:

Vulnerabilities blocked per week
Time to remediate HIGH findings
Exception count (rising = problem)
Base image update lag

The Before/After Comparison¶

Aspect	Post-Push Scanning	Pre-Push Scanning
When found	After in registry	Before push
Vulnerable images	Can be deployed	Never reach registry
Developer feedback	Days later (ticket)	Immediate (CI failure)
Audit evidence	"We scan regularly"	"Vulnerable images blocked"
Response time	Hours to days	Seconds (auto-fail)

Implementation Checklist¶

Rolling out zero-vulnerability pipelines:

Choose scanner - Trivy or Grype
Select base images - Distroless where possible
Set severity threshold - CRITICAL + HIGH to start
Integrate in CI - Scan before push
Document exceptions - .trivyignore with rationale
Generate SBOMs - Every build
Continuous scanning - Daily registry scans
Track metrics - Vulnerabilities blocked, remediation time
Team training - How to fix findings, when to ignore

Common Pitfalls¶

Pitfall 1: Scanning After Push¶

Defeats the purpose. Scan before push, not after.

Pitfall 2: Trusting `latest` Tags¶

FROM python:latest  # Which version? When scanned?

Pin versions:

FROM python:3.11.7-slim  # Explicit, scannable

Pitfall 3: Ignoring MEDIUM Severity¶

MEDIUM today might be HIGH tomorrow. Track them.

Pitfall 4: No Exception Review¶

Exceptions accumulate. Review quarterly. Remove stale ignores.

The Full Stack¶

Container security is layered:

Base image selection - Start with minimal images
Build-time scanning - Block before push (this post)
Admission control - Kyverno verifies images
Runtime security - Pod Security Standards, AppArmor, seccomp
Continuous scanning - Catch new CVEs in existing images

Each layer complements the others.

Related Patterns¶

Zero-vulnerability pipelines fit into broader SDLC hardening:

SDLC Hardening - Build security into pipelines
Policy-as-Code with Kyverno - Admission control for images
Pre-commit Hooks as Security Gates - Catch issues before commit The CRITICAL CVE never reached production. Trivy blocked the build. The developer updated the base image. The pipeline turned green. Zero vulnerabilities deployed.