Graceful Degradation

When the optimal path fails, fall back to progressively more expensive but reliable alternatives.

Key Insight

Degrade performance, not availability. Every operation should have a guaranteed fallback that always succeeds.


Overview

Graceful degradation is a design principle that ensures systems continue operating when components fail. Rather than crashing or returning errors, the system automatically falls back to slower but working alternatives.

flowchart TD
    subgraph request[Request]
        A[Operation Requested]
    end

    subgraph tiers[Fallback Tiers]
        T1[Tier 1: Optimal]
        T2[Tier 2: Acceptable]
        T3[Tier 3: Guaranteed]
    end

    subgraph result[Result]
        Success[Success]
    end

    A --> T1
    T1 -->|Works| Success
    T1 -->|Fails| T2
    T2 -->|Works| Success
    T2 -->|Fails| T3
    T3 --> Success

    %% Ghostty Hardcore Theme
    style A fill:#65d9ef,color:#1b1d1e
    style T1 fill:#a7e22e,color:#1b1d1e
    style T2 fill:#fd971e,color:#1b1d1e
    style T3 fill:#f92572,color:#1b1d1e
    style Success fill:#a7e22e,color:#1b1d1e

The key insight: degrade performance, not availability.


The Tiered Fallback Pattern

Every graceful degradation implementation follows this structure:

Tier                  Characteristics              Example
Tier 1: Optimal       Fast, cheap, preferred       Volume mount read
Tier 2: Acceptable    Slower, costlier, reliable   API call
Tier 3: Guaranteed    Expensive but always works   Full rebuild

Each tier must:

  1. Detect failure of the previous tier
  2. Attempt its operation independently
  3. Report which tier succeeded (observability); a sketch follows below
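
A minimal Go sketch of this contract (the Tier type and Run helper are illustrative names, not from the examples below):

import "log"

type Tier struct {
    Name    string                 // reported for observability
    Attempt func() ([]byte, error) // this tier's operation
}

// Run walks the tiers in order, returning the first success and
// logging which tier served the request. The final tier in the
// slice is expected to always succeed.
func Run(tiers []Tier) ([]byte, error) {
    var lastErr error
    for _, t := range tiers {
        data, err := t.Attempt()
        if err == nil {
            log.Printf("served by tier %q", t.Name)
            return data, nil
        }
        log.Printf("tier %q failed: %v; falling back", t.Name, err)
        lastErr = err
    }
    return nil, lastErr // reached only if no guaranteed tier exists
}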

Real-World Examples

Cache Access Pattern

From "From 5 Seconds to 5 Milliseconds":

Volume Mount → API Call → Rebuild Cache
    1-5ms        50ms        5000ms

# Kubernetes volume mount with optional flag
volumes:
  - name: cache-volume
    configMap:
      name: deployment-cache
      optional: true  # Tier 1 can fail gracefully

The read path then walks the tiers in order:

func GetDeployments(image string) ([]Deployment, error) {
    // Tier 1: Try volume mount
    if data, err := os.ReadFile("/etc/cache/deployments.json"); err == nil {
        return parseDeployments(data, image)
    }

    // Tier 2: Try API call
    if data, err := k8s.GetConfigMap("deployment-cache"); err == nil {
        return parseDeployments(data, image)
    }

    // Tier 3: Rebuild from cluster scan
    return scanClusterForImage(image)
}

CI/CD Dependency Resolution

Artifact Cache → Dependency Cache → Fresh Install
    seconds          minutes          minutes+

- uses: actions/cache@v4
  id: artifact-cache
  with:
    path: dist/
    key: build-${{ hashFiles('src/**') }}

- uses: actions/cache@v4
  if: steps.artifact-cache.outputs.cache-hit != 'true'
  id: dep-cache
  with:
    path: node_modules/
    key: deps-${{ hashFiles('package-lock.json') }}

- name: Install dependencies
  # A skipped dep-cache step reports an empty cache-hit, so also
  # check the artifact cache to avoid reinstalling on a Tier 1 hit.
  if: steps.artifact-cache.outputs.cache-hit != 'true' && steps.dep-cache.outputs.cache-hit != 'true'
  run: npm ci

- name: Build
  if: steps.artifact-cache.outputs.cache-hit != 'true'
  run: npm run build

API Resilience

Primary Endpoint → Secondary Endpoint → Cached Response → Static Fallback
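
A sketch of that ladder in Go; the endpoint URLs and the static JSON fallback are placeholders, not real services:

import (
    "io"
    "net/http"
)

var lastGood []byte // Tier 3: most recent successful response

func fetchStatus(client *http.Client) []byte {
    // Tiers 1 and 2: primary endpoint, then secondary.
    for _, url := range []string{
        "https://api.example.com/status",    // placeholder primary
        "https://backup.example.com/status", // placeholder secondary
    } {
        resp, err := client.Get(url)
        if err != nil {
            continue
        }
        body, readErr := io.ReadAll(resp.Body)
        resp.Body.Close()
        if readErr == nil && resp.StatusCode == http.StatusOK {
            lastGood = body // refresh the cached tier
            return body
        }
    }
    // Tier 3: serve the last cached response, if any.
    if lastGood != nil {
        return lastGood
    }
    // Tier 4: static fallback that always succeeds.
    return []byte(`{"status":"degraded"}`)
}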

Authentication

SSO → API Token → Service Account → Anonymous (read-only)
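
The same shape applies to credentials; ssoToken and serviceAccountToken below are hypothetical helpers:

import "os"

type Credentials struct {
    Token    string
    ReadOnly bool
}

func resolveAuth() Credentials {
    if tok, err := ssoToken(); err == nil { // Tier 1: SSO (hypothetical helper)
        return Credentials{Token: tok}
    }
    if tok := os.Getenv("API_TOKEN"); tok != "" { // Tier 2: API token
        return Credentials{Token: tok}
    }
    if tok, err := serviceAccountToken(); err == nil { // Tier 3 (hypothetical helper)
        return Credentials{Token: tok}
    }
    return Credentials{ReadOnly: true} // Tier 4: anonymous, read-only
}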

Graceful Degradation vs Fail Fast

These patterns are complementary, not contradictory:

Scenario                  Pattern                Reasoning
Precondition not met      Fail Fast              Don't waste resources on doomed operations
Runtime component fails   Graceful Degradation   Continue with fallback
Invalid input             Fail Fast              User error, report immediately
Network timeout           Graceful Degradation   Infrastructure issue, retry/fallback
Missing required config   Fail Fast              Can't continue safely
Cache miss                Graceful Degradation   Expensive path still works

Decision rule: Fail fast on precondition failures. Degrade gracefully on runtime failures.
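
The rule in one function, as a sketch (Config, cachedManifest, buildManifest, and apply are hypothetical):

import (
    "errors"
    "log"
)

// Deploy fails fast on bad preconditions, then degrades
// gracefully on runtime failures. cachedManifest, buildManifest,
// and apply are hypothetical helpers.
func Deploy(cfg *Config) error {
    // Precondition failures: fail fast, no work wasted.
    if cfg == nil || cfg.ClusterURL == "" {
        return errors.New("missing required config")
    }

    // Runtime failures: degrade gracefully.
    manifest, err := cachedManifest(cfg.App) // Tier 1: cache
    if err != nil {
        log.Printf("manifest cache miss, rebuilding")
        manifest, err = buildManifest(cfg.App) // guaranteed tier
        if err != nil {
            return err
        }
    }
    return apply(manifest)
}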


Anti-Patterns

1. Silent Degradation

Degrading without logging or alerting means you won't know when Tier 1 is broken.

// Bad: silent fallback
func getData() []byte {
    if data, _ := cache.Get(); data != nil {
        return data
    }
    return fetchFromAPI()  // No indication we're in degraded mode
}

// Good: observable fallback
func getData() []byte {
    if data, err := cache.Get(); err == nil {
        metrics.CacheHit()
        return data
    }
    metrics.CacheMiss()
    log.Warn("cache miss, falling back to API")
    return fetchFromAPI()
}

2. No Guaranteed Tier

Every chain needs a final tier that always succeeds.

// Bad: can fail completely
func getConfig() (*Config, error) {
    if cfg := cache.Get(); cfg != nil {
        return cfg, nil
    }
    return api.FetchConfig()  // What if API is also down?
}

// Good: guaranteed fallback
func getConfig() *Config {
    if cfg := cache.Get(); cfg != nil {
        return cfg
    }
    if cfg, err := api.FetchConfig(); err == nil {
        return cfg
    }
    return DefaultConfig()  // Always works
}

3. Expensive Default Path

Using Tier 3 as the happy path defeats the purpose.

# Bad: always does full install
- run: npm ci
- uses: actions/cache/save@v4
  with:
    path: node_modules/
    key: deps-${{ hashFiles('package-lock.json') }}

# Good: cache-first approach
- uses: actions/cache@v4
  id: cache
  with:
    path: node_modules/
    key: deps-${{ hashFiles('package-lock.json') }}

- if: steps.cache.outputs.cache-hit != 'true'
  run: npm ci

4. No Observability

You need to know:

  • Which tier is serving traffic
  • How often fallbacks occur
  • Latency per tier

- name: Report cache tier
  run: |
    if [ "${{ steps.mount-cache.outcome }}" = "success" ]; then
      echo "cache_tier=mount" >> metrics.txt
    elif [ "${{ steps.api-cache.outcome }}" = "success" ]; then
      echo "cache_tier=api" >> metrics.txt
    else
      echo "cache_tier=rebuild" >> metrics.txt
    fi

Implementation Checklist

Before implementing graceful degradation:

  • [ ] Define all tiers before writing code
  • [ ] Identify the guaranteed tier that always succeeds
  • [ ] Instrument each tier with metrics/logs (a sketch follows this list)
  • [ ] Alert on tier shifts (e.g., Tier 1 failure rate > 5%)
  • [ ] Test fallback paths in CI, not just production
  • [ ] Document expected latencies for each tier
  • [ ] Set SLOs per tier (Tier 1: p99 < 10ms, Tier 2: p99 < 500ms)
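
For the instrumentation and alerting items, one approach is a per-tier counter; this sketch assumes Prometheus's Go client, and the metric name is illustrative:

import "github.com/prometheus/client_golang/prometheus"

var tierServed = prometheus.NewCounterVec(
    prometheus.CounterOpts{
        Name: "cache_tier_served_total", // illustrative name
        Help: "Requests served, labeled by fallback tier.",
    },
    []string{"tier"},
)

func init() { prometheus.MustRegister(tierServed) }

// At the point each tier succeeds:
//   tierServed.WithLabelValues("mount").Inc()   // Tier 1
//   tierServed.WithLabelValues("api").Inc()     // Tier 2
//   tierServed.WithLabelValues("rebuild").Inc() // Tier 3
// Alert when the "rebuild" share exceeds a threshold (e.g. 5%).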

Relationship to Other Patterns

Pattern          How Graceful Degradation Applies
Caching          Fallback tiers when cache misses
Work Avoidance   When detection fails, do the work anyway
Idempotency      Safe retries as fallback mechanism
Fail Fast        Complementary: fail fast on preconditions, degrade on runtime
Error Handling   Recovery strategy selection
