Metrics

Roofline Model

Visual performance model that shows achievable FLOP/s as a function of arithmetic intensity, with two ceilings: memory bandwidth and compute.

Definition

The roofline model plots an operation's attainable floating-point throughput (FLOP/s) against its arithmetic intensity, the number of floating-point operations performed per byte of memory traffic, on a log-log chart. Two ceilings bound performance: below the ridge point, where the two ceilings meet, throughput scales linearly with arithmetic intensity at a slope set by memory bandwidth (the memory-bound regime); above it, throughput is capped by peak compute throughput (the compute-bound regime). An operation's attainable performance is the minimum of the two ceilings: peak FLOP/s, or memory bandwidth multiplied by arithmetic intensity. By measuring where a kernel sits relative to the roofline, engineers can determine whether to invest in reducing memory traffic (e.g., kernel fusion, quantization) or increasing compute utilisation (e.g., better tiling).
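The bound described above can be sketched in a few lines of Python. This is a minimal illustration, not a profiling tool: the function names and the hardware figures (100 TFLOP/s peak compute, 1 TB/s memory bandwidth) are hypothetical values chosen for round numbers, not measurements of any real device.

```python
def attainable_flops(intensity, peak_flops, bandwidth):
    """Roofline bound in FLOP/s: the lower of the compute and memory ceilings.

    intensity  -- arithmetic intensity in FLOP per byte
    peak_flops -- peak compute throughput in FLOP/s
    bandwidth  -- peak memory bandwidth in bytes/s
    """
    return min(peak_flops, bandwidth * intensity)


def ridge_point(peak_flops, bandwidth):
    """Arithmetic intensity (FLOP/byte) at which the two ceilings meet."""
    return peak_flops / bandwidth


def regime(intensity, peak_flops, bandwidth):
    """Classify a kernel by which ceiling limits it."""
    if intensity < ridge_point(peak_flops, bandwidth):
        return "memory-bound"
    return "compute-bound"


# Hypothetical accelerator (assumed figures, for illustration only).
PEAK_FLOPS = 100e12   # 100 TFLOP/s
BANDWIDTH = 1e12      # 1 TB/s

if __name__ == "__main__":
    for intensity in (0.25, 10.0, 100.0, 400.0):
        bound = attainable_flops(intensity, PEAK_FLOPS, BANDWIDTH)
        label = regime(intensity, PEAK_FLOPS, BANDWIDTH)
        print(f"{intensity:7.2f} FLOP/byte -> {bound:.3e} FLOP/s ({label})")
```

With these assumed figures the ridge point falls at 100 FLOP/byte. A kernel at 10 FLOP/byte is limited to a tenth of peak compute no matter how well its arithmetic is optimised, which is the case where reducing memory traffic pays off.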
