Skip to content
Hardware

NVLink

NVIDIA's high-speed GPU-to-GPU interconnect, delivering up to 900 GB/s bidirectional bandwidth on H100 NVLink 4.0 for tensor and pipeline parallelism.

Definition

NVLink is NVIDIA's proprietary high-bandwidth interconnect for connecting multiple GPUs on the same node. NVLink 4.0 (H100 SXM) provides 900 GB/s of bidirectional bandwidth per GPU vs. PCIe 5.0's 128 GB/s, making all-reduce operations in tensor parallelism far more efficient. Up to 8 GPUs can be connected in a fully connected NVSwitch fabric within an HGX node, enabling 70+ TB/s of all-to-all bandwidth. NVLink is essential for large-model tensor parallelism beyond 2 GPUs; without it, all-reduce becomes a throughput bottleneck.

More Hardware terms