Skip to content
Exercises/VRAM Calculator
💾Ch 2-3beginner

VRAM Calculator

Calculate memory requirements for model inference: weights, KV cache, activations, and overhead vs GPU capacity.

Model Preset

Precision

4K tokens
1

Target GPU

VRAM Usage75.7 GB / 80 GB
Weights
Weights: 70.0 GBKV Cache: 0.7 GBActivations: 3.5 GBOverhead: 1.5 GB

Status

Fits on GPU

4.3 GB headroom remaining

Weights

70.0 GB

70B × FP8

KV Cache

0.7 GB

4,096 ctx × 1 batch

Activations

3.5 GB

~5% of weights

Total

75.7 GB