💾Ch 2-3beginner
VRAM Calculator
Calculate memory requirements for model inference: weights, KV cache, activations, and overhead vs GPU capacity.
Model Preset
Precision
4K tokens
1
Target GPU
VRAM Usage75.7 GB / 80 GB
Weights
Weights: 70.0 GBKV Cache: 0.7 GBActivations: 3.5 GBOverhead: 1.5 GB
Status
Fits on GPU
4.3 GB headroom remaining
Weights
70.0 GB
70B × FP8
KV Cache
0.7 GB
4,096 ctx × 1 batch
Activations
3.5 GB
~5% of weights
Total
75.7 GB