nvfp4
Collection
collection of mixed dtypes between weight layers, 3d matrix layers and 2d blacklisted layers • 5 items • Updated
Pin down what is needed between 50s,40s,30s cards
Translate info for use in TensorRT fp4 converting / onnx
Anyone that has made fp4 onnx before with NVIDIA’s Model Optimizer PTQ help in the Community tab is welcomed.
Currently working on correct Quantize/Dequantize nodes for onnx export to build fp4 RT engine for 50s cards.
txt_attn dtype notes
txt_attn weights: NVFP4txt_attn weights: BF16txt_attn weights: FP8 (float8_e4m3fn)txt_attn scaling: absmaxtxt_attn weights: FP8 (float8_e5m2)txt_attn scaling: absmaxBase model
black-forest-labs/FLUX.2-klein-4B