Commit History

Upload unet_int8_sdpa_fp8_ocp/params.safetensors with huggingface_hub
e6e3c03
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_ocp/quant_params.json with huggingface_hub
832910d
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_quant_params.json with huggingface_hub
9436fb6
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_params.safetensors with huggingface_hub
f4b3910
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_params.safetensors with huggingface_hub
b1a165a
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8_v2/params.safetensors with huggingface_hub
7d0b300
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8_v2/quant_params.json with huggingface_hub
75d97e8
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_params.safetensors with huggingface_hub
2266191
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_quant_params.json with huggingface_hub
bb58fb1
verified

GiusFra commited on

Create config.json
ae57958
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8/vae_quant_params.json with huggingface_hub
f61f04f
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8/unet_quant_params.json with huggingface_hub
59590aa
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8/vae_params.safetensors with huggingface_hub
1dbb8b4
verified

GiusFra commited on

Upload unet_int8_sdpa_fp8_vae_int8/unet_params.safetensors with huggingface_hub
99cda0b
verified

GiusFra commited on

Upload all_quant_int8_sdpa_fp8/params.safetensors with huggingface_hub
8e60988
verified

GiusFra commited on

Upload all_quant_int8_sdpa_fp8/quant_params.json with huggingface_hub
008bca6
verified

GiusFra commited on

[math_model] Make it more obvious that softmax scale comes from the quantizer
db5a15b

nickfraser commited on

Create math_model.py
6f59b43
verified

GiusFra commited on

Upload nvidia_fp8_unet/params.safetensors with huggingface_hub
d9e66a0
verified

GiusFra commited on

Upload nvidia_fp8_unet/quant_params.json with huggingface_hub
730c8f5
verified

GiusFra commited on

Upload nvidia_fp8_unet/results_mlperf.json with huggingface_hub
f4037ed
verified

GiusFra commited on

Upload nvidia_fp8_unet/args.json with huggingface_hub
4e70299
verified

GiusFra commited on

Create config.json
b0f9624
verified

GiusFra commited on

Create config.json
b7db598
verified

GiusFra commited on

Create config.json
864a3a2
verified

GiusFra commited on

Create config.json
25e566b
verified

GiusFra commited on

Updated sdpa fp8 models
fa0155f

nickfraser commited on

Added models that are fully quantized with FP8.
cfd94d7

nickfraser commited on

Added SDPA math model & test
3fea540

nickfraser commited on

Fix names
740d40f

GiusFra commited on

MI250 QKV fused and all linear layers sym, FP8 attention, guidance scale 8, calib steps 8
b8d5ec9
verified

GiusFra commited on

Fix names
08a2fb9

GiusFra commited on

MI250 QKV fused and all linear layers sym, FP8 attention, guidance scale 8, calib steps 10
7c9637e
verified

GiusFra commited on

MI250 QKV fused and all layers sym, FP8 attention, guidance scale 8, calib steps 10
99f92dc
verified

GiusFra commited on

MI250 QKV fused and all layers sym, FP8 attention, guidance scale 8, calib steps 10
4d701a1
verified

GiusFra commited on

updated quant_params with QKV fusion
6751dca
verified

GiusFra commited on

update int8+fp8 safetensors with fused QKV
7d9a30f
verified

GiusFra commited on

update int8+fp8 safetensors
16771c1
verified

GiusFra commited on

updated quant_params for FNUZ
f4b2bb6
verified

GiusFra commited on

add missing smoothquant_mul
6b39796
verified

GiusFra commited on

update int8+fp8 safetensors
9886f46
verified

GiusFra commited on

update int8+fp8 safetensors
fa8dc75
verified

GiusFra commited on

update int8+fp8 quant_param
7a5baa7
verified

GiusFra commited on

Upload sdxl.safetensors with huggingface_hub
7c9bbe7
verified

bowenbaoamd commited on

Upload sdxl.json with huggingface_hub
8b25dab
verified

bowenbaoamd commited on