Upload unet_int8_sdpa_fp8_ocp/params.safetensors with huggingface_hub e6e3c03 verified GiusFra committed on Mar 4
Upload unet_int8_sdpa_fp8_ocp/quant_params.json with huggingface_hub 832910d verified GiusFra committed on Mar 4
Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_quant_params.json with huggingface_hub 9436fb6 verified GiusFra committed on Feb 28
Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_params.safetensors with huggingface_hub f4b3910 verified GiusFra committed on Feb 28
Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_params.safetensors with huggingface_hub b1a165a verified GiusFra committed on Feb 27
Upload unet_int8_sdpa_fp8_vae_int8_v2/params.safetensors with huggingface_hub 7d0b300 verified GiusFra committed on Feb 27
Upload unet_int8_sdpa_fp8_vae_int8_v2/quant_params.json with huggingface_hub 75d97e8 verified GiusFra committed on Feb 27
Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_params.safetensors with huggingface_hub 2266191 verified GiusFra committed on Feb 27
Upload unet_int8_sdpa_fp8_vae_int8_v2/vae_quant_params.json with huggingface_hub bb58fb1 verified GiusFra committed on Feb 27
Upload unet_int8_sdpa_fp8_vae_int8/vae_quant_params.json with huggingface_hub f61f04f verified GiusFra committed on Feb 20
Upload unet_int8_sdpa_fp8_vae_int8/unet_quant_params.json with huggingface_hub 59590aa verified GiusFra committed on Feb 20
Upload unet_int8_sdpa_fp8_vae_int8/vae_params.safetensors with huggingface_hub 1dbb8b4 verified GiusFra committed on Feb 20
Upload unet_int8_sdpa_fp8_vae_int8/unet_params.safetensors with huggingface_hub 99cda0b verified GiusFra committed on Feb 20
Upload all_quant_int8_sdpa_fp8/params.safetensors with huggingface_hub 8e60988 verified GiusFra committed on Feb 19
Upload all_quant_int8_sdpa_fp8/quant_params.json with huggingface_hub 008bca6 verified GiusFra committed on Feb 19
[math_model] Make it more obvious that softmax scale comes from the quantizer db5a15b nickfraser committed on Dec 17, 2024
Upload nvidia_fp8_unet/params.safetensors with huggingface_hub d9e66a0 verified GiusFra committed on Oct 3, 2024
Upload nvidia_fp8_unet/quant_params.json with huggingface_hub 730c8f5 verified GiusFra committed on Oct 3, 2024
Upload nvidia_fp8_unet/results_mlperf.json with huggingface_hub f4037ed verified GiusFra committed on Oct 3, 2024
Upload nvidia_fp8_unet/args.json with huggingface_hub 4e70299 verified GiusFra committed on Oct 3, 2024
MI250 QKV fused and all linear layers sym, FP8 attention, guidance scale 8, calib steps 8 b8d5ec9 verified GiusFra committed on Jul 20, 2024
MI250 QKV fused and all linear layers sym, FP8 attention, guidance scale 8, calib steps 10 7c9637e verified GiusFra committed on Jul 20, 2024
MI250 QKV fused and all layers sym, FP8 attention, guidance scale 8, calib steps 10 99f92dc verified GiusFra committed on Jul 20, 2024
MI250 QKV fused and all layers sym, FP8 attention, guidance scale 8, calib steps 10 4d701a1 verified GiusFra committed on Jul 20, 2024