Description

This repo contains fp8 model files for aya-expanse-32b.

Quantization parameter

  • activation_scheme : dynamic
  • quant_method : fp8
Downloads last month
2
Safetensors
Model size
32.3B params
Tensor type
FP16
·
F8_E4M3
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for minyichen/aya-expanse-32b-Dynamic-fp8

Quantized
(17)
this model