SpectraSuite/QuantLM_3.9B_8bit_Unpacked
Text Generation
•
Updated
•
14
QuantLMs, unpacked to FP16 format - compatible with FP16 GEMMs. After unpacking, QuantLMs have the same architecture as LLaMa.