phi-1_5-nf4-fp16compute-doublequant-BNB / generation_config.json

Commit History

ADD doublequant 4 bit quantization of phi 1.5B model
18d7875

DaertML committed on