This is not TRUE bitnet since it's trained with float AND THEN QUANTIZED TO {-1,0,1}
#3 opened about 2 months ago
by
qmsoqm
Implementation/training code
1
#2 opened 2 months ago
by
jvh
finetuning Bitnet-Llama-70M
#1 opened 2 months ago
by
dcd12345678