qubitron commited on
Commit
4432b35
·
verified ·
1 Parent(s): cf308b2

Add INT8 and INT4 quantized weights

Browse files
llada_int4_quantized.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b42f17257baa051badbedd1d4e577fe868a8fa9fc834efb3c7a54ffca9538685
3
+ size 4788526525
llada_int8_quantized.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d5c9b729256e750902a8a91033deecacd091d2b8779a325828580aa9476eb3be
3
+ size 8537189053