Size mismatch
#12 opened 3 months ago
by
Bleking
Are there any plans to publish a version of the model with only pruning and no distillation?
#11 opened 3 months ago
by
kurogane
Any chance of a pruned 70B?
#10 opened 4 months ago
by
pszemraj
Update config.json
1
#8 opened 4 months ago
by
deshpandeabhi
I could load the model but could not run inference
#7 opened 4 months ago
by
rijotomjackson
Weight Error in Notebook
8
#5 opened 4 months ago
by
atharvanighot
Impact on effective context size?
#3 opened 4 months ago
by
BernardH
Is Instruct 4B-Width also going to be published?
2
#1 opened 4 months ago
by
Qubitium