[AUTOMATED] Model Memory Requirements
#9 opened 27 days ago
by
model-sizer-bot
The model weights are not tied. Please use the `tie_weights` method before using the `infer_auto_device` function.
1
#8 opened 3 months ago
by
carlosmoises
GPTQ 4bit 128g
#7 opened 5 months ago
by
pszemraj

3B Model
#6 opened 5 months ago
by
aszfcxcgszdx
GGML f16, q4_0, q4_1, q4_2, q4_3
#4 opened 5 months ago
by
oeathus
Can anyone make ggml 4bit q4_0 version?
3
#3 opened 5 months ago
by
4eJIoBek

safetensors shards of 2GB
3
#1 opened 5 months ago
by
antplsdev