Edit model card

omost-llama-3-8b-4bits is Omost's llama-3 model with 8k context length in nf4.

Downloads last month
9,031
Safetensors
Model size
4.65B params
Tensor type
BF16
F32
U8
Inference API (serverless) has been turned off for this model.