omost-llama-3-8b-4bits is Omost's llama-3 model with 8k context length in nf4.

Downloads last month
1,455
Safetensors
Model size
4.65B params
Tensor type
BF16
F32
U8
Inference Examples
Inference API (serverless) has been turned off for this model.