Edit model card

omost-llama-3-8b-4bits is Omost's llama-3 model with 8k context length in nf4.

Downloads last month
19,562
Safetensors
Model size
4.65B params
Tensor type
BF16
F32
U8
Inference API
Input a message to start chatting with lllyasviel/omost-llama-3-8b-4bits.
Inference API (serverless) has been turned off for this model.