
Model: Llama 3 8B 4-bit

This is a Llama 3 8B model quantized to 4 bits using bitsandbytes, for faster loading. Enjoy!
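A minimal loading sketch, assuming the checkpoint is published on the Hub as `corneille97/Llama-3-8B-4bits-turbo`, that the repository ships its own tokenizer files, and that `transformers`, `accelerate`, and `bitsandbytes` are installed on a machine with a CUDA GPU. The helper names `load_model` and `generate` are hypothetical and chosen for illustration. Because the weights are already quantized, the quantization settings are expected to be read from the checkpoint's config, so no extra quantization arguments are passed here.

```python
# Hub repository id (assumed from the model page; adjust if it differs).
REPO_ID = "corneille97/Llama-3-8B-4bits-turbo"


def load_model(repo_id: str = REPO_ID):
    """Load the tokenizer and the pre-quantized 4-bit model."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the repository includes tokenizer files.
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # device_map="auto" lets accelerate place the weights on the available GPU(s).
    model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a completion for `prompt` with the quantized model."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello")` downloads the checkpoint on first use. The base model is gated under the Meta Llama 3 license, so check the license terms before redistributing outputs or weights.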

License: see https://huggingface.co/meta-llama/Meta-Llama-3-8B

Safetensors
Model size: 4.65B params
Tensor types: FP16, F32, U8
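The reported size is smaller than 8B because 4-bit weights are packed two per byte and counted as U8 elements. The sketch below is one plausible accounting, not a statement about how this checkpoint was produced. It assumes the publicly documented Llama 3 8B dimensions, a bitsandbytes block size of 64 with one F32 scale per block, no double quantization, and embeddings, output head, and norm weights left in FP16. The function name `count_stored_elements` is hypothetical.

```python
# Llama 3 8B dimensions (assumed to apply to this checkpoint).
VOCAB, HIDDEN, INTERMEDIATE, LAYERS, KV_DIM = 128256, 4096, 14336, 32, 1024
BLOCKSIZE = 64  # assumed 4-bit quantization block size


def count_stored_elements() -> dict:
    """Estimate the number of stored tensor elements per dtype."""
    # q and o projections are HIDDEN x HIDDEN; k and v are HIDDEN x KV_DIM.
    attention = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * KV_DIM
    # gate, up, and down projections.
    mlp = 3 * HIDDEN * INTERMEDIATE
    linear = LAYERS * (attention + mlp)
    # Input embeddings and output head, assumed unquantized.
    embeddings = 2 * VOCAB * HIDDEN
    # Two norm vectors per layer plus the final norm.
    norms = (2 * LAYERS + 1) * HIDDEN
    return {
        "U8": linear // 2,           # two 4-bit weights packed per byte
        "F32": linear // BLOCKSIZE,  # one scale per block of weights
        "FP16": embeddings + norms,
    }


stored = count_stored_elements()
total = sum(stored.values())
for dtype, count in stored.items():
    print(f"{dtype}: {count / 1e9:.2f}B elements")
print(f"total: {total / 1e9:.2f}B elements")
```

Under these assumptions the total comes to roughly 4.65B elements, which is consistent with the model size shown above.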