Edit model card

Llama-3-Orca-1.0-8B-GGUF

Quant of https://huggingface.co/Locutusque/Llama-3-Orca-1.0-8B

  • f32
  • f16
  • Q8_0
  • Q4_K_M
  • Q2_K
Downloads last month
185
GGUF
Model size
8.03B params
Architecture
llama

2-bit

4-bit

5-bit

8-bit

16-bit

Inference API
This model can be loaded on Inference API (serverless).

Collection including leafspark/Llama-3-Orca-1.0-8B-GGUF