Note:

This repo hosts only the Q5_K_S iMatrix quant of Llama 3 Lumimaid 8B v0.1 OAS. The GGUF quant is from Lewdiculous/Llama-3-Lumimaid-8B-v0.1-OAS-GGUF-IQ-Imatrix. The additional files in this repo are for personal use in Text Gen Webui with the llamacpp_hf loader.

Format: GGUF

Model size: 8.03B params

Architecture: llama

Quantization: 5-bit
