image

This repository contains model weights for the unofficial LQ8 quantizations of Gemma4 E2B.

LQ8 is an experimental quantization technique that is still in early beta, designed to provide fp16-level quality with the same or lower memory footprint as Q8_0.

LQ8 is currently compatible with llama.cpp and Ollama out of the box. Please create a discussion if you find a bug.

File Name Quant Type Bit Depth Size Download Link
model-LQ8.gguf LQ8 ~8 bpw 4.33 GB 📥 Download LQ8
Downloads last month
530
GGUF
Model size
5B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for reecdev/Gemma-4-E2B-LQ8-GGUF

Quantized
(242)
this model