gemma3 — quantized
Collection
Quantized Gemma3 multimodal models. • 2 items • Updated
This is a GPTQ (4-bit) quantized version of google/gemma-3-4b-it.
from transformers import AutoProcessor, AutoModelForImageTextToText
import torch
model = AutoModelForImageTextToText.from_pretrained(
"{REPO_ID}",
torch_dtype=torch.float16,
device_map="auto",
trust_remote_code=True,
)
processor = AutoProcessor.from_pretrained("{REPO_ID}", trust_remote_code=True)
Replace {REPO_ID} with the repo ID of this model.
See google/gemma-3-4b-it for the original FP16 model.