This is an AWQ (4-bit) quantized version of OpenGVLab/InternVL2_5-4B.
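import torch
from transformers import AutoModelForImageTextToText, AutoProcessor

# Load the quantized checkpoint; device_map="auto" lets Accelerate place
# the weights on the available device(s).
model = AutoModelForImageTextToText.from_pretrained(
    "{REPO_ID}",
    torch_dtype=torch.float16,
    device_map="auto",
    trust_remote_code=True,
)

# The processor prepares image and text inputs for the model.
processor = AutoProcessor.from_pretrained("{REPO_ID}", trust_remote_code=True)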
Replace {REPO_ID} with the repo ID of this model.
See OpenGVLab/InternVL2_5-4B for the original FP16 model.
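To give a rough sense of what 4-bit quantization saves relative to the FP16 original, the sketch below estimates the memory taken by the weights alone. The figures are illustrative, not measured from this checkpoint: the parameter count is a nominal 4e9 taken from the "4B" in the model name, and the AWQ overhead assumes a group size of 128 with one FP16 scale and one 4-bit zero point per group. Real checkpoints may keep some modules in FP16, so actual file sizes can be larger.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory used by the weights alone, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30


# Assumption: nominal parameter count inferred from the "4B" in the model name.
NUM_PARAMS = 4e9

# Assumption: AWQ group size of 128, with one FP16 scale (16 bits) and one
# 4-bit zero point stored per group, on top of the 4-bit weights themselves.
GROUP_SIZE = 128
awq_bits = 4 + (16 + 4) / GROUP_SIZE

fp16_gib = weight_memory_gib(NUM_PARAMS, 16)
awq_gib = weight_memory_gib(NUM_PARAMS, awq_bits)

print(f"FP16 weights:  ~{fp16_gib:.2f} GiB")
print(f"AWQ 4-bit:     ~{awq_gib:.2f} GiB")
print(f"Reduction:     ~{fp16_gib / awq_gib:.1f}x")
```

Activations, the KV cache, and any unquantized layers add to these numbers at inference time, so treat the result as a lower bound on memory use.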