Generating nonsense output and then broke

#23
by joycejiang - opened

The model is giving me a bunch of nonsense output, e.g. "íliaa enemies enemiesclicclic Sue SueUob pil Silvererde wc Sept", and when I ran it again it crashed with:
'RuntimeError: CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.'

This is how I load the model:

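  from auto_gptq import AutoGPTQForCausalLM

  # model_name is the repo id or local path of the GPTQ model (defined elsewhere)
  model = AutoGPTQForCausalLM.from_quantized(
      model_name,
      model_basename="gptq_model-4bit--1g",
      device_map="auto",
      use_safetensors=True,
      use_triton=False,
      trust_remote_code=True,
      quantize_config=None,
      inject_fused_attention=False,
  )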

There was a bug in AutoGPTQ 0.3.0 that caused gibberish output in some cases. Please update to AutoGPTQ 0.3.2, which was released yesterday and fixes this issue:

pip3 uninstall -y auto-gptq
pip3 install auto-gptq==0.3.2
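
After reinstalling, it may be worth confirming that the environment really picked up the new release before loading the model again. The snippet below is a minimal sketch, not part of AutoGPTQ: the helper names (`parse_version`, `has_gibberish_fix`, `installed_auto_gptq_version`) are made up for illustration, and the comparison is deliberately simple (it drops local build suffixes such as `+cu118` and does not treat pre-release tags specially).

```python
import re
from importlib import metadata
from typing import Optional, Tuple

# Release named in the reply above as containing the fix.
FIXED_VERSION = "0.3.2"


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '0.3.2' or '0.3.2+cu118' into (0, 3, 2), ignoring the local build suffix."""
    public = text.split("+", 1)[0]
    return tuple(int(part) for part in re.findall(r"\d+", public))


def has_gibberish_fix(installed: str, fixed: str = FIXED_VERSION) -> bool:
    """Return True if the installed version is at least the fixed release."""
    return parse_version(installed) >= parse_version(fixed)


def installed_auto_gptq_version() -> Optional[str]:
    """Look up the installed auto-gptq version, or None if it is not installed."""
    try:
        return metadata.version("auto-gptq")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_auto_gptq_version()
    if version is None:
        print("auto-gptq is not installed in this environment")
    elif has_gibberish_fix(version):
        print(f"auto-gptq {version} includes the fix")
    else:
        print(f"auto-gptq {version} is older than {FIXED_VERSION}; please upgrade")
```

If the script still reports an older version, the `pip3` used for the install probably belongs to a different Python environment than the one running the model.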
