OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5 · Probability tensor contains either `inf`, `nan` or element

I am trying to make inference with this code:

from transformers import GPTNeoXForCausalLM, AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained("OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5")
model = GPTNeoXForCausalLM.from_pretrained("OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5",device_map="auto", load_in_8bit=True)
message = "<|prompter|>What is a meme, and what's the history behind this word?<|endoftext|><|assistant|>"
inputs = tokenizer(message, return_tensors="pt").to(model.device)
tokens = model.generate(**inputs, max_new_tokens=1000, do_sample=True, temperature=0.8)
tokenizer.decode(tokens[0])

and it's returning this error all the time:
RuntimeError: probability tensor contains either inf, nan or element < 0
Does anyone know how to solve it, my tensors have no 'nan' values or 'inf' I have checked them.

OpenAssistant
/

oasst-sft-4-pythia-12b-epoch-3.5

Probability tensor contains either `inf`, `nan` or element < 0