Higher error rate than original

#4
by PartyParrot - opened

Both the Q8_0 and f16 versions get stuck in infinite repetition loops much more often than the original GLM-OCR transformers model. I'd recommend that everyone use the original until this issue is fixed.
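For anyone who wants to put a number on this, here is a rough sketch of how the repeat rate could be compared across versions. It assumes you have already collected the text outputs of each model for the same set of images. The function names (`has_trailing_repeat`, `repeat_rate`) and the thresholds (`min_unit`, `min_repeats`) are my own placeholders, not part of GLM-OCR or any library, and the check only catches output that ends in a back-to-back repeated chunk.

```python
from typing import Iterable


def has_trailing_repeat(text: str, min_unit: int = 4, min_repeats: int = 5) -> bool:
    """Return True if `text` ends with the same chunk repeated back to back.

    `min_unit` and `min_repeats` are arbitrary thresholds (assumptions);
    tune them for your own data.
    """
    n = len(text)
    # Try every chunk length that could fit `min_repeats` times in the text.
    for unit in range(min_unit, n // min_repeats + 1):
        tail = text[n - unit:]
        # The last `unit * min_repeats` characters must be `tail` repeated.
        if text[n - unit * min_repeats:] == tail * min_repeats:
            return True
    return False


def repeat_rate(outputs: Iterable[str]) -> float:
    """Fraction of outputs that end in a repetition loop."""
    outputs = list(outputs)
    if not outputs:
        return 0.0
    return sum(has_trailing_repeat(o) for o in outputs) / len(outputs)
```

Running `repeat_rate` on the outputs of the original model and of each quantized version for the same images would give comparable figures. Because it looks only at the end of the text, it works best on generations that ran until the token limit.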
