Decreased performance with the recently updated model?

#14
by Roy-Shih - opened

Compared with the model when it was first released (the version from about a month ago), the performance of the current version seems to have dropped a lot, especially in Chinese.
I also noticed that the Chinese output does not seem to be generated token by token when using TextIteratorStreamer, but rather paragraph by paragraph. Did something go wrong with the sampling?
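For context, this is roughly the streaming setup I am using (a minimal sketch; the repo id, prompt, and sampling settings here are illustrative, not my exact configuration):

```python
# Minimal sketch of streaming generation with TextIteratorStreamer.
# The repo id and generation settings below are illustrative.
from threading import Thread

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, TextIteratorStreamer

model_id = "mosaicml/mpt-7b-chat"  # illustrative; substitute the actual repo
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,  # MPT uses custom modeling code
)

prompt = "..."  # prompt text goes here
inputs = tokenizer(prompt, return_tensors="pt")

# skip_prompt=True keeps the echoed prompt out of the streamed output
streamer = TextIteratorStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

generation_kwargs = dict(
    **inputs,
    streamer=streamer,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.7,
)

# generate() blocks, so run it in a background thread and consume the streamer
thread = Thread(target=model.generate, kwargs=generation_kwargs)
thread.start()
for new_text in streamer:
    print(new_text, end="", flush=True)
thread.join()
```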

btw, is there any way to get the previous version of the model?

Mosaic ML, Inc. org

We didn't alter the model...

I think I might be missing something. Is there any difference between commit "6e6da7b9cdb21eefe6dd8ac9a083554a99d4ce5e" and the previous one? I saw that it updated the .bin file.
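In case it helps anyone else comparing versions: you can pin `from_pretrained` to a specific repository commit with the `revision` argument (a minimal sketch; the repo id is illustrative, and the hash is just the commit mentioned above, not necessarily the "previous" one):

```python
# Sketch: loading a specific commit of the repo via the `revision` argument.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mosaicml/mpt-7b-chat"  # illustrative repo id
revision = "6e6da7b9cdb21eefe6dd8ac9a083554a99d4ce5e"  # commit hash to pin to

tokenizer = AutoTokenizer.from_pretrained(model_id, revision=revision)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    revision=revision,
    trust_remote_code=True,
)
```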

Mosaic ML, Inc. org

No new versions of this particular model have been trained, so I can promise you nothing has changed.

Got it, I'm testing it now. Thanks for the quick reply.

When I updated the transformers library to 4.31.0, it worked.
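In case anyone hits the same issue, a quick way to confirm which version is actually loaded in your environment (a trivial sketch):

```python
# Check the installed transformers version before debugging streaming output.
import transformers
print(transformers.__version__)  # expecting 4.31.0 or later
```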

Thanks!
