Feedback / ChatML format issues.

by SerialKicked - opened May 23

May 23

•

This ain't the test I promised because I hit this recurrent issue with your model:

Sometimes, it tries to add "<|user|>" and "<|system|>" fields to its response (and yes, it's adding stuff that would fit user or system info after that). It's relatively rare, but it sounds like something went wrong with the formatting at some point during the merge or the tuning of the parent models. If it was "<|im_start|>user" or "<|im_start|>system" I would at least get it, but here, I'm not sure. That would suggest that it's not trained on ChatML, but on whatever this is instead.

Any idea?

grimjim

Owner May 23

•

edited May 23

I've not seen that, but once in a blue moon I've seen an output seemingly stall before ending. Maybe my front-end is hiding it from me. Not all the merge components were trained for ChatML, which might explain why the problem is happening only occasionally. It's possible fine-tuning could address this.

Will test out Alpaca format later.

saishf

May 24

I've not seen that, but once in a blue moon I've seen an output seemingly stall before ending. Maybe my front-end is hiding it from me. Not all the merge components were trained for ChatML, which might explain why the problem is happening only occasionally. It's possible fine-tuning could address this.

Will test out Alpaca format later.

Seems to be Zephyr's prompt "<|user|>\n ... <|system|>" and i found this relating to cut off messages. "Truncating Response"

SerialKicked

May 24

•

edited May 24

obligatory

To be fair, most of the time it works, it's relatively easy to filter out, and the model is still quite good (in my super official first test). Also it's the first mistral model for which I don't have to push the Rep.Pen. a ton to get decently written and non repetitive outputs.

grimjim

Owner May 24

My personal take on this model is that it's a mildly unstable merge, but the tradeoff is improved creativity.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment