Recursive loop when implement model

by ann-lab52 - opened Feb 20, 2024

Feb 20, 2024

Currently, we tried to implement the model with OpenAI API standard. The setup is all good but the model's output having recursive loop issue. This happen on both Completion API and Chat Completion API, we don't know what causing this because other models we implement before didn't met this issue. Is there any solution for this?

-----Issue Example-----

Chat Completion Input

"{messages": [{"role": "user", "content": "Hi"}]}

Chat Completion Output

{ "message": { "role": "assistant","content": "浈鳙ichern��"}

Completion Input

"{"prompt": "San Francisco is a"}

Completion Output

{"text": " Growinggo\n[ xu/CompleteiabilityIONfidfCoderocomnDidLoadlify iable hf iphabit hoypoendiff"}

qnguyen3

Owner Feb 21, 2024

Hi @ann-lab52 , i believe the tokenizer is not updated to use with Chat Completion.

qnguyen3

Owner Feb 21, 2024

•

edited Feb 21, 2024

@ann-lab52 updated, now you should be able to use it with Completion API . however, i think this model is having some issue doing inference on VLLM + TGI at the same time, not sure why. for now please use vanilla transformers if you run into any problem

ann-lab52

Feb 22, 2024

•

edited Feb 22, 2024

Hi @qnguyen3 , thanks for the fast reply. Unfortunately as you expected, the tokenizer is not the only cause and this issue persisted even though we used the latest model update. Hopefully this will be fixed in your future works. Also, if possible, we would be happy to have your permission to quantize your models and make them available to the community follows your model's license. Please let us know if you have any other requirements, we will be happy to follow your instructions.

qnguyen3

Owner Feb 22, 2024

HIi @ann-lab52 . That is absolutely fine! No worries about that

qnguyen3

Owner Feb 22, 2024

btw, if you are still looking for a working alternative, here is one: vilm/Quyen-Mini-v0.1

This new model is actually an update of this current model. It uses Qwen 1.5 (1.8B) as the backbone and finetune on the same dataset + DPO. The score is also better than this model.

ann-lab52 changed discussion status to closed Feb 23, 2024

ann-lab52

Feb 26, 2024

Thanks @qnguyen3 , model vilm/Quyen-Mini-v0.1 works excellent. Also, quan-1.8B-chat-AWQ was uploaded and other AWQ version of your models will be release soon. Thanks again for your permission.

qnguyen3

Owner Feb 26, 2024

Thank you you much @ann-lab52 , i will add AWQ!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment