split_special_tokens set to true

by mortendybdal - opened Apr 23

Discussion

mortendybdal

Apr 23

Hi Syvai,

Awesome work you guys are doing!!

I have tried out your model, running it on Runpod with this command:

vllm serve syvai/hviske-v5 --host 0.0.0.0 --port 8006 --trust-remote-code --dtype bfloat16

This didn't work, and I finally found out that split_special_tokens was set to true. That caused vLLM's CohereASR decoder prompt to be split into subword garbage by SentencePiece instead of being recognized as the corresponding special token IDs. Once I set split_special_tokens to false, the results were great.
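In case it helps anyone else, here is a minimal sketch of the change. It assumes the flag lives in a local copy of the model's tokenizer_config.json, which is where Hugging Face tokenizers usually keep it. That file location and the helper name disable_split_special_tokens are my assumptions for illustration, not something confirmed for this repo.

```python
import json
from pathlib import Path


def disable_split_special_tokens(config_path):
    """Set split_special_tokens to False in a tokenizer config JSON file.

    Returns the previous value of the flag (None if it was absent).
    The file layout is an assumption: a flat JSON object such as
    tokenizer_config.json in a locally downloaded model directory.
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))

    # Remember the old value so the caller can see what changed.
    previous = config.get("split_special_tokens")

    # With the flag off, special tokens in the prompt are kept whole
    # instead of being broken into subword pieces.
    config["split_special_tokens"] = False

    path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
    return previous
```

Calling disable_split_special_tokens on the tokenizer_config.json inside a local model directory and then pointing vllm serve at that directory instead of the hub ID should pick up the change, assuming vLLM reads the tokenizer settings from that file.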

I don't know if it is just because I am running it with the vLLM CLI, but I wanted to let you know :-)

mhenrichsen

syv.ai org Apr 24

Hi Morten

Thanks for letting us know :) This is just a test release and not the official release - better things are coming!

mortendybdal

Apr 24

Looking forward to it!!
