English comma in generated Arabic text

#6
by akhooli - opened

I tested the chat model in 8-bit mode and noticed that the generated Arabic text uses the English comma (,) instead of the Arabic one (،).
Is this caused by the model's training data?

This issue can be worked around with a regex replacement on the generated text. But if they could fix it in the training data instead, that would be much better...
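For anyone who wants to try the regex workaround, here is a minimal sketch in Python. It is an assumption about how such a replacement could look, not something taken from the model's code: the function name `fix_arabic_commas` is made up for illustration, and the rule "only replace a comma that directly follows an Arabic letter or diacritic" is my own choice so that commas in English text or in numbers like 1,000 are left alone.

```python
import re

ARABIC_COMMA = "\u060C"  # the Arabic comma character

# Match an English comma only when the character right before it is an
# Arabic letter or diacritic (U+0621..U+0652). This range is an assumption
# chosen for illustration; extend it if your text uses other Arabic blocks.
_COMMA_AFTER_ARABIC = re.compile(r"(?<=[\u0621-\u0652]),")


def fix_arabic_commas(text: str) -> str:
    """Replace English commas that follow Arabic characters with the Arabic comma."""
    return _COMMA_AFTER_ARABIC.sub(ARABIC_COMMA, text)


if __name__ == "__main__":
    generated = "مرحبا, كيف حالك؟"
    print(fix_arabic_commas(generated))
```

This only post-processes the output string, so it does not change what the model itself generates. A comma that follows a space or a Latin character is not touched, which may or may not be what you want.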
