Differences with mistral-7b-v0.2?

#7
by mallorbc - opened

From my understanding, the v0.2 model also had a 32k context window(without sliding window). Is the only difference here then the different tokenizers?

yes im also confused ?
was this given other unicode characters ?
such as chinese and japanese and sancrit and amarhiric ? ( the non standards ?) was it trained on multi lingugal ?? what are the actual changes ?

Hi there, the main changes are as mentionned the improved tokenizer and so the higher vocabulary!

Sign up or log in to comment