versae's picture
BPE and Unigram SentencePiece tokenizers for French, Italian, Portuguese, Romanian, and Spanish
20ab046