Asking about `tokenizer.model` file

#1
by bibekyess - opened

Hi! I was wondering how did you obtain tokenizer.model file? In other available korean base llama models like beomi/llama-2-ko-7b, they used FastTokenizer provided by HF tokenizers not the sentencepiece package, so the output doesn't contain tokenizer.model. Can you share how you obtained tokenizer.model file from tokenizer.json?
Thank you for your help!

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment