llama-tokenizer-hf / README.md
dinhanhx's picture
Update README.md
477bda8
|
raw
history blame contribute delete
No virus
394 Bytes
metadata
license: llama2

A copy of only tokenizer from https://huggingface.co/meta-llama/Llama-2-7b-hf

Le code example

from transformers import LlamaTokenizerFast
tokenizer = LlamaTokenizerFast.from_pretrained("dinhanhx/llama-tokenizer-hf")
text = "Do bạch kim rất quý nên sẽ dùng để lắp vô xương"
print(tokenizer.convert_ids_to_tokens(tokenizer.encode(text)))