File size: 394 Bytes
8ad7d9c 5527bc9 87042f9 477bda8 c2ecdd5 87042f9 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 |
---
license: llama2
---
A copy of only tokenizer from https://huggingface.co/meta-llama/Llama-2-7b-hf
Le code example
```python
from transformers import LlamaTokenizerFast
tokenizer = LlamaTokenizerFast.from_pretrained("dinhanhx/llama-tokenizer-hf")
text = "Do bạch kim rất quý nên sẽ dùng để lắp vô xương"
print(tokenizer.convert_ids_to_tokens(tokenizer.encode(text)))
```
|