---
license: llama2
---

A copy of only the tokenizer from https://huggingface.co/meta-llama/Llama-2-7b-hf

Code example
```python
from transformers import LlamaTokenizerFast

# Load the tokenizer files from this repository
tokenizer = LlamaTokenizerFast.from_pretrained("dinhanhx/llama-tokenizer-hf")
text = "Do bạch kim rất quý nên sẽ dùng để lắp vô xương"
# Encode the text to token ids, then map the ids back to token strings
print(tokenizer.convert_ids_to_tokens(tokenizer.encode(text)))
```
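
The printed tokens may include entries such as `<0xE1>`. As far as I know, the Llama 2 tokenizer is a SentencePiece model with byte fallback, so a character missing from the vocabulary is emitted as one token per UTF-8 byte. The sketch below only illustrates that naming scheme in plain Python. `byte_fallback_tokens` is a hypothetical helper, not part of `transformers`, and whether a given character actually hits the fallback depends on the vocabulary.

```python
def byte_fallback_tokens(piece: str) -> list:
    """Render each UTF-8 byte of `piece` in the <0xXX> style.

    Hypothetical helper for illustration only; it does not consult
    the tokenizer vocabulary.
    """
    return [f"<0x{b:02X}>" for b in piece.encode("utf-8")]


# "\u1ea1" is the precomposed Vietnamese letter "ạ" (3 bytes in UTF-8)
print(byte_fallback_tokens("\u1ea1"))
```

Comparing this with the output of the code example above shows which characters, if any, were split into bytes.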