Update README.md
Browse files
README.md
CHANGED
@@ -33,7 +33,7 @@ The continued pre-training data for Gemma2 9B CPT SEA-LIONv3 base model encompas
|
|
33 |
- **Languages:** English, Chinese, Vietnamese, Indonesian, Thai, Tagalog, Tamil, Malay, Khmer, Lao, Burmese
|
34 |
- **License:** [Gemma Community License](https://ai.google.dev/gemma/terms)
|
35 |
|
36 |
-
For
|
37 |
|
38 |
### Benchmark Performance
|
39 |
We evaluated Gemma2 9B CPT SEA-LIONv3 base model on general language capabilities.
|
|
|
33 |
- **Languages:** English, Chinese, Vietnamese, Indonesian, Thai, Tagalog, Tamil, Malay, Khmer, Lao, Burmese
|
34 |
- **License:** [Gemma Community License](https://ai.google.dev/gemma/terms)
|
35 |
|
36 |
+
For tokenisation, the model employs the default tokenizer used in Gemma-2-9B.
|
37 |
|
38 |
### Benchmark Performance
|
39 |
We evaluated Gemma2 9B CPT SEA-LIONv3 base model on general language capabilities.
|