File size: 502 Bytes
8257f7c 4535863 3266fa7 8257f7c e0973a0 8257f7c 2491e22 8257f7c 69ab5b7 8257f7c 89bce89 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
---
license: apache-2.0
datasets:
- vietgpt/wikipedia_vi
- oscar-corpus/OSCAR-2301
language:
- vi
- en
pipeline_tag: text-generation
---
# Concept of open-llama-7b-vi
This is a OpenLLama model finetuned on texts in the Vietnamese language.
## Model architecture
The model architecture is the same as the original OpenLLama model
## Training Data
The models are trained on the Vietnamese version of Wikipedia.
The generated corpus files are 1.5GB in total, containing approximately 1.3M sentences. |