---
license: apache-2.0
datasets:
- vietgpt/wikipedia_vi
- oscar-corpus/OSCAR-2301
language:
- vi
- en
pipeline_tag: text-generation
---

# Concept of open-llama-7b-vi

This is an OpenLLaMA model fine-tuned on Vietnamese-language text.

## Model architecture

The model architecture is the same as that of the original OpenLLaMA model.
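
Since the architecture matches OpenLLaMA, the checkpoint can presumably be loaded with the generic causal-LM classes from the `transformers` library. The following is a minimal sketch, not an official usage recipe: `MODEL_ID` is a hypothetical placeholder (this card does not state the repository id), and the helper names `load_model` and `generate` are introduced here for illustration only.

```python
# Hypothetical placeholder: replace with the real Hub repository id or a local path.
MODEL_ID = "your-namespace/open-llama-7b-vi"


def load_model(model_id=MODEL_ID):
    """Load tokenizer and model; assumes `transformers` and `torch` are installed."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return tokenizer, model


def generate(prompt, tokenizer, model, max_new_tokens=64):
    """Generate a continuation of `prompt` and return it as decoded text."""
    # Pass only input_ids so no extra tokenizer outputs reach model.generate().
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    output_ids = model.generate(input_ids=input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call would look like `tokenizer, model = load_model()` followed by `generate("Hà Nội là", tokenizer, model)`, once `MODEL_ID` points at an actual checkpoint.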

## Training Data

The model was trained on the Vietnamese version of Wikipedia.
The generated corpus files total 1.5 GB and contain approximately 1.3 million sentences.
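
Corpus figures like these can be reproduced with a short script. The sketch below assumes the corpus is stored as plain-text files with one sentence per line; that layout, the `*.txt` pattern, and the function name `corpus_stats` are assumptions for illustration, since the card does not describe the file format.

```python
from pathlib import Path


def corpus_stats(corpus_dir, pattern="*.txt"):
    """Return (total_bytes, total_sentences) for files matching `pattern`.

    Assumes one sentence per line; blank lines are not counted.
    """
    total_bytes = 0
    total_sentences = 0
    for path in sorted(Path(corpus_dir).glob(pattern)):
        total_bytes += path.stat().st_size
        # Read as bytes so the count does not depend on the text encoding.
        with path.open("rb") as f:
            total_sentences += sum(1 for line in f if line.strip())
    return total_bytes, total_sentences
```

Taking 1 GB as 10^9 bytes, the numbers above work out to roughly 1.15 KB per sentence (1.5e9 / 1.3e6).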