ren-lee commited on
Commit
08716ed
·
verified ·
1 Parent(s): e2dd082

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -4
README.md CHANGED
@@ -23,10 +23,7 @@ pipeline_tag: text-generation
23
 
24
  SEA-LION ([https://sea-lion.ai/](https://sea-lion.ai/)) is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
25
 
26
- Llama3.1 8B CPT SEA-Lionv3 Instruct is a multilingual model which has been fine-tuned with around **9,000,000 English instruction-completion pairs** alongside a smaller pool of around **6,000,000 instruction-completion pairs** from other ASEAN languages, such as Indonesian, Thai and Vietnamese.
27
- These instructions have been carefully curated and rewritten to ensure the model was trained on truly open, commercially permissive and high quality datasets.
28
-
29
- Llama3.1 8B CPT SEA-Lionv3 Instruct has undergone additional supervised fine-tuning and alignment compared to the now deprecated Llama3 8B CPT SEA-Lionv2.1 Instruct. These improvements have increased the model's capabilities in chat interactions and its ability to follow instructions accurately.
30
 
31
  SEA-LION stands for _Southeast Asian Languages In One Network_.
32
 
 
23
 
24
  SEA-LION ([https://sea-lion.ai/](https://sea-lion.ai/)) is a collection of Large Language Models (LLMs) which has been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
25
 
26
+ Llama3.1 8B CPT SEA-LIONv3 Instruct is a multilingual model that has been fine-tuned in two stages on approximately **12.3M English instruction-completion pairs** alongside a pool of **4.5M Southeast Asian instruction-completion pairs** from ASEAN languages, such as Indonesian, Thai, Vietnamese and Tamil.
 
 
 
27
 
28
  SEA-LION stands for _Southeast Asian Languages In One Network_.
29