hungeni committed on
Commit
3608f10
1 Parent(s): c3e3e7f

Update README.md

Files changed (1)
  1. README.md +6 -2
README.md CHANGED
@@ -1,7 +1,11 @@
  ---
  datasets:
- - QingyiSi/Alpaca-CoT
  - tatsu-lab/alpaca
+ - ewof/alpaca-instruct-unfiltered
+ - databricks/databricks-dolly-15k
+ - teknium/GPTeacher-General-Instruct
+ - garage-bAInd/Open-Platypus
+ - Honkware/oasst1-alpaca-json
  - GAIR/lima
  language:
  - vi
@@ -9,6 +13,6 @@ language:

  + LLaMa2-7B Chat model, with the vocab size extended to 44800 for Vietnamese understanding.
  + Continual pre-training with 2B Vietnamese tokens aligned from the VnNews Corpus, 10K vnthuquan books, and wikipedia_vi.
- + Fine-tuning with the vietllama2-tiny dataset, a combination of [Alpaca, CoT, LIMA, daily chat] translated into Vietnamese using OpenAI GPT-3.
+ + Fine-tuning with the vietllama2-tiny dataset, a combination of various datasets translated into Vietnamese using OpenAI GPT-3.

  + For more information: email me at duyhunghd6@gmail.com | http://fb.com/hungbui2013
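A minimal sketch (not part of this commit) of how the README's extended 44800-token vocabulary could be checked with the Hugging Face transformers library; the repository id below is a placeholder, since the diff does not name the model repo.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repo id for illustration only; the actual repo is not stated in the diff.
repo_id = "hungeni/llama2-7b-chat-vi"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

# The README states the vocabulary was extended to 44800 for Vietnamese;
# the tokenizer and the model's input embedding matrix should agree on that size.
print(len(tokenizer))                             # expected: 44800 per the README
print(model.get_input_embeddings().weight.shape)  # (vocab_size, hidden_dim)
```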