beomi committed on
Commit 9f1bbde
• 1 Parent(s): 6f9a966

Update README.md

Files changed (1): README.md (+4 −1)

README.md CHANGED
@@ -14,6 +14,8 @@ tags:
 - llama-2-ko
 ---
 
+> 🚧 Note: this repo is under construction 🚧
+
 # **Llama-2-Ko** 🦙🇰🇷
 
 Llama-2-Ko serves as an advanced iteration of Llama 2, benefiting from an expanded vocabulary and the inclusion of a Korean corpus in its further pretraining. Just like its predecessor, Llama-2-Ko operates within the broad range of generative text models that stretch from 7 billion to 70 billion parameters. This repository focuses on the 7B pretrained version, which is tailored to fit the Hugging Face Transformers format. For access to the other models, feel free to consult the index provided below.
@@ -34,7 +36,8 @@ Llama-2-Ko is an auto-regressive language model that uses an optimized transform
 
 ||Training Data|Params|Content Length|GQA|Tokens|LR|
 |---|---|---|---|---|---|---|
-|Llama 2|*A new mix of Korean online data*|7B|4k|&#10007;|>40B|1e<sup>-5</sup>|
+|Llama 2|*A new mix of Korean online data*|7B|4k|&#10007;|>40B*|1e<sup>-5</sup>|
+*Plan to train up to 200B tokens
 
 **Vocab Expansion**
 
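Since the README describes the 7B pretrained version as packaged in the Hugging Face Transformers format, a minimal loading sketch may be useful. The repo id `beomi/llama-2-ko-7b` is an assumption inferred from the commit author and model name, not something stated in this diff:

```python
# Sketch: load the checkpoint with Hugging Face Transformers and generate text.
# The repo id below is an assumption (commit author + model name); substitute
# the actual Hub id if it differs. Downloading a 7B model requires substantial
# disk space and memory.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "beomi/llama-2-ko-7b"  # assumed Hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Encode a Korean prompt and sample a short continuation.
inputs = tokenizer("안녕하세요", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Standard `AutoTokenizer`/`AutoModelForCausalLM` classes are used because the README says the repo is tailored to the Transformers format; no model-specific class is assumed.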