pkupie
/

Llama-2-7b-mon

Model card Files Files and versions Community

KobayashiKanna01 commited on 13 days ago

Commit

30971b3

•

1 Parent(s): 9f37d2b

Update README.md

Files changed (1) hide show

README.md +22 -3

README.md CHANGED Viewed

@@ -1,3 +1,22 @@
----
-license: llama2
----

+---
+license: llama2
+datasets:
+- pkupie/mc2_corpus
+- togethercomputer/RedPajama-Data-1T
+language:
+- en
+- mn
+base_model:
+- meta-llama/Llama-2-7b-hf
+---
+A continually pre-trained model based on Llama-2-7b-hf.
+We use the **Traditional Mongolian texts** in MC^2 and **English texts** in RedPajama with a proportion of **4:1** for training.
+#### Hyper-parameters:
+ * lr: 3e-5
+ * batch size: 1M (2K*512)
+ * lr scheduler: cosine
+ * min lr: 1e-6
+ * lr decay iters: 10240