Update README.md
README.md CHANGED
@@ -74,7 +74,7 @@ Please refer [README.md of llm-ja-tokenizer](https://github.com/llm-jp/llm-jp-to
 
 ## Datasets
 
-
+### Pre-training
 
 The models have been pre-trained on approximately 287.5B tokens, sourced from a blend of the following datasets.
 
@@ -88,7 +88,7 @@ The models have been pre-trained on approximately 287.5B tokens, sourced from a
 
 Pretraining was done in 10 shards, each consisting of approximately 27-28B tokens. We further finalized the pretraining with an additional 27B tokens of cleaned data.
 
-
+### Instruction tuning
 
 The models have been fine-tuned on the following datasets.
 
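The sharded pretraining schedule mentioned in the context lines above can be illustrated with a minimal sketch. This is not the project's actual data pipeline: it assumes the corpus is a flat stream of token IDs and that shards are contiguous, roughly equal slices; real pipelines typically shard at the document level and interleave sources. The function name `make_shards` and the toy corpus are illustrative only.

```python
# Minimal sketch of 10-shard construction for pretraining data (an assumption,
# not the llm-jp pipeline): split one token stream into contiguous,
# near-equal shards.
from typing import List, Sequence

def make_shards(token_ids: Sequence[int], num_shards: int = 10) -> List[Sequence[int]]:
    """Split a token stream into `num_shards` contiguous, near-equal shards."""
    shard_size, remainder = divmod(len(token_ids), num_shards)
    shards, start = [], 0
    for i in range(num_shards):
        # The first `remainder` shards absorb one extra token each.
        end = start + shard_size + (1 if i < remainder else 0)
        shards.append(token_ids[start:end])
        start = end
    return shards

# Toy usage: 287 tokens stand in for the ~287.5B real ones.
corpus = list(range(287))
shards = make_shards(corpus)
assert sum(len(s) for s in shards) == len(corpus)
print([len(s) for s in shards])  # [29, 29, 29, 29, 29, 29, 29, 28, 28, 28]
```

Each shard here plays the role of one of the ~27-28B-token folds; training proceeds through the shards in sequence, with the final cleaned 27B tokens used to finish pretraining.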