HuggingFaceTB/finemath-ablation-fwedu

loubnabnl (HF staff) committed 8 days ago

Commit f844eda • 1 parent: 01b4bf7

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -13,7 +13,7 @@ base_model:
 ## Model summary
 This model is part of the 📐 [FineMath](https://huggingface.co/datasets/HuggingFaceTB/finemath) ablations, we continue pretraining [Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) base on different math datasets for 60B tokens.
-The model has 3.21B parameters and 4096 context length. It was trained on 60B tokens from [FineWeb-Edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu), tokenized using `llama3` tokenizer.
+The model has 3.21B parameters and 4096 context length. It was trained on **60B tokens** from [FineWeb-Edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu), tokenized using `llama3` tokenizer.
 - **License**: Apache-2
 - **Languages**: English