Update README.md
Browse files
README.md
CHANGED
@@ -89,7 +89,7 @@ Keeping this in mind:
|
|
89 |
|
90 |
## Results
|
91 |
|
92 |
-
I trained StableLM-3B-4e1t repeatedly on [https://huggingface.co/datasets/euclaise/TinyCoT
|
93 |
|
94 |
I trained once with ReMask/ReMask-CoT, once without regularization to match Masked Thought (w/ partial label-masking for CoT), and once with SFT.
|
95 |
|
|
|
89 |
|
90 |
## Results
|
91 |
|
92 |
+
I trained StableLM-3B-4e1t repeatedly on [TinyCoT](https://huggingface.co/datasets/euclaise/TinyCoT), along with 1000 examples from [reddit-instruct-curated](https://huggingface.co/datasets/euclaise/reddit-instruct-curated) and 1000 examples from [oasst2-curated](https://huggingface.co/datasets/sablo/oasst2_curated).
|
93 |
|
94 |
I trained once with ReMask/ReMask-CoT, once without regularization to match Masked Thought (w/ partial label-masking for CoT), and once with SFT.
|
95 |
|