fix typo
Browse files
README.md
CHANGED
@@ -80,7 +80,7 @@ trl chat --model_name_or_path HuggingFaceTB/SmolLM-135M-Instruct --device cpu
|
|
80 |
Additionally, the generated content may not always be factually accurate, logically consistent, or free from biases present in the training data, we invite users to leverage them as assistive tools rather than definitive sources of information. We find that they can handle general knowledge questions, creative writing and basic Python programming. But they are English only and may have difficulty with arithmetics, editing tasks and complex reasoning. For more details about the models' capabilities, please refer to our [blog post](https://huggingface.co/blog/smollm).
|
81 |
|
82 |
## Training parameters
|
83 |
-
We train the models using the [
|
84 |
- 1 epoch
|
85 |
- lr 1e-3
|
86 |
- cosine schedule
|
|
|
80 |
Additionally, the generated content may not always be factually accurate, logically consistent, or free from biases present in the training data, we invite users to leverage them as assistive tools rather than definitive sources of information. We find that they can handle general knowledge questions, creative writing and basic Python programming. But they are English only and may have difficulty with arithmetics, editing tasks and complex reasoning. For more details about the models' capabilities, please refer to our [blog post](https://huggingface.co/blog/smollm).
|
81 |
|
82 |
## Training parameters
|
83 |
+
We train the models using the [alignment-handbook](https://github.com/huggingface/alignment-handbook) with the datasets mentioned in the changelog, using these parameters for v0.2:
|
84 |
- 1 epoch
|
85 |
- lr 1e-3
|
86 |
- cosine schedule
|