HuggingFaceTB/SmolLM-135M-Instruct

loubnabnl HF staff committed on Aug 17, 2024

Commit 6e6cf67 • 1 Parent(s): 0a3fbf4

fix typo

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -80,7 +80,7 @@ trl chat --model_name_or_path HuggingFaceTB/SmolLM-135M-Instruct --device cpu
 Additionally, the generated content may not always be factually accurate, logically consistent, or free from biases present in the training data, we invite users to leverage them as assistive tools rather than definitive sources of information. We find that they can handle general knowledge questions, creative writing and basic Python programming. But they are English only and may have difficulty with arithmetics, editing tasks and complex reasoning. For more details about the models' capabilities, please refer to our [blog post](https://huggingface.co/blog/smollm).
 ## Training parameters
-We train the models using the [alignement-handbook](https://github.com/huggingface/alignment-handbook) with the datasets mentioned in the changelog, using these parameters for v0.2:
+We train the models using the [alignment-handbook](https://github.com/huggingface/alignment-handbook) with the datasets mentioned in the changelog, using these parameters for v0.2:
 - 1 epoch
 - lr 1e-3
 - cosine schedule