End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -4,8 +4,7 @@ library_name: transformers
 model_name: SmolLM2-FT-DPO2
 tags:
 - generated_from_trainer
-- smol-course
-- module_1
 - trl
 - dpo
 licence: license
@@ -29,7 +28,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/thatupiso-code-org/huggingface/runs/qr19ujp2)
 This model was trained with DPO, a method introduced in [Direct Preference Optimization: Your Language Model is Secretly a Reward Model](https://huggingface.co/papers/2305.18290).

 model_name: SmolLM2-FT-DPO2
 tags:
 - generated_from_trainer
+- dpo-smolK12-100
 - trl
 - dpo
 licence: license
 ## Training procedure
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/thatupiso-code-org/huggingface/runs/xpcn3ywm)
 This model was trained with DPO, a method introduced in [Direct Preference Optimization: Your Language Model is Secretly a Reward Model](https://huggingface.co/papers/2305.18290).

runs/Dec12_20-07-50_3393362a4d02/events.out.tfevents.1734034093.3393362a4d02.7235.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c4fe8439c4d0e4bf0e674877c5e8b1f87e3d1eca8060c9c4efde206930a3d294
+size 13260

runs/Dec12_20-23-09_3393362a4d02/events.out.tfevents.1734035019.3393362a4d02.7235.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0f9f0e0fe7b9f89ae5720ca6baf57453966f4c8c3158a921f02bb12b8d8aa3bf
+size 27116

runs/Dec12_20-24-40_3393362a4d02/events.out.tfevents.1734035190.3393362a4d02.7235.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:853133961a02f0d971d003a31cb5cef46086c39367d37c950dd84c673478a3d0
+size 40308

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a1c3b48e802c46b7298aea7b9c84993251d389226ee3bef09e82528f7370ec07
 size 6072

 version https://git-lfs.github.com/spec/v1
+oid sha256:82c7343ca0657002fe93055a64e96268492c8ea1017027804786a56752e20079
 size 6072