YangZhoumill
/

Qwen2.5-0.5B-Instruct

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

YangZhoumill commited on Apr 24

Commit

4fb19df

·

verified ·

1 Parent(s): b9b59cc

End of training

Files changed (2) hide show

README.md +5 -3
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,17 +1,19 @@
 ---
 base_model: Qwen/Qwen2.5-0.5B-Instruct
 library_name: transformers
-model_name: Qwen2.5-0.5B-Instruct
 tags:
 - generated_from_trainer
 - trl
 - sft
 licence: license
 ---
-# Model Card for Qwen2.5-0.5B-Instruct
-This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start

 ---
 base_model: Qwen/Qwen2.5-0.5B-Instruct
+datasets: YangZhoumill/bestofn
 library_name: transformers
+model_name: Qwen2.5-0.5B-Instruct-4230573
 tags:
 - generated_from_trainer
+- open-r1
 - trl
 - sft
 licence: license
 ---
+# Model Card for Qwen2.5-0.5B-Instruct-4230573
+This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on the [YangZhoumill/bestofn](https://huggingface.co/datasets/YangZhoumill/bestofn) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:09263e9fb122adda888ae642193e378ea2eff11e0fa1b379f8df1d8f6aeb18a3
 size 6136

 version https://git-lfs.github.com/spec/v1
+oid sha256:b1dc49088efffeadf50faa5e6ee5854a2fc275419c798d59c7babce4b8fb3e65
 size 6136