YangZhoumill committed on
Commit 4fb19df · verified · 1 Parent(s): b9b59cc

End of training

Files changed (2)
  1. README.md +5 -3
  2. training_args.bin +1 -1
README.md CHANGED
@@ -1,17 +1,19 @@
 ---
 base_model: Qwen/Qwen2.5-0.5B-Instruct
+datasets: YangZhoumill/bestofn
 library_name: transformers
-model_name: Qwen2.5-0.5B-Instruct
+model_name: Qwen2.5-0.5B-Instruct-4230573
 tags:
 - generated_from_trainer
+- open-r1
 - trl
 - sft
 licence: license
 ---
 
-# Model Card for Qwen2.5-0.5B-Instruct
+# Model Card for Qwen2.5-0.5B-Instruct-4230573
 
-This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct).
+This model is a fine-tuned version of [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct) on the [YangZhoumill/bestofn](https://huggingface.co/datasets/YangZhoumill/bestofn) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
 
 ## Quick start
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:09263e9fb122adda888ae642193e378ea2eff11e0fa1b379f8df1d8f6aeb18a3
+oid sha256:b1dc49088efffeadf50faa5e6ee5854a2fc275419c798d59c7babce4b8fb3e65
 size 6136
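
Note on the training_args.bin change: the file is stored through Git LFS, so the diff shows only its pointer file (a `version` line, an `oid` line, and a `size` line). The following is a minimal Python sketch for comparing the two pointers shown above. The helper names `parse_lfs_pointer` and `changed_fields` are hypothetical and chosen for illustration; they are not part of Git LFS or any Hugging Face library.

```python
from typing import Dict, List

# Pointer contents copied from the two sides of the diff above.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:09263e9fb122adda888ae642193e378ea2eff11e0fa1b379f8df1d8f6aeb18a3
size 6136
"""

NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:b1dc49088efffeadf50faa5e6ee5854a2fc275419c798d59c7babce4b8fb3e65
size 6136
"""


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Split each 'key value' line of a pointer file into a dict entry."""
    fields: Dict[str, str] = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def changed_fields(old: Dict[str, str], new: Dict[str, str]) -> List[str]:
    """Return the sorted keys whose values differ between two pointers."""
    return sorted(k for k in old.keys() | new.keys() if old.get(k) != new.get(k))


old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
print(changed_fields(old, new))  # ['oid']
```

Only the `oid` differs: the stored binary has a new SHA-256 hash while its size is still 6136 bytes.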