ellen625 commited on
Commit
cc55274
1 Parent(s): 46d340f

End of training

Browse files
Files changed (3) hide show
  1. README.md +5 -3
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
  license: other
 
3
  tags:
4
  - generated_from_trainer
5
- base_model: facebook/opt-125m
6
  model-index:
7
  - name: opt125_wiki_rlo_k3
8
  results: []
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 2.2410
19
 
20
  ## Model description
21
 
@@ -43,7 +43,7 @@ The following hyperparameters were used during training:
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
  - lr_scheduler_warmup_steps: 500
46
- - num_epochs: 1
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
@@ -51,6 +51,8 @@ The following hyperparameters were used during training:
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:------:|:----:|:---------------:|
53
  | 2.3927 | 0.8340 | 500 | 2.2552 |
 
 
54
 
55
 
56
  ### Framework versions
 
1
  ---
2
  license: other
3
+ base_model: facebook/opt-125m
4
  tags:
5
  - generated_from_trainer
 
6
  model-index:
7
  - name: opt125_wiki_rlo_k3
8
  results: []
 
15
 
16
  This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 2.1968
19
 
20
  ## Model description
21
 
 
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
  - lr_scheduler_warmup_steps: 500
46
+ - num_epochs: 3
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:------:|:----:|:---------------:|
53
  | 2.3927 | 0.8340 | 500 | 2.2552 |
54
+ | 2.2887 | 1.6681 | 1000 | 2.2072 |
55
+ | 2.2463 | 2.5021 | 1500 | 2.1982 |
56
 
57
 
58
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3a1f62750a9f311d2657c2125cc7136a4f08f31c0214f216d519cff7e7d2f600
3
  size 500979600
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fed5f395f9ba22e663e0156e6ce611cf75c6fb820d622d1621ce8efed2d7d147
3
  size 500979600
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9da493e0c0b83bbc1be80c1017da11c34ec3102e0d2dfe077db8ff9c2dec7c22
3
  size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eb410edaa3a781f9795f39dbd9195342bcd6f348c1e2dab3ab2b8d9173abf47b
3
  size 4920