ellen625/opt125_wiki_rlo_k3


ellen625 committed on May 21

Commit cc55274 • 1 Parent(s): 46d340f

End of training

Files changed (3)

README.md +5 -3
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -1,8 +1,8 @@
 ---
 license: other
+base_model: facebook/opt-125m
 tags:
 - generated_from_trainer
-base_model: facebook/opt-125m
 model-index:
 - name: opt125_wiki_rlo_k3
   results: []
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.2410
+- Loss: 2.1968
 ## Model description
@@ -43,7 +43,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 1
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
@@ -51,6 +51,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 2.3927        | 0.8340 | 500  | 2.2552          |
+| 2.2887        | 1.6681 | 1000 | 2.2072          |
+| 2.2463        | 2.5021 | 1500 | 2.1982          |
 ### Framework versions
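
The README diff above changes the reported evaluation loss from 2.2410 to 2.1968 and extends training from 1 to 3 epochs. As a rough way to read those numbers, the sketch below converts each loss to perplexity and estimates the optimizer steps per epoch from the logged table rows. It assumes the reported loss is a mean per-token cross-entropy in nats, which the diff does not state; the helper names `perplexity` and `steps_per_epoch` are illustrative and not part of the repository.

```python
import math

# Evaluation losses reported in the README before and after this commit.
LOSS_BEFORE = 2.2410
LOSS_AFTER = 2.1968

# (epoch, step) pairs copied from the training-results table.
CHECKPOINTS = [(0.8340, 500), (1.6681, 1000), (2.5021, 1500)]


def perplexity(loss: float) -> float:
    """Convert a loss to perplexity.

    Assumption: the loss is a mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


def steps_per_epoch(rows) -> float:
    """Average number of optimizer steps per epoch implied by the rows."""
    return sum(step / epoch for epoch, step in rows) / len(rows)


if __name__ == "__main__":
    print(f"perplexity before: {perplexity(LOSS_BEFORE):.2f}")
    print(f"perplexity after:  {perplexity(LOSS_AFTER):.2f}")
    per_epoch = steps_per_epoch(CHECKPOINTS)
    print(f"estimated steps per epoch: {per_epoch:.1f}")
    print(f"estimated steps for 3 epochs: {3 * per_epoch:.0f}")
```

Under that assumption, the lower loss corresponds to a lower perplexity on the evaluation set, and the table rows imply a bit under 600 steps per epoch.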

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3a1f62750a9f311d2657c2125cc7136a4f08f31c0214f216d519cff7e7d2f600
+oid sha256:fed5f395f9ba22e663e0156e6ce611cf75c6fb820d622d1621ce8efed2d7d147
 size 500979600

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9da493e0c0b83bbc1be80c1017da11c34ec3102e0d2dfe077db8ff9c2dec7c22
+oid sha256:eb410edaa3a781f9795f39dbd9195342bcd6f348c1e2dab3ab2b8d9173abf47b
 size 4920
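
The `model.safetensors` and `training_args.bin` entries in this commit are Git LFS pointer files, so the diff shows only a changed `oid` while `size` stays the same. The sketch below parses such a pointer into its fields; `parse_lfs_pointer` is a hypothetical helper written for illustration, and it handles only the three keys that appear in this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, size.

    Hypothetical helper: covers only the `version`, `oid`, and `size` keys
    that appear in the pointers shown in this commit.
    """
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New pointer for model.safetensors, copied from the diff above.
NEW_MODEL_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:fed5f395f9ba22e663e0156e6ce611cf75c6fb820d622d1621ce8efed2d7d147
size 500979600
"""

if __name__ == "__main__":
    info = parse_lfs_pointer(NEW_MODEL_POINTER)
    print(info["hash_algo"], info["digest"][:12], info["size"])
    # Assumption: weights are stored as 4-byte float32 values.
    print(f"approx. parameters at 4 bytes each: {info['size'] / 4 / 1e6:.1f}M")
```

If the weights are stored as float32 (an assumption, not stated in the commit), the unchanged size of 500979600 bytes works out to roughly 125 million parameters, which is in line with the `facebook/opt-125m` base model.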