gnumanth/dadjokes-tuned-opt

Text Generation

gnumanth/dadjokes-trained-opt

Inference Endpoints

text-generation-inference

gnumanth committed on Feb 24

Commit 2dcf7da, 1 parent: 547c0a8

chore: more info

Files changed (1)

README.md +14 -17


@@ -1,35 +1,30 @@
 ---
-license: other
 base_model: facebook/opt-350m
 tags:
 - trl
 - sft
-- generated_from_trainer
 model-index:
 - name: tmp_trainer
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# tmp_trainer
-This model is a fine-tuned version of [facebook/opt-350m](https://huggingface.co/facebook/opt-350m) on an unknown dataset.
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -44,11 +39,13 @@ The following hyperparameters were used during training:
 ### Training results
 ### Framework versions
 - Transformers 4.38.1
 - Pytorch 2.1.0+cu121
 - Datasets 2.17.1
-- Tokenizers 0.15.1

 ---
+license: mit
 base_model: facebook/opt-350m
 tags:
 - trl
 - sft
+- gnumanth/dadjokes-trained-otp
 model-index:
 - name: tmp_trainer
   results: []
+datasets:
+- gnumanth/dad-jokes
+language:
+- en
+pipeline_tag: text-generation
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+#
+This model is a fine-tuned version of [facebook/opt-350m](https://huggingface.co/facebook/opt-350m) on an [gnumanth/dad-jokes](https://huggingface.co/datasets/gnumanth/dad-jokes)dataset.
 ## Model description
+SFT Trained simple model for fun!
 ### Training hyperparameters
 ### Training results
+```
+TrainOutput(global_step=18, training_loss=2.2378472222222223, metrics={'train_runtime': 149.7511, 'train_samples_per_second': 0.881, 'train_steps_per_second': 0.12, 'total_flos': 9828797644800.0, 'train_loss': 2.2378472222222223, 'epoch': 3.0})
+```
 ### Framework versions
 - Transformers 4.38.1
 - Pytorch 2.1.0+cu121
 - Datasets 2.17.1
+- Tokenizers 0.15.1
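
The `TrainOutput` line added under "Training results" can be sanity-checked with a little arithmetic. The sketch below is not part of the commit. It copies the logged numbers and derives the approximate dataset size and batch size from them. The function name `summarize_train_output` is hypothetical, and the derivation assumes that `train_samples_per_second` counts samples across all epochs, that no gradient accumulation was used, and that the last partial batch was kept. Because the logged throughput is rounded, treat the results as estimates.

```python
import math


def summarize_train_output(global_step, train_runtime, samples_per_second, epochs):
    """Derive rough run statistics from logged TrainOutput values.

    Hypothetical helper for illustration; the inputs are copied from the
    TrainOutput line in the README diff.
    """
    # Optimizer steps per second, which the log reports rounded as 0.12.
    steps_per_second = global_step / train_runtime
    # Assumption: throughput is measured over every sample seen in all epochs.
    total_samples = round(samples_per_second * train_runtime)
    samples_per_epoch = total_samples / epochs
    steps_per_epoch = global_step / epochs
    # Assumption: no gradient accumulation, so this is the smallest batch
    # size that covers one epoch in the observed number of steps.
    min_batch_size = math.ceil(samples_per_epoch / steps_per_epoch)
    return {
        "steps_per_second": steps_per_second,
        "total_samples": total_samples,
        "samples_per_epoch": samples_per_epoch,
        "steps_per_epoch": steps_per_epoch,
        "min_batch_size": min_batch_size,
    }


summary = summarize_train_output(
    global_step=18,
    train_runtime=149.7511,      # seconds
    samples_per_second=0.881,    # rounded in the log
    epochs=3.0,
)
print(summary)
```

Under these assumptions the run works out to roughly 44 training examples per epoch, 6 steps per epoch, and a batch size of about 8, so the logged loss of 2.24 comes from a very small fine-tuning run.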