alpindale committed on
Commit
014d821
1 Parent(s): 4d3ba2e

Update README.md

Files changed (1)
  1. README.md (+8, -17)
README.md CHANGED
@@ -4,33 +4,28 @@ tags:
 metrics:
 - accuracy
 model-index:
-- name: pygmalion-training
+- name: pygmalion-350m
   results: []
 ---
 
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
+# pygmalion-350m
 
-# pygmalion-training
-
-This model is a fine-tuned version of [/notebooks/pygmalion/pygmalion-350m/](https://huggingface.co//notebooks/pygmalion/pygmalion-350m/) on an unknown dataset.
+This model is a fine-tuned version of [PygmalionAI/pygmalion-350m](https://huggingface.co/PygmalionAI/pygmalion-350m/) on a 2.4MB dataset.
 It achieves the following results on the evaluation set:
 - Loss: 2.2731
 - Accuracy: 0.5187
 
 ## Model description
 
-More information needed
-
-## Intended uses & limitations
+A proof-of-concept model based on PygmalionAI/pygmalion-350m, which was in turn based on OPT-350m.
 
-More information needed
+This model was fine-tuned purely for testing purposes.
 
-## Training and evaluation data
+## Fine-tuning process
 
-More information needed
+Fine-tuned on an A100-80GB GPU with Hugging Face's `run_clm.py` script. Training ran for 3 epochs with a batch size of 8 on a 2.4MB dataset (split 75/25 between training and validation sets).
 
-## Training procedure
+## Training and evaluation data
 
 ### Training hyperparameters
 
@@ -43,10 +38,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 3.0
 
-### Training results
-
-
-
 ### Framework versions
 
 - Transformers 4.27.0.dev0
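
For reference, here is a minimal sketch of what the `run_clm.py` invocation described in the new card could have looked like, reconstructed only from the details stated there (base model, 3 epochs, batch size 8, linear scheduler). The data file names and output directory are assumptions; the commit does not record the exact command or the remaining arguments (learning rate, block size, etc.).

```bash
# Hedged sketch, not the recorded command: the file names and output_dir are
# hypothetical, and the 2.4MB dataset is assumed to be pre-split 75/25 into
# plain-text train/validation files as the card describes.
python run_clm.py \
  --model_name_or_path PygmalionAI/pygmalion-350m \
  --train_file train.txt \
  --validation_file validation.txt \
  --do_train \
  --do_eval \
  --num_train_epochs 3 \
  --per_device_train_batch_size 8 \
  --lr_scheduler_type linear \
  --output_dir pygmalion-350m-finetuned
```

As a sanity check on the reported numbers: `run_clm.py` reports perplexity as `exp(eval_loss)`, so the evaluation loss of 2.2731 corresponds to a perplexity of roughly 9.7.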