varun-v-rao
/

gpt2-large-lora-2.95M-snli-model1

Text Classification

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

varun-v-rao commited on about 1 month ago

Commit

e33f1ab

•

1 Parent(s): 766d597

End of training

Files changed (2) hide show

README.md +7 -7
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.8795976427555375
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,8 +29,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai-community/gpt2-large](https://huggingface.co/openai-community/gpt2-large) on the snli dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3222
-- Accuracy: 0.8796
 ## Model description
@@ -52,7 +52,7 @@ The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 128
 - eval_batch_size: 128
-- seed: 66
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
@@ -61,9 +61,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 0.442         | 1.0   | 4292  | 0.3550          | 0.8650   |
-| 0.3973        | 2.0   | 8584  | 0.3296          | 0.8755   |
-| 0.3926        | 3.0   | 12876 | 0.3222          | 0.8796   |
 ### Framework versions

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.8768542979069295
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai-community/gpt2-large](https://huggingface.co/openai-community/gpt2-large) on the snli dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3263
+- Accuracy: 0.8769
 ## Model description
 - learning_rate: 2e-05
 - train_batch_size: 128
 - eval_batch_size: 128
+- seed: 26
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.4349        | 1.0   | 4292  | 0.3539          | 0.8661   |
+| 0.4061        | 2.0   | 8584  | 0.3339          | 0.8745   |
+| 0.3941        | 3.0   | 12876 | 0.3263          | 0.8769   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d962bec3eaa44850606994b13bd3c5eb77467a34ffc2415ab4b5e5b8f4b43e1b
 size 3096181368

 version https://git-lfs.github.com/spec/v1
+oid sha256:915a342950caa961d4fe61bb065d4d8536584c7b3240c73c607974c27937e613
 size 3096181368