gokulsrinivasagan/distilbert_lda_50_v1_sst2

@@ -1,28 +1,13 @@
 ---
 library_name: transformers
-language:
-- en
 base_model: gokulsrinivasagan/distilbert_lda_50_v1
 tags:
 - generated_from_trainer
-datasets:
-- glue
 metrics:
 - accuracy
 model-index:
 - name: distilbert_lda_50_v1_sst2
-  results:
-  - task:
-      name: Text Classification
-      type: text-classification
-    dataset:
-      name: GLUE SST2
-      type: glue
-      args: sst2
-    metrics:
-    - name: Accuracy
-      type: accuracy
-      value: 0.5091743119266054
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # distilbert_lda_50_v1_sst2
-This model is a fine-tuned version of [gokulsrinivasagan/distilbert_lda_50_v1](https://huggingface.co/gokulsrinivasagan/distilbert_lda_50_v1) on the GLUE SST2 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6954
-- Accuracy: 0.5092
 ## Model description
@@ -52,7 +37,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.001
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
@@ -64,15 +49,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.7089        | 1.0   | 264  | 0.6971          | 0.5092   |
-| 0.6871        | 2.0   | 528  | 0.6957          | 0.5092   |
-| 0.6867        | 3.0   | 792  | 0.6989          | 0.5092   |
-| 0.6868        | 4.0   | 1056 | 0.6954          | 0.5092   |
-| 0.6867        | 5.0   | 1320 | 0.6973          | 0.5092   |
-| 0.6866        | 6.0   | 1584 | 0.6989          | 0.5092   |
-| 0.6862        | 7.0   | 1848 | 0.6970          | 0.5092   |
-| 0.6866        | 8.0   | 2112 | 0.6971          | 0.5092   |
-| 0.6867        | 9.0   | 2376 | 0.6970          | 0.5092   |
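
A quick sanity check on the removed run, offered as a sketch rather than a diagnosis. The validation losses in the rows above sit close to ln 2 (about 0.6931), which is the cross-entropy of a binary classifier that outputs 50/50, and the accuracy never moves from 0.5092. The 444-of-872 class split used below is an assumption about the SST-2 validation set, not something stated in the card.

```python
import math

# Validation losses and accuracies copied from the removed table rows (epochs 1-9).
val_losses = [0.6971, 0.6957, 0.6989, 0.6954, 0.6973, 0.6989, 0.6970, 0.6971, 0.6970]
accuracies = [0.5092] * 9

# Cross-entropy of a two-class model that always assigns probability 0.5 to each class.
chance_loss = math.log(2)

# Largest gap between any logged validation loss and the chance-level loss.
max_gap = max(abs(loss - chance_loss) for loss in val_losses)

# ASSUMPTION: the SST-2 validation split has 872 examples, 444 in the majority class.
majority_rate = 444 / 872

print(f"chance-level loss: {chance_loss:.4f}")
print(f"largest gap from chance: {max_gap:.4f}")
print(f"majority-class rate (assumed split): {majority_rate:.4f}")
print(f"accuracy constant across epochs: {len(set(accuracies)) == 1}")
```

If the assumed split is right, the old checkpoint was effectively predicting a single class. That is consistent with the learning rate of 0.001 being too high for this fine-tune, which the diff lowers to 5e-05.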
 ### Framework versions

 ---
 library_name: transformers
 base_model: gokulsrinivasagan/distilbert_lda_50_v1
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
 - name: distilbert_lda_50_v1_sst2
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # distilbert_lda_50_v1_sst2
+This model is a fine-tuned version of [gokulsrinivasagan/distilbert_lda_50_v1](https://huggingface.co/gokulsrinivasagan/distilbert_lda_50_v1) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5851
+- Accuracy: 0.8177
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 256
 - eval_batch_size: 256
 - seed: 10
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.3785        | 1.0   | 264  | 0.4006          | 0.8291   |
+| 0.2139        | 2.0   | 528  | 0.4261          | 0.8406   |
+| 0.1523        | 3.0   | 792  | 0.4886          | 0.8154   |
+| 0.1085        | 4.0   | 1056 | 0.5392          | 0.8268   |
+| 0.0809        | 5.0   | 1320 | 0.5836          | 0.8303   |
+| 0.0646        | 6.0   | 1584 | 0.5851          | 0.8177   |
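
To read the updated table at a glance, the sketch below picks out the epoch with the lowest validation loss and the epoch with the highest accuracy. The rows are copied from the table above; the helper name `best_by` is hypothetical, and the card does not say which checkpoint was saved.

```python
# Rows copied from the updated training-results table:
# (training loss, epoch, step, validation loss, accuracy)
rows = [
    (0.3785, 1.0, 264, 0.4006, 0.8291),
    (0.2139, 2.0, 528, 0.4261, 0.8406),
    (0.1523, 3.0, 792, 0.4886, 0.8154),
    (0.1085, 4.0, 1056, 0.5392, 0.8268),
    (0.0809, 5.0, 1320, 0.5836, 0.8303),
    (0.0646, 6.0, 1584, 0.5851, 0.8177),
]

VAL_LOSS, ACCURACY = 3, 4  # column positions within each row


def best_by(table, column, lowest):
    """Return the row that minimises (lowest=True) or maximises the given column."""
    chooser = min if lowest else max
    return chooser(table, key=lambda row: row[column])


best_loss_row = best_by(rows, VAL_LOSS, lowest=True)
best_acc_row = best_by(rows, ACCURACY, lowest=False)

print(f"lowest validation loss: epoch {best_loss_row[1]:.0f} ({best_loss_row[VAL_LOSS]})")
print(f"highest accuracy: epoch {best_acc_row[1]:.0f} ({best_acc_row[ACCURACY]})")
```

The evaluation figures reported in the card (loss 0.5851, accuracy 0.8177) match the last row, epoch 6. Validation loss is lowest at epoch 1 and rises every epoch after that while training loss keeps falling, so an earlier checkpoint may generalise better.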
 ### Framework versions

logs/events.out.tfevents.1733313019.ki-g0008.1206436.10 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1b78dd56fb89648a435a679c6e7fd4132e5195cc1e625039ec06613d17d16c13
-size 7753

 version https://git-lfs.github.com/spec/v1
+oid sha256:50dac5ef42aa9fae719056eef4e4ee67d6f7d8fced02679f773d5fc06e178bee
+size 8641

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:46d56be59d8a39217978180cafbcbeb0c43d01fa6031699ae3b787c920d7bb67
 size 267832560

 version https://git-lfs.github.com/spec/v1
+oid sha256:564600e209fc3bd63b3333e9816869ea36ec40e22f51b7b00712e0cb8bf857cc
 size 267832560
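
The pointer files above record only a SHA-256 digest and a byte size for each large file. A minimal sketch of checking a downloaded file against its pointer follows; the helper names `parse_pointer` and `matches_pointer` are hypothetical, and the local path in the usage note is an assumption.

```python
import hashlib
import os


def parse_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    fields = parse_pointer(pointer_text)
    algorithm, _, expected_hex = fields["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm}")
    # Cheap check first: a size mismatch rules the file out without hashing it.
    if os.path.getsize(path) != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in chunks so a file of a few hundred megabytes is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex


# Pointer for the updated model.safetensors, copied from the diff above.
NEW_MODEL_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:564600e209fc3bd63b3333e9816869ea36ec40e22f51b7b00712e0cb8bf857cc
size 267832560
"""
```

With the weights downloaded locally, `matches_pointer("model.safetensors", NEW_MODEL_POINTER)` would report whether the file is the one this commit points to. Both model pointers list the same size, so only the digest distinguishes the old weights from the new ones.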