gokulsrinivasagan commited on
Commit
6a0b16a
·
verified ·
1 Parent(s): 35ecfa0

Model save

Browse files
README.md CHANGED
@@ -1,32 +1,14 @@
1
  ---
2
  library_name: transformers
3
- language:
4
- - en
5
  base_model: gokulsrinivasagan/bert_tiny_lda_20_v1
6
  tags:
7
  - generated_from_trainer
8
- datasets:
9
- - glue
10
  metrics:
11
  - accuracy
12
  - f1
13
  model-index:
14
  - name: bert_tiny_lda_20_v1_mrpc
15
- results:
16
- - task:
17
- name: Text Classification
18
- type: text-classification
19
- dataset:
20
- name: GLUE MRPC
21
- type: glue
22
- args: mrpc
23
- metrics:
24
- - name: Accuracy
25
- type: accuracy
26
- value: 0.6838235294117647
27
- - name: F1
28
- type: f1
29
- value: 0.8122270742358079
30
  ---
31
 
32
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
34
 
35
  # bert_tiny_lda_20_v1_mrpc
36
 
37
- This model is a fine-tuned version of [gokulsrinivasagan/bert_tiny_lda_20_v1](https://huggingface.co/gokulsrinivasagan/bert_tiny_lda_20_v1) on the GLUE MRPC dataset.
38
  It achieves the following results on the evaluation set:
39
- - Loss: 0.6233
40
- - Accuracy: 0.6838
41
- - F1: 0.8122
42
- - Combined Score: 0.7480
43
 
44
  ## Model description
45
 
@@ -58,7 +40,7 @@ More information needed
58
  ### Training hyperparameters
59
 
60
  The following hyperparameters were used during training:
61
- - learning_rate: 0.001
62
  - train_batch_size: 256
63
  - eval_batch_size: 256
64
  - seed: 10
@@ -70,17 +52,14 @@ The following hyperparameters were used during training:
70
 
71
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Combined Score |
72
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:--------------:|
73
- | 0.7043 | 1.0 | 15 | 0.6264 | 0.6838 | 0.8122 | 0.7480 |
74
- | 0.6367 | 2.0 | 30 | 0.6249 | 0.6838 | 0.8122 | 0.7480 |
75
- | 0.6299 | 3.0 | 45 | 0.6277 | 0.6838 | 0.8122 | 0.7480 |
76
- | 0.6347 | 4.0 | 60 | 0.6265 | 0.6838 | 0.8122 | 0.7480 |
77
- | 0.633 | 5.0 | 75 | 0.6261 | 0.6838 | 0.8122 | 0.7480 |
78
- | 0.6335 | 6.0 | 90 | 0.6233 | 0.6838 | 0.8122 | 0.7480 |
79
- | 0.6321 | 7.0 | 105 | 0.6244 | 0.6838 | 0.8122 | 0.7480 |
80
- | 0.6341 | 8.0 | 120 | 0.6249 | 0.6838 | 0.8122 | 0.7480 |
81
- | 0.6283 | 9.0 | 135 | 0.6298 | 0.6838 | 0.8122 | 0.7480 |
82
- | 0.6357 | 10.0 | 150 | 0.6238 | 0.6838 | 0.8122 | 0.7480 |
83
- | 0.635 | 11.0 | 165 | 0.6249 | 0.6838 | 0.8122 | 0.7480 |
84
 
85
 
86
  ### Framework versions
 
1
  ---
2
  library_name: transformers
 
 
3
  base_model: gokulsrinivasagan/bert_tiny_lda_20_v1
4
  tags:
5
  - generated_from_trainer
 
 
6
  metrics:
7
  - accuracy
8
  - f1
9
  model-index:
10
  - name: bert_tiny_lda_20_v1_mrpc
11
+ results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  ---
13
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
16
 
17
  # bert_tiny_lda_20_v1_mrpc
18
 
19
+ This model is a fine-tuned version of [gokulsrinivasagan/bert_tiny_lda_20_v1](https://huggingface.co/gokulsrinivasagan/bert_tiny_lda_20_v1) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.9865
22
+ - Accuracy: 0.6225
23
+ - F1: 0.7004
24
+ - Combined Score: 0.6615
25
 
26
  ## Model description
27
 
 
40
  ### Training hyperparameters
41
 
42
  The following hyperparameters were used during training:
43
+ - learning_rate: 5e-05
44
  - train_batch_size: 256
45
  - eval_batch_size: 256
46
  - seed: 10
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Combined Score |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:--------------:|
55
+ | 0.6315 | 1.0 | 15 | 0.6004 | 0.6863 | 0.8123 | 0.7493 |
56
+ | 0.6013 | 2.0 | 30 | 0.5958 | 0.6887 | 0.8037 | 0.7462 |
57
+ | 0.5707 | 3.0 | 45 | 0.5935 | 0.6985 | 0.8099 | 0.7542 |
58
+ | 0.5415 | 4.0 | 60 | 0.6069 | 0.6985 | 0.8032 | 0.7509 |
59
+ | 0.4866 | 5.0 | 75 | 0.6274 | 0.6789 | 0.7737 | 0.7263 |
60
+ | 0.397 | 6.0 | 90 | 0.7453 | 0.6985 | 0.8006 | 0.7496 |
61
+ | 0.3039 | 7.0 | 105 | 0.8151 | 0.6520 | 0.7418 | 0.6969 |
62
+ | 0.2217 | 8.0 | 120 | 0.9865 | 0.6225 | 0.7004 | 0.6615 |
 
 
 
63
 
64
 
65
  ### Framework versions
logs/events.out.tfevents.1733323231.ki-g0008.1207389.20 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d622b0898d764fc505e3edff7df820f2821f0a88b39ff995287686cd261e8296
3
- size 9467
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2232c66cffc166e9b7162b56c89cbb7baf2f4cdae7a2a396e3b492f08e35d981
3
+ size 10441
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:68a38be3994676b64c43fbd96460ca5464c4be77556838a5240fdaf7a7e3cfb9
3
  size 131856744
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0254df5fadf9c19ad9266a3edb092aa260f5c28c28eb9550dee81b76ffb72861
3
  size 131856744