jeongseokoh commited on
Commit
b8cb208
·
verified ·
1 Parent(s): 8c1b16a

End of training

Browse files
Files changed (2) hide show
  1. README.md +19 -19
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
- base_model: allenai/longformer-base-4096
5
  tags:
6
  - generated_from_trainer
7
  metrics:
@@ -10,22 +10,22 @@ metrics:
10
  - recall
11
  - f1
12
  model-index:
13
- - name: longformer_best_model
14
  results: []
15
  ---
16
 
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
  should probably proofread and complete it, then remove this comment. -->
19
 
20
- # longformer_best_model
21
 
22
- This model is a fine-tuned version of [allenai/longformer-base-4096](https://huggingface.co/allenai/longformer-base-4096) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.6012
25
- - Accuracy: 0.8372
26
- - Precision: 0.8443
27
- - Recall: 0.8193
28
- - F1: 0.8316
29
 
30
  ## Model description
31
 
@@ -56,16 +56,16 @@ The following hyperparameters were used during training:
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
58
  |:-------------:|:-----:|:------:|:---------------:|:--------:|:---------:|:------:|:------:|
59
- | 0.4507 | 1.0 | 22233 | 0.4954 | 0.7706 | 0.7553 | 0.7875 | 0.7711 |
60
- | 0.4513 | 2.0 | 44466 | 0.4579 | 0.7932 | 0.7783 | 0.8090 | 0.7933 |
61
- | 0.4075 | 3.0 | 66699 | 0.4450 | 0.8066 | 0.7823 | 0.8392 | 0.8097 |
62
- | 0.4149 | 4.0 | 88932 | 0.4322 | 0.8176 | 0.8154 | 0.8120 | 0.8137 |
63
- | 0.2742 | 5.0 | 111165 | 0.4450 | 0.8210 | 0.8265 | 0.8039 | 0.8150 |
64
- | 0.3334 | 6.0 | 133398 | 0.4725 | 0.8262 | 0.8254 | 0.8190 | 0.8222 |
65
- | 0.2323 | 7.0 | 155631 | 0.5031 | 0.8293 | 0.8434 | 0.8008 | 0.8215 |
66
- | 0.2097 | 8.0 | 177864 | 0.5324 | 0.8314 | 0.8378 | 0.8138 | 0.8256 |
67
- | 0.1882 | 9.0 | 200097 | 0.5783 | 0.8357 | 0.8363 | 0.8269 | 0.8316 |
68
- | 0.1289 | 10.0 | 222330 | 0.6012 | 0.8372 | 0.8443 | 0.8193 | 0.8316 |
69
 
70
 
71
  ### Framework versions
 
1
  ---
2
  library_name: transformers
3
  license: apache-2.0
4
+ base_model: jeongseokoh/longformer_best_seq_cls_model
5
  tags:
6
  - generated_from_trainer
7
  metrics:
 
10
  - recall
11
  - f1
12
  model-index:
13
+ - name: longformer_best_seq_cls_model
14
  results: []
15
  ---
16
 
17
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
18
  should probably proofread and complete it, then remove this comment. -->
19
 
20
+ # longformer_best_seq_cls_model
21
 
22
+ This model is a fine-tuned version of [jeongseokoh/longformer_best_seq_cls_model](https://huggingface.co/jeongseokoh/longformer_best_seq_cls_model) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.5130
25
+ - Accuracy: 0.8837
26
+ - Precision: 0.8544
27
+ - Recall: 0.8224
28
+ - F1: 0.8381
29
 
30
  ## Model description
31
 
 
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
58
  |:-------------:|:-----:|:------:|:---------------:|:--------:|:---------:|:------:|:------:|
59
+ | 0.3609 | 1.0 | 22233 | 0.4264 | 0.8112 | 0.7526 | 0.7216 | 0.7368 |
60
+ | 0.379 | 2.0 | 44466 | 0.4479 | 0.8195 | 0.7832 | 0.7012 | 0.7399 |
61
+ | 0.3192 | 3.0 | 66699 | 0.4212 | 0.8364 | 0.7524 | 0.8245 | 0.7868 |
62
+ | 0.3119 | 4.0 | 88932 | 0.4435 | 0.8543 | 0.8077 | 0.7902 | 0.7989 |
63
+ | 0.1877 | 5.0 | 111165 | 0.4269 | 0.8627 | 0.8126 | 0.8125 | 0.8125 |
64
+ | 0.2629 | 6.0 | 133398 | 0.4114 | 0.8709 | 0.8410 | 0.7984 | 0.8192 |
65
+ | 0.1674 | 7.0 | 155631 | 0.4552 | 0.8757 | 0.8254 | 0.8378 | 0.8315 |
66
+ | 0.1753 | 8.0 | 177864 | 0.4683 | 0.8794 | 0.8444 | 0.8221 | 0.8331 |
67
+ | 0.2499 | 9.0 | 200097 | 0.4943 | 0.8835 | 0.8497 | 0.8284 | 0.8390 |
68
+ | 0.0389 | 10.0 | 222330 | 0.5130 | 0.8837 | 0.8544 | 0.8224 | 0.8381 |
69
 
70
 
71
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:847453cb5748284e03c6daace769ca9c57367878d394aa12d4a200e464a3b27a
3
  size 594681256
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:875b3be0d291d744319bd2b68b613144b195e6b7c865ad278754a17287ed7186
3
  size 594681256