LogischeIP commited on
Commit
d2ad407
1 Parent(s): 344d352

End of training

Browse files
README.md CHANGED
@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.7249
22
- - Accuracy: 0.8669
23
- - F1: 0.8749
24
- - Auc Roc: 0.9317
25
- - Log Loss: 0.7249
26
 
27
  ## Model description
28
 
@@ -50,15 +50,19 @@ The following hyperparameters were used during training:
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: cosine
52
  - lr_scheduler_warmup_steps: 500
53
- - num_epochs: 2
54
  - mixed_precision_training: Native AMP
55
 
56
  ### Training results
57
 
58
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Auc Roc | Log Loss |
59
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:-------:|:--------:|
60
- | 1.1736 | 1.0 | 1618 | 0.6146 | 0.8507 | 0.8592 | 0.9256 | 0.6146 |
61
- | 0.6452 | 2.0 | 3236 | 0.7249 | 0.8669 | 0.8749 | 0.9317 | 0.7249 |
 
 
 
 
62
 
63
 
64
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.9470
22
+ - Accuracy: 0.8644
23
+ - F1: 0.8728
24
+ - Auc Roc: 0.9185
25
+ - Log Loss: 0.9470
26
 
27
  ## Model description
28
 
 
50
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
  - lr_scheduler_type: cosine
52
  - lr_scheduler_warmup_steps: 500
53
+ - num_epochs: 6
54
  - mixed_precision_training: Native AMP
55
 
56
  ### Training results
57
 
58
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Auc Roc | Log Loss |
59
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:-------:|:--------:|
60
+ | 1.0534 | 1.0 | 1618 | 0.6479 | 0.8694 | 0.8696 | 0.9298 | 0.6479 |
61
+ | 0.6971 | 2.0 | 3236 | 1.0859 | 0.8371 | 0.8581 | 0.9236 | 1.0859 |
62
+ | 0.5832 | 3.0 | 4854 | 0.9261 | 0.8495 | 0.8672 | 0.9255 | 0.9261 |
63
+ | 0.4402 | 4.0 | 6472 | 0.8507 | 0.8719 | 0.8804 | 0.9251 | 0.8507 |
64
+ | 0.3475 | 5.0 | 8090 | 0.9284 | 0.8657 | 0.8735 | 0.9198 | 0.9283 |
65
+ | 0.2985 | 6.0 | 9708 | 0.9470 | 0.8644 | 0.8728 | 0.9185 | 0.9470 |
66
 
67
 
68
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:02f0b5d0a85a3b873b4a183a19c22b0acf256a4c08e0f9b1355b1c34211241e5
3
  size 497780432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bba396cdd3954fa6449e56fd26d07b8e71a29415aa7f0202a2844805faf46396
3
  size 497780432
runs/Jan24_20-13-09_7d2b1e2d4e1a/events.out.tfevents.1706127190.7d2b1e2d4e1a.26662.10 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:98f8c469a0ac1b736fdfd722766b3a5e4730843bb57eb3a0558723ec437aebca
3
+ size 8670
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3de59719afd225fb332ed6d1b42b6d228a99601e6a3b5db147d421b6288ddab6
3
  size 4664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4687dd4a4162f59df791f1cccd877165e6c3d736f89908c5329d32a5eaea2f11
3
  size 4664