Emil7018 commited on
Commit
d6f97c2
·
verified ·
1 Parent(s): 25352fe

End of training

Browse files
README.md CHANGED
@@ -19,9 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.2926
23
- - Accuracy: 0.9253
24
- - F1: 0.9252
25
 
26
  ## Model description
27
 
@@ -46,14 +46,19 @@ The following hyperparameters were used during training:
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
- - num_epochs: 2
 
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
55
- | No log | 1.0 | 313 | 0.2275 | 0.9258 | 0.9258 |
56
- | 0.1527 | 2.0 | 626 | 0.2926 | 0.9253 | 0.9252 |
 
 
 
 
57
 
58
 
59
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.2637
23
+ - Accuracy: 0.9093
24
+ - F1: 0.9091
25
 
26
  ## Model description
27
 
 
46
  - seed: 42
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 10
50
+ - mixed_precision_training: Native AMP
51
 
52
  ### Training results
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
55
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
56
+ | 0.337 | 1.0 | 313 | 0.2637 | 0.9093 | 0.9091 |
57
+ | 0.154 | 2.0 | 626 | 0.2963 | 0.9045 | 0.9049 |
58
+ | 0.0558 | 3.0 | 939 | 0.4010 | 0.9182 | 0.9179 |
59
+ | 0.0128 | 4.0 | 1252 | 0.6283 | 0.9201 | 0.9198 |
60
+ | 0.0037 | 5.0 | 1565 | 0.6197 | 0.9196 | 0.9196 |
61
+ | 0.0027 | 6.0 | 1878 | 0.6015 | 0.9216 | 0.9211 |
62
 
63
 
64
  ### Framework versions
all_results.json ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 6.0,
3
+ "eval_accuracy": 0.9093421052631578,
4
+ "eval_f1": 0.9090618533224998,
5
+ "eval_loss": 0.2637197971343994,
6
+ "eval_runtime": 19.5155,
7
+ "eval_samples_per_second": 389.434,
8
+ "eval_steps_per_second": 12.195
9
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d0f37bba18cf7f2878c3221a3f1160d03452441d2049204b13d9536bc453af60
3
  size 598445936
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:45d98326df7f3956f269b83ba767d4760d4b0ac48fc58ab5ec9da6f46a8f73fa
3
  size 598445936
runs/Oct03_15-42-32_ce7fa55b11ce/events.out.tfevents.1759506153.ce7fa55b11ce.1127.6 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2994107774013b19bffe06edc5ec8117606be3bb313e7513554577873936379a
3
- size 8686
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:40b61bce6f4a98c551922756be15c9f26679a9460fe56d34de2fc52d3e470741
3
+ size 9620
runs/Oct03_15-42-32_ce7fa55b11ce/events.out.tfevents.1759507536.ce7fa55b11ce.1127.7 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a8c18c06af98f5f7356415838129486ab36aab818cd2b021df055d7ba005aa32
3
+ size 457
test_results.json ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 6.0,
3
+ "eval_accuracy": 0.9093421052631578,
4
+ "eval_f1": 0.9090618533224998,
5
+ "eval_loss": 0.2637197971343994,
6
+ "eval_runtime": 19.5155,
7
+ "eval_samples_per_second": 389.434,
8
+ "eval_steps_per_second": 12.195
9
+ }