Vichentito committed on
Commit b902f18
1 Parent(s): 975d2c3

End of training

Files changed (2)
  1. README.md +8 -18
  2. model.safetensors +1 -1
README.md CHANGED
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->

  This model was trained from scratch on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.6835
- - Bleu: 25.7459
- - Gen Len: 45.7999
+ - Loss: 0.6778
+ - Bleu: 29.2544
+ - Gen Len: 42.705

  ## Model description

@@ -42,30 +42,20 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 4
+ - num_epochs: 8

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
  |:-------------:|:------:|:----:|:---------------:|:-------:|:-------:|
- | No log | 0.3027 | 300 | 0.6651 | 24.827 | 46.2006 |
- | 0.9651 | 0.6054 | 600 | 0.6967 | 24.0607 | 45.3494 |
- | 0.9651 | 0.9082 | 900 | 0.7045 | 23.7928 | 46.1327 |
- | 1.001 | 1.2109 | 1200 | 0.7084 | 23.9299 | 46.4082 |
- | 0.8741 | 1.5136 | 1500 | 0.7156 | 23.9047 | 45.8685 |
- | 0.8741 | 1.8163 | 1800 | 0.7121 | 23.9386 | 45.7796 |
- | 0.8763 | 2.1191 | 2100 | 0.7083 | 24.5377 | 45.8846 |
- | 0.8763 | 2.4218 | 2400 | 0.7032 | 24.6723 | 46.1827 |
- | 0.7689 | 2.7245 | 2700 | 0.6988 | 24.7631 | 45.8793 |
- | 0.7599 | 3.0272 | 3000 | 0.6961 | 25.2701 | 45.7947 |
- | 0.7599 | 3.3300 | 3300 | 0.6935 | 25.4704 | 45.7461 |
- | 0.6782 | 3.6327 | 3600 | 0.6861 | 25.7835 | 45.9797 |
- | 0.6782 | 3.9354 | 3900 | 0.6835 | 25.7459 | 45.7999 |
+ | No log | 2.1277 | 300 | 0.7834 | 26.6199 | 41.9314 |
+ | 0.3038 | 4.2553 | 600 | 0.6428 | 27.5908 | 42.4358 |
+ | 0.3038 | 6.3830 | 900 | 0.6778 | 29.2544 | 42.705 |


  ### Framework versions

  - Transformers 4.41.2
- - Pytorch 2.3.0+cu121
+ - Pytorch 2.1.0
  - Datasets 2.19.2
  - Tokenizers 0.19.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a82ac66d4636a2339ab0beab4c1963c52642b945139f2136311e0a5d58519137
+ oid sha256:f80fd3005a7c42553118395fc244681f20c26391d35d2965dfa126fef8b9e008
  size 990345064