JoseLuis95 commited on
Commit
59bc8ee
1 Parent(s): 0ee7b79

Training complete

Browse files
Files changed (3) hide show
  1. README.md +10 -8
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 4.9746
22
- - Bleu: 4.6434
23
- - Gen Len: 16.6341
24
 
25
  ## Model description
26
 
@@ -40,19 +40,21 @@ More information needed
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 5.6e-05
43
- - train_batch_size: 2
44
- - eval_batch_size: 2
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
- - num_epochs: 2
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
54
- | No log | 1.0 | 61 | 5.8517 | 4.6627 | 17.5854 |
55
- | No log | 2.0 | 122 | 4.9746 | 4.6434 | 16.6341 |
 
 
56
 
57
 
58
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 7.2943
22
+ - Bleu: 0.5766
23
+ - Gen Len: 9.6098
24
 
25
  ## Model description
26
 
 
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 5.6e-05
43
+ - train_batch_size: 8
44
+ - eval_batch_size: 8
45
  - seed: 42
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
+ - num_epochs: 4
49
 
50
  ### Training results
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
54
+ | No log | 1.0 | 16 | 10.5602 | 0.076 | 3.8049 |
55
+ | No log | 2.0 | 32 | 8.3458 | 0.1439 | 5.1951 |
56
+ | No log | 3.0 | 48 | 7.5921 | 0.4483 | 9.2683 |
57
+ | No log | 4.0 | 64 | 7.2943 | 0.5766 | 9.6098 |
58
 
59
 
60
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:69c43b5c6bc9d61bb61d05e68acbfd87f70ec9903fd404121003b1529089d2ba
3
  size 2329638768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b0f4173980d3de2eec1b43c9d6ef5c50aafe30499e2268e53ac63e89872e3641
3
  size 2329638768
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:599ca58558683298ee873ff88b9a83abba0e9fb2f4b049c3cb2e399e975ee1c4
3
  size 4984
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:852ab5934ebf62194288c520669a4ee89a1ef83cad6ec8185d903b70e02620b3
3
  size 4984