Commit ca67270 by patrixtano (1 parent: bef632d)

End of training

Files changed (2)
  1. README.md +9 -9
  2. model.safetensors +1 -1
README.md CHANGED
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0348
-- Score: 29.6164
+- Loss: 0.0213
+- Score: 28.7290
 - Char Order: 6
 - Word Order: 0
 - Beta: 2
@@ -40,8 +40,8 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -49,11 +49,11 @@ The following hyperparameters were used during training:
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Score | Char Order | Word Order | Beta |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:----------:|:----------:|:----:|
-| 0.1076 | 1.0 | 2900 | 0.0491 | 29.5250 | 6 | 0 | 2 |
-| 0.0713 | 2.0 | 5800 | 0.0363 | 29.5776 | 6 | 0 | 2 |
-| 0.0628 | 3.0 | 8700 | 0.0348 | 29.6164 | 6 | 0 | 2 |
+| Training Loss | Epoch | Step | Validation Loss | Score | Char Order | Word Order | Beta |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:----------:|:----------:|:----:|
+| 0.0604 | 1.0 | 11598 | 0.0277 | 28.6422 | 6 | 0 | 2 |
+| 0.041 | 2.0 | 23196 | 0.0224 | 28.7007 | 6 | 0 | 2 |
+| 0.0366 | 3.0 | 34794 | 0.0213 | 28.7290 | 6 | 0 | 2 |
 
 
 ### Framework versions
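
Review note: the step counts in the old and new training tables look consistent with the batch-size change from 16 to 4. As a rough check, assume a single device, no gradient accumulation, and a final partial batch that is kept (the model card states none of these). Under those assumptions, steps per epoch equal ceil(N / batch_size) for a training set of N examples. The sketch below finds the values of N compatible with both runs; `compatible_sizes` is a hypothetical helper, not part of any library.

```python
import math


def compatible_sizes(batch_size: int, steps_per_epoch: int) -> range:
    """Dataset sizes N for which ceil(N / batch_size) == steps_per_epoch.

    Assumes one optimizer step per batch and that the last partial batch
    is not dropped (both are assumptions, not stated in the model card).
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return range(low, high + 1)


# Steps per epoch are read from the epoch 1.0 rows of the two tables.
old_run = compatible_sizes(batch_size=16, steps_per_epoch=2900)
new_run = compatible_sizes(batch_size=4, steps_per_epoch=11598)

# Sizes that explain both runs at once.
both = sorted(set(old_run) & set(new_run))
print(both[0], both[-1])  # 46389 46392
```

If the assumptions hold, both runs used the same training set of roughly 46,390 examples, and the fourfold increase in steps comes from the smaller batch rather than from more data.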
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ca8d75cab9af779afc73de06b281abe2cf9a34e46153129204dcb8804b0cab25
+oid sha256:0248c3310446c293f038466ed848a189b455bf3d7480182dbdb682652ae10db6
 size 2329638768
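
Review note: the model.safetensors change only swaps the Git LFS pointer. The `oid` is the SHA-256 digest of the weights file and `size` is its length in bytes, which stays at 2329638768. A minimal sketch of checking a downloaded file against such a pointer follows. The helper names `parse_lfs_pointer` and `verify_lfs_pointer` are hypothetical, and the parsing assumes the simple three-line `key value` layout of this pointer.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer made of 'key value' lines."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def verify_lfs_pointer(pointer_text: str, path: str) -> bool:
    """Return True if the file at `path` matches the pointer's size and hash."""
    pointer = parse_lfs_pointer(pointer_text)
    # Cheap check first: a size mismatch avoids hashing a multi-GB file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so the whole file never sits in memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the new side of this commit.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:0248c3310446c293f038466ed848a189b455bf3d7480182dbdb682652ae10db6
size 2329638768
"""

# Hypothetical usage, assuming the weights were downloaded to this path:
# verify_lfs_pointer(NEW_POINTER, "model.safetensors")
```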