ZhiguangHan commited on
Commit
181fc35
1 Parent(s): 47d8f2f

End of training

Browse files
Files changed (2) hide show
  1. README.md +13 -18
  2. model.safetensors +1 -1
README.md CHANGED
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.4093
21
- - Accuracy: 0.128
22
- - Mse: 1.5841
23
- - Log-distance: 0.6809
24
- - S Score: 0.4800
25
 
26
  ## Model description
27
 
@@ -46,24 +46,19 @@ The following hyperparameters were used during training:
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
- - num_epochs: 12
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Mse | Log-distance | S Score |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------------:|:-------:|
55
- | 2.1991 | 1.0 | 250 | 1.5275 | 0.106 | 1.7120 | 0.7824 | 0.4048 |
56
- | 2.1115 | 2.0 | 500 | 1.5009 | 0.106 | 1.7469 | 0.8062 | 0.3844 |
57
- | 1.8295 | 3.0 | 750 | 1.4483 | 0.108 | 1.7239 | 0.7902 | 0.3972 |
58
- | 1.7033 | 4.0 | 1000 | 1.4335 | 0.112 | 1.7052 | 0.7759 | 0.4088 |
59
- | 1.6426 | 5.0 | 1250 | 1.4224 | 0.12 | 1.5337 | 0.6427 | 0.5112 |
60
- | 1.5923 | 6.0 | 1500 | 1.4236 | 0.126 | 1.6061 | 0.7015 | 0.4628 |
61
- | 1.5529 | 7.0 | 1750 | 1.4284 | 0.122 | 1.5984 | 0.6967 | 0.4676 |
62
- | 1.546 | 8.0 | 2000 | 1.4132 | 0.124 | 1.6032 | 0.6948 | 0.4704 |
63
- | 1.5364 | 9.0 | 2250 | 1.4306 | 0.116 | 1.6403 | 0.7282 | 0.4460 |
64
- | 1.5365 | 10.0 | 2500 | 1.4107 | 0.118 | 1.5702 | 0.6681 | 0.4948 |
65
- | 1.5145 | 11.0 | 2750 | 1.4182 | 0.118 | 1.6041 | 0.7063 | 0.4596 |
66
- | 1.5103 | 12.0 | 3000 | 1.4093 | 0.128 | 1.5841 | 0.6809 | 0.4800 |
67
 
68
 
69
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.4565
21
+ - Accuracy: 0.124
22
+ - Mse: 2.0170
23
+ - Log-distance: 0.7037
24
+ - S Score: 0.4680
25
 
26
  ## Model description
27
 
 
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 15
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Mse | Log-distance | S Score |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------------:|:-------:|
55
+ | 3.3423 | 1.0 | 250 | 1.5914 | 0.074 | 2.3687 | 0.6693 | 0.4992 |
56
+ | 2.3267 | 2.0 | 500 | 1.6213 | 0.102 | 2.6627 | 0.7153 | 0.4836 |
57
+ | 1.8968 | 3.0 | 750 | 1.4707 | 0.118 | 2.0674 | 0.7576 | 0.4228 |
58
+ | 1.7414 | 4.0 | 1000 | 1.4367 | 0.124 | 2.0643 | 0.7534 | 0.4236 |
59
+ | 1.6639 | 5.0 | 1250 | 1.4493 | 0.12 | 2.0268 | 0.7079 | 0.4624 |
60
+ | 1.6088 | 6.0 | 1500 | 1.4538 | 0.116 | 1.9955 | 0.6762 | 0.4924 |
61
+ | 1.5823 | 7.0 | 1750 | 1.4565 | 0.124 | 2.0170 | 0.7037 | 0.4680 |
 
 
 
 
 
62
 
63
 
64
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5ff146c9ee1d84eaff45949aef09c03df7e1ae8ce5187e25d46c5e32e4e8b8e6
3
  size 1200729512
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5190cb07e53c304389db58de1a22ae951612932203087cef042bb9667f5be13a
3
  size 1200729512