Rzoro committed on
Commit
323bb25
1 Parent(s): f16ba15

Model save

Files changed (2)
  1. README.md +26 -26
  2. pytorch_model.bin +1 -1
README.md CHANGED
@@ -13,8 +13,8 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model was trained from scratch on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.0050
- - Map@3: 0.7222
 
  ## Model description
 
@@ -33,7 +33,7 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 1e-05
  - train_batch_size: 1
  - eval_batch_size: 1
  - seed: 0
@@ -48,29 +48,29 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Map@3 |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
- | 1.1981 | 0.04 | 200 | 1.0091 | 0.7250 |
- | 1.1225 | 0.08 | 400 | 1.0614 | 0.7157 |
- | 1.0582 | 0.13 | 600 | 1.0616 | 0.7187 |
- | 0.9549 | 0.17 | 800 | 1.2926 | 0.7140 |
- | 1.0004 | 0.21 | 1000 | 1.0648 | 0.7148 |
- | 0.9451 | 0.25 | 1200 | 1.1950 | 0.6827 |
- | 1.0324 | 0.29 | 1400 | 1.1839 | 0.7020 |
- | 0.9302 | 0.34 | 1600 | 1.2874 | 0.7132 |
- | 0.9951 | 0.38 | 1800 | 1.1666 | 0.7072 |
- | 0.9903 | 0.42 | 2000 | 1.1557 | 0.7113 |
- | 0.9599 | 0.46 | 2200 | 1.1784 | 0.7018 |
- | 0.9895 | 0.51 | 2400 | 1.1852 | 0.7007 |
- | 1.0247 | 0.55 | 2600 | 1.0689 | 0.7162 |
- | 0.9836 | 0.59 | 2800 | 1.0963 | 0.7210 |
- | 1.0456 | 0.63 | 3000 | 1.0689 | 0.7138 |
- | 1.0871 | 0.67 | 3200 | 1.0866 | 0.7243 |
- | 1.1521 | 0.72 | 3400 | 1.0743 | 0.7243 |
- | 1.1439 | 0.76 | 3600 | 1.0287 | 0.7215 |
- | 1.1757 | 0.8 | 3800 | 1.0172 | 0.7243 |
- | 1.2097 | 0.84 | 4000 | 1.0121 | 0.7233 |
- | 1.1979 | 0.88 | 4200 | 1.0091 | 0.7215 |
- | 1.2282 | 0.93 | 4400 | 1.0057 | 0.7222 |
- | 1.2602 | 0.97 | 4600 | 1.0050 | 0.7222 |
 
 
  ### Framework versions
 
 
  This model was trained from scratch on the None dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.0162
+ - Map@3: 0.7248
 
  ## Model description
 
 
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
+ - learning_rate: 5e-06
  - train_batch_size: 1
  - eval_batch_size: 1
  - seed: 0
 
 
  | Training Loss | Epoch | Step | Validation Loss | Map@3 |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
+ | 1.1455 | 0.04 | 200 | 1.0242 | 0.7222 |
+ | 1.1247 | 0.08 | 400 | 1.0420 | 0.7233 |
+ | 1.0755 | 0.13 | 600 | 1.0358 | 0.7222 |
+ | 1.003 | 0.17 | 800 | 1.1454 | 0.7258 |
+ | 1.0276 | 0.21 | 1000 | 1.0685 | 0.7205 |
+ | 0.9733 | 0.25 | 1200 | 1.1443 | 0.7050 |
+ | 1.0409 | 0.29 | 1400 | 1.1388 | 0.7012 |
+ | 0.9511 | 0.34 | 1600 | 1.1830 | 0.7197 |
+ | 1.0153 | 0.38 | 1800 | 1.1344 | 0.7172 |
+ | 1.0024 | 0.42 | 2000 | 1.1659 | 0.7212 |
+ | 0.9657 | 0.46 | 2200 | 1.1938 | 0.7100 |
+ | 0.9993 | 0.51 | 2400 | 1.1777 | 0.7042 |
+ | 1.0174 | 0.55 | 2600 | 1.0811 | 0.7145 |
+ | 0.9792 | 0.59 | 2800 | 1.1281 | 0.7162 |
+ | 1.0442 | 0.63 | 3000 | 1.0792 | 0.7133 |
+ | 1.075 | 0.67 | 3200 | 1.0900 | 0.7165 |
+ | 1.1424 | 0.72 | 3400 | 1.0698 | 0.7188 |
+ | 1.1411 | 0.76 | 3600 | 1.0476 | 0.7193 |
+ | 1.172 | 0.8 | 3800 | 1.0318 | 0.7225 |
+ | 1.208 | 0.84 | 4000 | 1.0224 | 0.7225 |
+ | 1.1975 | 0.88 | 4200 | 1.0195 | 0.7245 |
+ | 1.2282 | 0.93 | 4400 | 1.0168 | 0.7238 |
+ | 1.2635 | 0.97 | 4600 | 1.0162 | 0.7248 |
 
 
  ### Framework versions
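
The model card reports a Map@3 column but does not say how it is computed. The sketch below is one common reading, assuming the metric is mean average precision at 3 with exactly one correct label per example, so each example scores 1, 1/2, or 1/3 depending on the rank of the correct label among the top three predictions, and 0 otherwise. The function name `map_at_3` and the sample labels are hypothetical and do not come from this repository.

```python
def map_at_3(ranked_predictions, true_labels):
    """Mean average precision at 3, assuming one correct label per example.

    ranked_predictions: list of lists, best guess first.
    true_labels: list of the single correct label for each example.
    """
    if len(ranked_predictions) != len(true_labels):
        raise ValueError("predictions and labels must have the same length")
    if not true_labels:
        raise ValueError("need at least one example")

    total = 0.0
    for preds, truth in zip(ranked_predictions, true_labels):
        # Only the first three guesses count; credit is 1 / rank of the hit.
        for rank, pred in enumerate(preds[:3], start=1):
            if pred == truth:
                total += 1.0 / rank
                break
    return total / len(true_labels)


if __name__ == "__main__":
    # Hypothetical examples: hit at rank 1, hit at rank 2, and a miss.
    preds = [["A", "B", "C"], ["B", "A", "C"], ["C", "D", "E"]]
    labels = ["A", "A", "A"]
    print(map_at_3(preds, labels))
```

Under this reading, a score near 0.72, as in the table above, would mean the correct label usually appears within the top three guesses but is not always ranked first.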
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c667c5952bad6c915cedd127c4f44c07cab71c4d24c4532a2455e0f3872c8136
  size 1740387701
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:4a07a2849b85ab664d04dcd6fd00913123ee85ee9fccc9af1701d226cb6c73e1
  size 1740387701
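
The `pytorch_model.bin` change above only touches a Git LFS pointer: the size stays at 1740387701 bytes and only the `oid` changes. As a minimal sketch, a downloaded weights file could be checked against such a pointer as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code assumes the pointer uses the `key value` line layout shown in the diff.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.strip().split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,          # e.g. "sha256"
        "digest": digest,      # hex digest of the real file contents
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte checkpoint is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer_text = (
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:4a07a2849b85ab664d04dcd6fd00913123ee85ee9fccc9af1701d226cb6c73e1\n"
        "size 1740387701\n"
    )
    pointer = parse_lfs_pointer(pointer_text)
    # Assumes a local copy of the weights exists at this path.
    print(matches_pointer("pytorch_model.bin", pointer))
```

Because the file size is identical in both commits, the digest is the only field that distinguishes the two checkpoints.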