wcvz commited on
Commit
51dd844
1 Parent(s): 4d1d288

Model save

Browse files
Files changed (2) hide show
  1. README.md +362 -0
  2. adapter_model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,362 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: peft
4
+ tags:
5
+ - generated_from_trainer
6
+ base_model: facebook/esm2_t30_150M_UR50D
7
+ metrics:
8
+ - accuracy
9
+ model-index:
10
+ - name: esm2_t130_150M-lora-classifier_2024-04-26_00-25-40
11
+ results: []
12
+ ---
13
+
14
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
+ should probably proofread and complete it, then remove this comment. -->
16
+
17
+ # esm2_t130_150M-lora-classifier_2024-04-26_00-25-40
18
+
19
+ This model is a fine-tuned version of [facebook/esm2_t30_150M_UR50D](https://huggingface.co/facebook/esm2_t30_150M_UR50D) on the None dataset.
20
+ It achieves the following results on the evaluation set:
21
+ - Loss: 1.6470
22
+ - Accuracy: 0.8887
23
+
24
+ ## Model description
25
+
26
+ More information needed
27
+
28
+ ## Intended uses & limitations
29
+
30
+ More information needed
31
+
32
+ ## Training and evaluation data
33
+
34
+ More information needed
35
+
36
+ ## Training procedure
37
+
38
+ ### Training hyperparameters
39
+
40
+ The following hyperparameters were used during training:
41
+ - learning_rate: 0.0005701568055793089
42
+ - train_batch_size: 28
43
+ - eval_batch_size: 28
44
+ - seed: 8893
45
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
+ - lr_scheduler_type: cosine
47
+ - num_epochs: 300
48
+ - mixed_precision_training: Native AMP
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|
54
+ | 0.7096 | 1.0 | 55 | 0.6718 | 0.6055 |
55
+ | 0.6769 | 2.0 | 110 | 0.6739 | 0.6055 |
56
+ | 0.579 | 3.0 | 165 | 0.6608 | 0.6484 |
57
+ | 0.5726 | 4.0 | 220 | 0.5777 | 0.7109 |
58
+ | 0.6381 | 5.0 | 275 | 0.5020 | 0.7676 |
59
+ | 0.183 | 6.0 | 330 | 0.3725 | 0.8320 |
60
+ | 0.3701 | 7.0 | 385 | 0.3508 | 0.8535 |
61
+ | 0.2147 | 8.0 | 440 | 0.3191 | 0.8711 |
62
+ | 0.1654 | 9.0 | 495 | 0.3036 | 0.875 |
63
+ | 0.1581 | 10.0 | 550 | 0.3761 | 0.8516 |
64
+ | 0.3459 | 11.0 | 605 | 0.3746 | 0.8594 |
65
+ | 0.3325 | 12.0 | 660 | 0.3025 | 0.8867 |
66
+ | 0.1237 | 13.0 | 715 | 0.2983 | 0.8770 |
67
+ | 0.5167 | 14.0 | 770 | 0.3044 | 0.8887 |
68
+ | 0.3541 | 15.0 | 825 | 0.2927 | 0.8906 |
69
+ | 0.0378 | 16.0 | 880 | 0.3669 | 0.8906 |
70
+ | 0.062 | 17.0 | 935 | 0.3298 | 0.8887 |
71
+ | 0.1695 | 18.0 | 990 | 0.2912 | 0.9004 |
72
+ | 0.0444 | 19.0 | 1045 | 0.3034 | 0.9004 |
73
+ | 0.1794 | 20.0 | 1100 | 0.3641 | 0.8828 |
74
+ | 0.0634 | 21.0 | 1155 | 0.3521 | 0.8867 |
75
+ | 0.0446 | 22.0 | 1210 | 0.3438 | 0.8887 |
76
+ | 0.0266 | 23.0 | 1265 | 0.4553 | 0.8867 |
77
+ | 0.2637 | 24.0 | 1320 | 0.4715 | 0.8867 |
78
+ | 0.159 | 25.0 | 1375 | 0.4323 | 0.8945 |
79
+ | 0.2401 | 26.0 | 1430 | 0.6019 | 0.8809 |
80
+ | 0.1317 | 27.0 | 1485 | 0.5549 | 0.8906 |
81
+ | 0.1223 | 28.0 | 1540 | 0.4819 | 0.8926 |
82
+ | 0.0015 | 29.0 | 1595 | 0.6432 | 0.8711 |
83
+ | 0.0007 | 30.0 | 1650 | 0.6480 | 0.8926 |
84
+ | 0.0774 | 31.0 | 1705 | 0.7596 | 0.8926 |
85
+ | 0.1262 | 32.0 | 1760 | 0.7614 | 0.8809 |
86
+ | 0.034 | 33.0 | 1815 | 0.7392 | 0.8789 |
87
+ | 0.0021 | 34.0 | 1870 | 0.9068 | 0.8848 |
88
+ | 0.0003 | 35.0 | 1925 | 0.8724 | 0.8711 |
89
+ | 0.0001 | 36.0 | 1980 | 0.9483 | 0.8867 |
90
+ | 0.0127 | 37.0 | 2035 | 0.9638 | 0.8828 |
91
+ | 0.0001 | 38.0 | 2090 | 0.9105 | 0.8926 |
92
+ | 0.0001 | 39.0 | 2145 | 0.9231 | 0.8809 |
93
+ | 0.0008 | 40.0 | 2200 | 1.0224 | 0.8867 |
94
+ | 0.0001 | 41.0 | 2255 | 1.0666 | 0.8848 |
95
+ | 0.0002 | 42.0 | 2310 | 1.1028 | 0.8848 |
96
+ | 0.0 | 43.0 | 2365 | 0.9653 | 0.8906 |
97
+ | 0.0006 | 44.0 | 2420 | 1.1108 | 0.8848 |
98
+ | 0.0001 | 45.0 | 2475 | 1.2919 | 0.8730 |
99
+ | 0.0002 | 46.0 | 2530 | 1.0834 | 0.8926 |
100
+ | 0.0002 | 47.0 | 2585 | 1.1240 | 0.8887 |
101
+ | 0.0135 | 48.0 | 2640 | 1.1466 | 0.8887 |
102
+ | 0.0008 | 49.0 | 2695 | 1.2674 | 0.8691 |
103
+ | 0.0 | 50.0 | 2750 | 1.1311 | 0.8887 |
104
+ | 0.0086 | 51.0 | 2805 | 1.0957 | 0.8887 |
105
+ | 0.0 | 52.0 | 2860 | 1.1336 | 0.8789 |
106
+ | 0.0007 | 53.0 | 2915 | 1.1494 | 0.875 |
107
+ | 0.0002 | 54.0 | 2970 | 1.0790 | 0.8848 |
108
+ | 0.0002 | 55.0 | 3025 | 1.1489 | 0.8809 |
109
+ | 0.0 | 56.0 | 3080 | 1.1479 | 0.8867 |
110
+ | 0.0022 | 57.0 | 3135 | 1.2092 | 0.8848 |
111
+ | 0.2415 | 58.0 | 3190 | 1.2060 | 0.8848 |
112
+ | 0.7813 | 59.0 | 3245 | 1.3750 | 0.8613 |
113
+ | 0.0 | 60.0 | 3300 | 1.1202 | 0.875 |
114
+ | 0.0 | 61.0 | 3355 | 1.0502 | 0.8848 |
115
+ | 0.0 | 62.0 | 3410 | 1.3270 | 0.8730 |
116
+ | 0.0015 | 63.0 | 3465 | 1.0082 | 0.875 |
117
+ | 0.0002 | 64.0 | 3520 | 0.9724 | 0.8867 |
118
+ | 0.0014 | 65.0 | 3575 | 1.0862 | 0.8770 |
119
+ | 0.0002 | 66.0 | 3630 | 1.1366 | 0.8730 |
120
+ | 0.1868 | 67.0 | 3685 | 1.1838 | 0.8770 |
121
+ | 0.0004 | 68.0 | 3740 | 1.2073 | 0.875 |
122
+ | 0.0007 | 69.0 | 3795 | 1.1793 | 0.8770 |
123
+ | 0.0 | 70.0 | 3850 | 1.2262 | 0.8652 |
124
+ | 0.2838 | 71.0 | 3905 | 1.2415 | 0.875 |
125
+ | 0.0 | 72.0 | 3960 | 1.2346 | 0.8770 |
126
+ | 0.0041 | 73.0 | 4015 | 1.0830 | 0.8789 |
127
+ | 0.0055 | 74.0 | 4070 | 1.0731 | 0.8867 |
128
+ | 0.0 | 75.0 | 4125 | 1.4096 | 0.8652 |
129
+ | 0.0034 | 76.0 | 4180 | 1.1142 | 0.8711 |
130
+ | 0.0 | 77.0 | 4235 | 1.0250 | 0.8848 |
131
+ | 0.0002 | 78.0 | 4290 | 1.0700 | 0.8691 |
132
+ | 0.0009 | 79.0 | 4345 | 0.9032 | 0.8789 |
133
+ | 0.0001 | 80.0 | 4400 | 1.0556 | 0.8730 |
134
+ | 0.0001 | 81.0 | 4455 | 1.0740 | 0.8770 |
135
+ | 0.0002 | 82.0 | 4510 | 1.2571 | 0.8691 |
136
+ | 0.0 | 83.0 | 4565 | 1.2007 | 0.8809 |
137
+ | 0.0 | 84.0 | 4620 | 1.2515 | 0.875 |
138
+ | 0.0001 | 85.0 | 4675 | 1.0750 | 0.8828 |
139
+ | 0.0006 | 86.0 | 4730 | 1.3016 | 0.8730 |
140
+ | 0.0001 | 87.0 | 4785 | 1.2393 | 0.8809 |
141
+ | 0.0 | 88.0 | 4840 | 1.2232 | 0.8848 |
142
+ | 0.0003 | 89.0 | 4895 | 1.2187 | 0.8789 |
143
+ | 0.0 | 90.0 | 4950 | 1.2328 | 0.8730 |
144
+ | 0.0 | 91.0 | 5005 | 1.3026 | 0.8848 |
145
+ | 0.0 | 92.0 | 5060 | 1.3152 | 0.8770 |
146
+ | 0.0 | 93.0 | 5115 | 1.4069 | 0.875 |
147
+ | 0.0 | 94.0 | 5170 | 1.3988 | 0.8770 |
148
+ | 0.0 | 95.0 | 5225 | 1.3675 | 0.8594 |
149
+ | 0.0 | 96.0 | 5280 | 1.3366 | 0.8770 |
150
+ | 0.0003 | 97.0 | 5335 | 1.2140 | 0.8848 |
151
+ | 0.0 | 98.0 | 5390 | 1.3585 | 0.8711 |
152
+ | 0.0 | 99.0 | 5445 | 1.1665 | 0.8672 |
153
+ | 0.0 | 100.0 | 5500 | 1.0947 | 0.8809 |
154
+ | 0.0099 | 101.0 | 5555 | 1.2993 | 0.8730 |
155
+ | 0.0 | 102.0 | 5610 | 1.3578 | 0.8789 |
156
+ | 0.0 | 103.0 | 5665 | 1.3596 | 0.8867 |
157
+ | 0.0006 | 104.0 | 5720 | 1.3164 | 0.8848 |
158
+ | 0.0 | 105.0 | 5775 | 1.4100 | 0.8770 |
159
+ | 0.0 | 106.0 | 5830 | 1.3459 | 0.875 |
160
+ | 0.0005 | 107.0 | 5885 | 1.3783 | 0.8809 |
161
+ | 0.0 | 108.0 | 5940 | 1.2698 | 0.8770 |
162
+ | 0.0 | 109.0 | 5995 | 1.3933 | 0.8848 |
163
+ | 0.0 | 110.0 | 6050 | 1.3813 | 0.8809 |
164
+ | 0.0 | 111.0 | 6105 | 1.5747 | 0.875 |
165
+ | 0.0001 | 112.0 | 6160 | 1.3368 | 0.8867 |
166
+ | 0.0486 | 113.0 | 6215 | 1.3833 | 0.8828 |
167
+ | 0.1476 | 114.0 | 6270 | 1.4943 | 0.8828 |
168
+ | 0.0002 | 115.0 | 6325 | 1.4725 | 0.8789 |
169
+ | 0.0 | 116.0 | 6380 | 1.4614 | 0.875 |
170
+ | 0.0047 | 117.0 | 6435 | 1.6313 | 0.8770 |
171
+ | 0.0 | 118.0 | 6490 | 1.4459 | 0.8848 |
172
+ | 0.0026 | 119.0 | 6545 | 1.4150 | 0.8730 |
173
+ | 0.0 | 120.0 | 6600 | 1.6055 | 0.8555 |
174
+ | 0.0001 | 121.0 | 6655 | 1.3710 | 0.8789 |
175
+ | 0.3319 | 122.0 | 6710 | 1.3940 | 0.8867 |
176
+ | 0.0001 | 123.0 | 6765 | 1.2486 | 0.875 |
177
+ | 0.0002 | 124.0 | 6820 | 1.2946 | 0.8711 |
178
+ | 0.0 | 125.0 | 6875 | 1.2341 | 0.8711 |
179
+ | 0.0 | 126.0 | 6930 | 1.1418 | 0.8887 |
180
+ | 0.0 | 127.0 | 6985 | 1.0713 | 0.8926 |
181
+ | 0.0001 | 128.0 | 7040 | 1.1391 | 0.8613 |
182
+ | 0.1624 | 129.0 | 7095 | 1.2195 | 0.8789 |
183
+ | 0.0 | 130.0 | 7150 | 1.1576 | 0.8770 |
184
+ | 0.0001 | 131.0 | 7205 | 1.2939 | 0.8730 |
185
+ | 0.0 | 132.0 | 7260 | 1.1568 | 0.8867 |
186
+ | 0.0 | 133.0 | 7315 | 1.2117 | 0.8848 |
187
+ | 0.0 | 134.0 | 7370 | 1.1264 | 0.8926 |
188
+ | 0.0 | 135.0 | 7425 | 1.1675 | 0.8848 |
189
+ | 0.0 | 136.0 | 7480 | 1.1983 | 0.8828 |
190
+ | 0.0 | 137.0 | 7535 | 1.2666 | 0.8770 |
191
+ | 0.0001 | 138.0 | 7590 | 1.1287 | 0.8848 |
192
+ | 0.0 | 139.0 | 7645 | 1.0505 | 0.8848 |
193
+ | 0.0 | 140.0 | 7700 | 1.1770 | 0.8770 |
194
+ | 0.0 | 141.0 | 7755 | 1.1749 | 0.8906 |
195
+ | 0.0 | 142.0 | 7810 | 1.1311 | 0.8711 |
196
+ | 0.0 | 143.0 | 7865 | 1.1114 | 0.8652 |
197
+ | 0.0 | 144.0 | 7920 | 1.1419 | 0.8691 |
198
+ | 0.0 | 145.0 | 7975 | 1.1666 | 0.8691 |
199
+ | 0.0 | 146.0 | 8030 | 1.1712 | 0.8711 |
200
+ | 0.0 | 147.0 | 8085 | 1.1831 | 0.8711 |
201
+ | 0.0 | 148.0 | 8140 | 1.1799 | 0.8711 |
202
+ | 0.0 | 149.0 | 8195 | 1.1876 | 0.8711 |
203
+ | 0.0 | 150.0 | 8250 | 1.1884 | 0.8730 |
204
+ | 0.0 | 151.0 | 8305 | 1.2389 | 0.8730 |
205
+ | 0.0 | 152.0 | 8360 | 1.3622 | 0.875 |
206
+ | 0.0 | 153.0 | 8415 | 1.2604 | 0.8789 |
207
+ | 0.0 | 154.0 | 8470 | 1.3336 | 0.875 |
208
+ | 0.0 | 155.0 | 8525 | 1.3496 | 0.8809 |
209
+ | 0.0 | 156.0 | 8580 | 1.3882 | 0.8555 |
210
+ | 0.1815 | 157.0 | 8635 | 1.3679 | 0.8789 |
211
+ | 0.288 | 158.0 | 8690 | 1.3804 | 0.8691 |
212
+ | 0.0 | 159.0 | 8745 | 1.2980 | 0.8770 |
213
+ | 0.0 | 160.0 | 8800 | 1.4075 | 0.8789 |
214
+ | 0.0 | 161.0 | 8855 | 1.4231 | 0.8789 |
215
+ | 0.0 | 162.0 | 8910 | 1.4730 | 0.875 |
216
+ | 0.0019 | 163.0 | 8965 | 1.5861 | 0.8672 |
217
+ | 0.0 | 164.0 | 9020 | 1.4080 | 0.8809 |
218
+ | 0.0005 | 165.0 | 9075 | 1.5852 | 0.8711 |
219
+ | 0.0 | 166.0 | 9130 | 1.5370 | 0.875 |
220
+ | 0.0 | 167.0 | 9185 | 1.5288 | 0.875 |
221
+ | 0.0 | 168.0 | 9240 | 1.5516 | 0.8711 |
222
+ | 0.0 | 169.0 | 9295 | 1.5268 | 0.8730 |
223
+ | 0.0 | 170.0 | 9350 | 1.5061 | 0.8672 |
224
+ | 0.0 | 171.0 | 9405 | 1.4843 | 0.875 |
225
+ | 0.0 | 172.0 | 9460 | 1.5478 | 0.8633 |
226
+ | 0.0 | 173.0 | 9515 | 1.4753 | 0.8730 |
227
+ | 0.0 | 174.0 | 9570 | 1.6709 | 0.8730 |
228
+ | 0.0 | 175.0 | 9625 | 1.6663 | 0.875 |
229
+ | 0.0 | 176.0 | 9680 | 1.6980 | 0.8672 |
230
+ | 0.0 | 177.0 | 9735 | 1.5563 | 0.8770 |
231
+ | 0.0 | 178.0 | 9790 | 1.6146 | 0.875 |
232
+ | 0.0 | 179.0 | 9845 | 1.5599 | 0.8770 |
233
+ | 0.0 | 180.0 | 9900 | 1.5558 | 0.8789 |
234
+ | 0.0 | 181.0 | 9955 | 1.8485 | 0.8633 |
235
+ | 0.0 | 182.0 | 10010 | 1.7223 | 0.8789 |
236
+ | 0.0 | 183.0 | 10065 | 1.7169 | 0.875 |
237
+ | 0.0 | 184.0 | 10120 | 1.7125 | 0.8711 |
238
+ | 0.0 | 185.0 | 10175 | 1.7065 | 0.8711 |
239
+ | 0.0 | 186.0 | 10230 | 1.7748 | 0.8730 |
240
+ | 0.0 | 187.0 | 10285 | 1.6861 | 0.8789 |
241
+ | 0.0 | 188.0 | 10340 | 1.7325 | 0.8887 |
242
+ | 0.0 | 189.0 | 10395 | 1.7658 | 0.8828 |
243
+ | 0.0 | 190.0 | 10450 | 1.7649 | 0.8809 |
244
+ | 0.0 | 191.0 | 10505 | 1.7555 | 0.8828 |
245
+ | 0.0162 | 192.0 | 10560 | 1.8313 | 0.8691 |
246
+ | 0.0001 | 193.0 | 10615 | 1.8314 | 0.8574 |
247
+ | 0.0 | 194.0 | 10670 | 1.7706 | 0.8672 |
248
+ | 0.0 | 195.0 | 10725 | 1.6568 | 0.8730 |
249
+ | 0.0 | 196.0 | 10780 | 1.6568 | 0.8770 |
250
+ | 0.0 | 197.0 | 10835 | 1.6185 | 0.8848 |
251
+ | 0.0 | 198.0 | 10890 | 1.6133 | 0.8848 |
252
+ | 0.0 | 199.0 | 10945 | 1.6129 | 0.8848 |
253
+ | 0.0 | 200.0 | 11000 | 1.6121 | 0.8848 |
254
+ | 0.0 | 201.0 | 11055 | 1.6104 | 0.8828 |
255
+ | 0.0 | 202.0 | 11110 | 1.6075 | 0.8828 |
256
+ | 0.0 | 203.0 | 11165 | 1.6153 | 0.8867 |
257
+ | 0.0 | 204.0 | 11220 | 1.6339 | 0.8828 |
258
+ | 0.0 | 205.0 | 11275 | 1.6164 | 0.8867 |
259
+ | 0.0 | 206.0 | 11330 | 1.6114 | 0.8848 |
260
+ | 0.0 | 207.0 | 11385 | 1.6122 | 0.8867 |
261
+ | 0.0 | 208.0 | 11440 | 1.6079 | 0.8867 |
262
+ | 0.0 | 209.0 | 11495 | 1.6132 | 0.8867 |
263
+ | 0.0 | 210.0 | 11550 | 1.6141 | 0.8867 |
264
+ | 0.0 | 211.0 | 11605 | 1.6122 | 0.8867 |
265
+ | 0.0 | 212.0 | 11660 | 1.6070 | 0.8867 |
266
+ | 0.0 | 213.0 | 11715 | 1.6010 | 0.8867 |
267
+ | 0.0 | 214.0 | 11770 | 1.6562 | 0.8789 |
268
+ | 0.0005 | 215.0 | 11825 | 1.6297 | 0.8887 |
269
+ | 0.0 | 216.0 | 11880 | 1.6070 | 0.8809 |
270
+ | 0.0 | 217.0 | 11935 | 1.6750 | 0.8770 |
271
+ | 0.0 | 218.0 | 11990 | 1.6822 | 0.8730 |
272
+ | 0.0 | 219.0 | 12045 | 1.6819 | 0.8730 |
273
+ | 0.0 | 220.0 | 12100 | 1.6846 | 0.8770 |
274
+ | 0.0 | 221.0 | 12155 | 1.6827 | 0.875 |
275
+ | 0.0 | 222.0 | 12210 | 1.6822 | 0.875 |
276
+ | 0.0 | 223.0 | 12265 | 1.6780 | 0.8770 |
277
+ | 0.0 | 224.0 | 12320 | 1.6813 | 0.8770 |
278
+ | 0.0 | 225.0 | 12375 | 1.6770 | 0.8770 |
279
+ | 0.0 | 226.0 | 12430 | 1.6878 | 0.8789 |
280
+ | 0.0 | 227.0 | 12485 | 1.8890 | 0.8672 |
281
+ | 0.0 | 228.0 | 12540 | 1.6978 | 0.8828 |
282
+ | 0.0 | 229.0 | 12595 | 1.6945 | 0.8867 |
283
+ | 0.0 | 230.0 | 12650 | 1.6960 | 0.8848 |
284
+ | 0.0 | 231.0 | 12705 | 1.6972 | 0.8867 |
285
+ | 0.0 | 232.0 | 12760 | 1.6929 | 0.8867 |
286
+ | 0.0 | 233.0 | 12815 | 1.6911 | 0.8848 |
287
+ | 0.0 | 234.0 | 12870 | 1.6887 | 0.8867 |
288
+ | 0.0 | 235.0 | 12925 | 1.6999 | 0.8848 |
289
+ | 0.0 | 236.0 | 12980 | 1.7000 | 0.8848 |
290
+ | 0.0 | 237.0 | 13035 | 1.6877 | 0.8867 |
291
+ | 0.0 | 238.0 | 13090 | 1.6858 | 0.8867 |
292
+ | 0.0 | 239.0 | 13145 | 1.6859 | 0.8867 |
293
+ | 0.0 | 240.0 | 13200 | 1.6842 | 0.8867 |
294
+ | 0.0 | 241.0 | 13255 | 1.6829 | 0.8867 |
295
+ | 0.0 | 242.0 | 13310 | 1.6800 | 0.8867 |
296
+ | 0.0 | 243.0 | 13365 | 1.6870 | 0.8848 |
297
+ | 0.0 | 244.0 | 13420 | 1.6856 | 0.8848 |
298
+ | 0.0 | 245.0 | 13475 | 1.6831 | 0.8848 |
299
+ | 0.0 | 246.0 | 13530 | 1.6864 | 0.8828 |
300
+ | 0.0 | 247.0 | 13585 | 1.6896 | 0.8828 |
301
+ | 0.0 | 248.0 | 13640 | 1.6900 | 0.8828 |
302
+ | 0.0 | 249.0 | 13695 | 1.6906 | 0.8848 |
303
+ | 0.0 | 250.0 | 13750 | 1.6928 | 0.8828 |
304
+ | 0.0 | 251.0 | 13805 | 1.6943 | 0.8828 |
305
+ | 0.0 | 252.0 | 13860 | 1.6902 | 0.8789 |
306
+ | 0.0 | 253.0 | 13915 | 1.6638 | 0.8887 |
307
+ | 0.0 | 254.0 | 13970 | 1.6632 | 0.8867 |
308
+ | 0.0 | 255.0 | 14025 | 1.6627 | 0.8867 |
309
+ | 0.0 | 256.0 | 14080 | 1.6631 | 0.8867 |
310
+ | 0.0 | 257.0 | 14135 | 1.6626 | 0.8867 |
311
+ | 0.0 | 258.0 | 14190 | 1.6629 | 0.8867 |
312
+ | 0.0 | 259.0 | 14245 | 1.6617 | 0.8867 |
313
+ | 0.0 | 260.0 | 14300 | 1.6606 | 0.8867 |
314
+ | 0.0 | 261.0 | 14355 | 1.6598 | 0.8867 |
315
+ | 0.0 | 262.0 | 14410 | 1.6559 | 0.8867 |
316
+ | 0.0 | 263.0 | 14465 | 1.6564 | 0.8867 |
317
+ | 0.0 | 264.0 | 14520 | 1.6555 | 0.8867 |
318
+ | 0.0 | 265.0 | 14575 | 1.6588 | 0.8867 |
319
+ | 0.0 | 266.0 | 14630 | 1.6565 | 0.8867 |
320
+ | 0.0 | 267.0 | 14685 | 1.6558 | 0.8867 |
321
+ | 0.0 | 268.0 | 14740 | 1.6564 | 0.8848 |
322
+ | 0.0 | 269.0 | 14795 | 1.6578 | 0.8848 |
323
+ | 0.0 | 270.0 | 14850 | 1.6566 | 0.8848 |
324
+ | 0.0 | 271.0 | 14905 | 1.6560 | 0.8867 |
325
+ | 0.0 | 272.0 | 14960 | 1.6587 | 0.8848 |
326
+ | 0.0 | 273.0 | 15015 | 1.6575 | 0.8867 |
327
+ | 0.0 | 274.0 | 15070 | 1.6575 | 0.8848 |
328
+ | 0.0 | 275.0 | 15125 | 1.6570 | 0.8867 |
329
+ | 0.0 | 276.0 | 15180 | 1.6586 | 0.8848 |
330
+ | 0.0 | 277.0 | 15235 | 1.6572 | 0.8887 |
331
+ | 0.0 | 278.0 | 15290 | 1.6577 | 0.8848 |
332
+ | 0.0 | 279.0 | 15345 | 1.6570 | 0.8867 |
333
+ | 0.0 | 280.0 | 15400 | 1.6567 | 0.8887 |
334
+ | 0.0 | 281.0 | 15455 | 1.6548 | 0.8887 |
335
+ | 0.0 | 282.0 | 15510 | 1.6558 | 0.8867 |
336
+ | 0.0 | 283.0 | 15565 | 1.6505 | 0.8887 |
337
+ | 0.0 | 284.0 | 15620 | 1.6515 | 0.8887 |
338
+ | 0.0 | 285.0 | 15675 | 1.6513 | 0.8887 |
339
+ | 0.0 | 286.0 | 15730 | 1.6456 | 0.8887 |
340
+ | 0.0 | 287.0 | 15785 | 1.6471 | 0.8887 |
341
+ | 0.0 | 288.0 | 15840 | 1.6451 | 0.8887 |
342
+ | 0.0 | 289.0 | 15895 | 1.6468 | 0.8887 |
343
+ | 0.0 | 290.0 | 15950 | 1.6470 | 0.8887 |
344
+ | 0.0 | 291.0 | 16005 | 1.6448 | 0.8887 |
345
+ | 0.0 | 292.0 | 16060 | 1.6478 | 0.8887 |
346
+ | 0.0 | 293.0 | 16115 | 1.6475 | 0.8887 |
347
+ | 0.0 | 294.0 | 16170 | 1.6471 | 0.8887 |
348
+ | 0.0 | 295.0 | 16225 | 1.6476 | 0.8887 |
349
+ | 0.0 | 296.0 | 16280 | 1.6475 | 0.8887 |
350
+ | 0.0 | 297.0 | 16335 | 1.6460 | 0.8887 |
351
+ | 0.0 | 298.0 | 16390 | 1.6471 | 0.8887 |
352
+ | 0.0 | 299.0 | 16445 | 1.6469 | 0.8887 |
353
+ | 0.0 | 300.0 | 16500 | 1.6470 | 0.8887 |
354
+
355
+
356
+ ### Framework versions
357
+
358
+ - PEFT 0.10.0
359
+ - Transformers 4.39.3
360
+ - Pytorch 2.2.1
361
+ - Datasets 2.16.1
362
+ - Tokenizers 0.15.2
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:985d80cd11b9d0bf8c20e460590d4993fa5fb2f3507060eac5d444fecf83be50
3
  size 3514768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9d60f5e239b7b910b7c026acf89af3f8dc3d9f5b3b6eeae91a76a2940bc5364d
3
  size 3514768