File size: 31,426 Bytes
c899517
 
 
 
9153d7d
 
c899517
 
 
 
9153d7d
 
c899517
 
 
 
 
 
 
 
 
 
9153d7d
c899517
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
---
license: apache-2.0
library_name: peft
tags:
- alignment-handbook
- generated_from_trainer
- trl
- dpo
- generated_from_trainer
base_model: mistralai/Mistral-7B-v0.1
datasets:
- HuggingFaceH4/ultrafeedback_binarized
model-index:
- name: zephyr-7b-gpo-update4-i0
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# zephyr-7b-gpo-update4-i0

This model is a fine-tuned version of [alignment-handbook/zephyr-7b-sft-qlora](https://huggingface.co/alignment-handbook/zephyr-7b-sft-qlora) on the HuggingFaceH4/ultrafeedback_binarized dataset.
It achieves the following results on the evaluation set:
- Loss: 0.0239
- Rewards/chosen: -0.0415
- Rewards/rejected: -0.1266
- Rewards/accuracies: 0.6600
- Rewards/margins: 0.0851
- Logps/rejected: -236.9313
- Logps/chosen: -240.2975
- Logits/rejected: -2.0900
- Logits/chosen: -2.2780

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 5e-06
- train_batch_size: 2
- eval_batch_size: 2
- seed: 42
- distributed_type: multi-GPU
- gradient_accumulation_steps: 2
- total_train_batch_size: 4
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_ratio: 0.1
- num_epochs: 1

### Training results

| Training Loss | Epoch | Step  | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
|:-------------:|:-----:|:-----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
| 0.0706        | 0.01  | 100   | 0.0536          | 0.0012         | 0.0008           | 0.4925             | 0.0004          | -211.4570      | -231.7682    | -2.1603         | -2.3487       |
| 0.0614        | 0.01  | 200   | 0.0524          | 0.0029         | -0.0002          | 0.5890             | 0.0031          | -211.6427      | -231.4156    | -2.1610         | -2.3494       |
| 0.0495        | 0.02  | 300   | 0.0499          | 0.0190         | 0.0096           | 0.5805             | 0.0093          | -209.6874      | -228.2110    | -2.1645         | -2.3531       |
| 0.065         | 0.03  | 400   | 0.0470          | 0.0242         | 0.0074           | 0.5980             | 0.0167          | -210.1239      | -227.1680    | -2.1655         | -2.3542       |
| 0.04          | 0.03  | 500   | 0.0416          | -0.0511        | -0.0901          | 0.6125             | 0.0390          | -229.6261      | -242.2272    | -2.1418         | -2.3291       |
| 0.0313        | 0.04  | 600   | 0.0413          | -0.0451        | -0.0781          | 0.6160             | 0.0330          | -227.2309      | -241.0243    | -2.1400         | -2.3279       |
| 0.0519        | 0.05  | 700   | 0.0408          | 0.0360         | 0.0023           | 0.6155             | 0.0336          | -211.1453      | -224.8123    | -2.1490         | -2.3374       |
| 0.034         | 0.05  | 800   | 0.0369          | -0.0217        | -0.0726          | 0.6125             | 0.0510          | -226.1373      | -236.3390    | -2.1339         | -2.3227       |
| 0.0343        | 0.06  | 900   | 0.0361          | 0.0040         | -0.0439          | 0.6005             | 0.0478          | -220.3831      | -231.2078    | -2.1160         | -2.3030       |
| 0.0482        | 0.07  | 1000  | 0.0360          | -0.0952        | -0.1491          | 0.6100             | 0.0539          | -241.4311      | -251.0361    | -2.1121         | -2.2986       |
| 0.0316        | 0.07  | 1100  | 0.0415          | -0.0564        | -0.1248          | 0.6045             | 0.0684          | -236.5681      | -243.2760    | -2.1106         | -2.2966       |
| 0.0326        | 0.08  | 1200  | 0.0331          | -0.0594        | -0.1297          | 0.6405             | 0.0702          | -237.5470      | -243.8901    | -2.1027         | -2.2877       |
| 0.0313        | 0.09  | 1300  | 0.0313          | -0.0027        | -0.0614          | 0.6320             | 0.0587          | -223.8952      | -232.5400    | -2.1139         | -2.2994       |
| 0.0345        | 0.09  | 1400  | 0.0331          | -0.0106        | -0.0663          | 0.6275             | 0.0557          | -224.8707      | -234.1205    | -2.0508         | -2.2308       |
| 0.0629        | 0.1   | 1500  | 0.0340          | -0.0537        | -0.1284          | 0.6440             | 0.0747          | -237.2957      | -242.7522    | -2.0009         | -2.1776       |
| 0.0313        | 0.1   | 1600  | 0.0311          | -0.0639        | -0.1417          | 0.6310             | 0.0777          | -239.9465      | -244.7944    | -2.0578         | -2.2404       |
| 0.0287        | 0.11  | 1700  | 0.0303          | -0.0281        | -0.0938          | 0.6360             | 0.0657          | -230.3665      | -237.6215    | -2.1022         | -2.2867       |
| 0.0335        | 0.12  | 1800  | 0.0316          | 0.0007         | -0.0533          | 0.6260             | 0.0540          | -222.2785      | -231.8674    | -2.1158         | -2.3011       |
| 0.0209        | 0.12  | 1900  | 0.0333          | -0.0611        | -0.1124          | 0.6400             | 0.0513          | -234.0950      | -244.2219    | -2.1046         | -2.2893       |
| 0.0183        | 0.13  | 2000  | 0.0302          | -0.0622        | -0.1366          | 0.6500             | 0.0744          | -238.9349      | -244.4466    | -2.1213         | -2.3085       |
| 0.0235        | 0.14  | 2100  | 0.0289          | -0.0383        | -0.1150          | 0.6475             | 0.0767          | -234.6193      | -239.6733    | -2.0933         | -2.2787       |
| 0.0401        | 0.14  | 2200  | 0.0284          | -0.0577        | -0.1367          | 0.6370             | 0.0790          | -238.9556      | -243.5481    | -2.0898         | -2.2757       |
| 0.0257        | 0.15  | 2300  | 0.0304          | -0.0226        | -0.1119          | 0.6300             | 0.0893          | -233.9949      | -236.5215    | -2.0975         | -2.2834       |
| 0.0339        | 0.16  | 2400  | 0.0306          | 0.0075         | -0.0542          | 0.6350             | 0.0617          | -222.4461      | -230.5073    | -2.1318         | -2.3176       |
| 0.0132        | 0.16  | 2500  | 0.0312          | -0.0022        | -0.0606          | 0.6350             | 0.0584          | -223.7263      | -232.4370    | -2.1390         | -2.3256       |
| 0.0196        | 0.17  | 2600  | 0.0281          | -0.0069        | -0.0808          | 0.6500             | 0.0739          | -227.7710      | -233.3759    | -2.1025         | -2.2871       |
| 0.0317        | 0.18  | 2700  | 0.0280          | -0.0329        | -0.1059          | 0.6545             | 0.0730          | -232.7858      | -238.5806    | -2.1090         | -2.2942       |
| 0.036         | 0.18  | 2800  | 0.0279          | -0.0178        | -0.0869          | 0.6365             | 0.0691          | -228.9888      | -235.5567    | -2.1050         | -2.2897       |
| 0.0353        | 0.19  | 2900  | 0.0279          | -0.0415        | -0.1092          | 0.6445             | 0.0677          | -233.4535      | -240.3115    | -2.1000         | -2.2848       |
| 0.0259        | 0.2   | 3000  | 0.0289          | -0.0379        | -0.1213          | 0.6505             | 0.0834          | -235.8732      | -239.5773    | -2.0886         | -2.2741       |
| 0.0362        | 0.2   | 3100  | 0.0289          | -0.1100        | -0.1916          | 0.6485             | 0.0816          | -249.9393      | -254.0055    | -2.0925         | -2.2772       |
| 0.0319        | 0.21  | 3200  | 0.0283          | -0.0527        | -0.1321          | 0.6385             | 0.0794          | -238.0300      | -242.5391    | -2.0741         | -2.2569       |
| 0.0333        | 0.22  | 3300  | 0.0280          | -0.0509        | -0.1397          | 0.6535             | 0.0887          | -239.5463      | -242.1913    | -2.0690         | -2.2521       |
| 0.0347        | 0.22  | 3400  | 0.0285          | -0.0420        | -0.1146          | 0.6420             | 0.0726          | -234.5293      | -240.4102    | -2.0931         | -2.2767       |
| 0.025         | 0.23  | 3500  | 0.0277          | -0.0120        | -0.0953          | 0.6560             | 0.0833          | -230.6713      | -234.4121    | -2.0685         | -2.2513       |
| 0.0305        | 0.24  | 3600  | 0.0276          | -0.0113        | -0.0943          | 0.6520             | 0.0830          | -230.4667      | -234.2561    | -2.0925         | -2.2770       |
| 0.0331        | 0.24  | 3700  | 0.0283          | -0.0661        | -0.1472          | 0.6435             | 0.0812          | -241.0565      | -245.2157    | -2.0870         | -2.2712       |
| 0.0351        | 0.25  | 3800  | 0.0291          | -0.0335        | -0.1002          | 0.6410             | 0.0667          | -231.6431      | -238.6974    | -2.1198         | -2.3060       |
| 0.0164        | 0.26  | 3900  | 0.0280          | -0.0089        | -0.0886          | 0.6340             | 0.0797          | -229.3295      | -233.7921    | -2.0933         | -2.2780       |
| 0.0445        | 0.26  | 4000  | 0.0271          | -0.0361        | -0.1230          | 0.6390             | 0.0869          | -236.2058      | -239.2251    | -2.0860         | -2.2705       |
| 0.0176        | 0.27  | 4100  | 0.0289          | -0.0496        | -0.1127          | 0.6470             | 0.0630          | -234.1423      | -241.9269    | -2.1377         | -2.3253       |
| 0.0244        | 0.27  | 4200  | 0.0293          | -0.0263        | -0.0989          | 0.6425             | 0.0726          | -231.3835      | -237.2554    | -2.1260         | -2.3122       |
| 0.0378        | 0.28  | 4300  | 0.0267          | 0.0038         | -0.0727          | 0.6440             | 0.0766          | -226.1550      | -231.2366    | -2.0843         | -2.2686       |
| 0.0135        | 0.29  | 4400  | 0.0273          | -0.0216        | -0.1048          | 0.6435             | 0.0832          | -232.5620      | -236.3245    | -2.0998         | -2.2857       |
| 0.0143        | 0.29  | 4500  | 0.0268          | -0.0302        | -0.1056          | 0.6375             | 0.0755          | -232.7406      | -238.0378    | -2.0988         | -2.2843       |
| 0.0268        | 0.3   | 4600  | 0.0270          | -0.0210        | -0.0904          | 0.6400             | 0.0694          | -229.6838      | -236.1970    | -2.1062         | -2.2917       |
| 0.026         | 0.31  | 4700  | 0.0272          | -0.0808        | -0.1659          | 0.6495             | 0.0851          | -244.7986      | -248.1638    | -2.1119         | -2.2987       |
| 0.0447        | 0.31  | 4800  | 0.0265          | -0.0634        | -0.1429          | 0.6465             | 0.0795          | -240.1879      | -244.6757    | -2.1131         | -2.2996       |
| 0.0311        | 0.32  | 4900  | 0.0269          | -0.0418        | -0.1304          | 0.6470             | 0.0886          | -237.6830      | -240.3570    | -2.0728         | -2.2562       |
| 0.0241        | 0.33  | 5000  | 0.0267          | -0.0626        | -0.1478          | 0.6425             | 0.0853          | -241.1806      | -244.5231    | -2.0852         | -2.2706       |
| 0.0183        | 0.33  | 5100  | 0.0266          | -0.0589        | -0.1374          | 0.6415             | 0.0785          | -239.0941      | -243.7824    | -2.0980         | -2.2840       |
| 0.0196        | 0.34  | 5200  | 0.0281          | -0.1203        | -0.1997          | 0.6440             | 0.0794          | -251.5512      | -256.0692    | -2.1151         | -2.3031       |
| 0.0218        | 0.35  | 5300  | 0.0284          | -0.0855        | -0.1675          | 0.6445             | 0.0821          | -245.1141      | -249.0969    | -2.1199         | -2.3078       |
| 0.0392        | 0.35  | 5400  | 0.0276          | -0.0211        | -0.0901          | 0.6320             | 0.0690          | -229.6360      | -236.2313    | -2.1202         | -2.3068       |
| 0.0095        | 0.36  | 5500  | 0.0278          | -0.0108        | -0.0838          | 0.6365             | 0.0729          | -228.3683      | -234.1733    | -2.1144         | -2.3003       |
| 0.0199        | 0.37  | 5600  | 0.0279          | -0.0468        | -0.1295          | 0.6430             | 0.0827          | -237.5136      | -241.3706    | -2.0764         | -2.2607       |
| 0.0237        | 0.37  | 5700  | 0.0267          | -0.0323        | -0.1215          | 0.6445             | 0.0891          | -235.9061      | -238.4741    | -2.0452         | -2.2280       |
| 0.0323        | 0.38  | 5800  | 0.0269          | -0.0412        | -0.1289          | 0.6460             | 0.0876          | -237.3893      | -240.2530    | -2.0370         | -2.2195       |
| 0.0242        | 0.39  | 5900  | 0.0260          | -0.0303        | -0.1115          | 0.6455             | 0.0812          | -233.9047      | -238.0558    | -2.0427         | -2.2254       |
| 0.0239        | 0.39  | 6000  | 0.0265          | -0.0064        | -0.0780          | 0.6395             | 0.0716          | -227.2050      | -233.2807    | -2.0840         | -2.2698       |
| 0.0246        | 0.4   | 6100  | 0.0266          | -0.0466        | -0.1195          | 0.6475             | 0.0728          | -235.5066      | -241.3312    | -2.0964         | -2.2834       |
| 0.0109        | 0.41  | 6200  | 0.0259          | -0.0380        | -0.1166          | 0.6420             | 0.0786          | -234.9308      | -239.6033    | -2.0589         | -2.2443       |
| 0.0289        | 0.41  | 6300  | 0.0258          | -0.0286        | -0.1078          | 0.6525             | 0.0791          | -233.1673      | -237.7339    | -2.0557         | -2.2405       |
| 0.0287        | 0.42  | 6400  | 0.0267          | -0.0259        | -0.1155          | 0.6430             | 0.0896          | -234.7208      | -237.1919    | -2.0664         | -2.2525       |
| 0.0631        | 0.43  | 6500  | 0.0259          | -0.0313        | -0.1091          | 0.6460             | 0.0778          | -233.4391      | -238.2719    | -2.0895         | -2.2759       |
| 0.037         | 0.43  | 6600  | 0.0260          | -0.0094        | -0.0871          | 0.6490             | 0.0777          | -229.0337      | -233.8820    | -2.0997         | -2.2868       |
| 0.0296        | 0.44  | 6700  | 0.0264          | -0.0446        | -0.1288          | 0.6565             | 0.0842          | -237.3631      | -240.9244    | -2.1026         | -2.2903       |
| 0.038         | 0.44  | 6800  | 0.0262          | -0.0694        | -0.1493          | 0.6565             | 0.0799          | -241.4658      | -245.8865    | -2.0871         | -2.2739       |
| 0.0458        | 0.45  | 6900  | 0.0261          | -0.0352        | -0.1124          | 0.6525             | 0.0772          | -234.0974      | -239.0529    | -2.0925         | -2.2798       |
| 0.0275        | 0.46  | 7000  | 0.0257          | -0.0520        | -0.1401          | 0.6535             | 0.0881          | -239.6416      | -242.4081    | -2.0897         | -2.2774       |
| 0.0175        | 0.46  | 7100  | 0.0255          | -0.0397        | -0.1193          | 0.6530             | 0.0795          | -235.4656      | -239.9513    | -2.1058         | -2.2933       |
| 0.035         | 0.47  | 7200  | 0.0260          | -0.0543        | -0.1267          | 0.6485             | 0.0724          | -236.9568      | -242.8715    | -2.1193         | -2.3083       |
| 0.015         | 0.48  | 7300  | 0.0257          | -0.0871        | -0.1622          | 0.6390             | 0.0751          | -244.0609      | -249.4324    | -2.1123         | -2.3009       |
| 0.0231        | 0.48  | 7400  | 0.0255          | -0.0659        | -0.1463          | 0.6490             | 0.0804          | -240.8683      | -245.1848    | -2.1035         | -2.2913       |
| 0.0211        | 0.49  | 7500  | 0.0258          | -0.0631        | -0.1462          | 0.6520             | 0.0831          | -240.8420      | -244.6235    | -2.0635         | -2.2485       |
| 0.0379        | 0.5   | 7600  | 0.0259          | -0.0748        | -0.1597          | 0.6475             | 0.0849          | -243.5423      | -246.9550    | -2.0566         | -2.2404       |
| 0.0117        | 0.5   | 7700  | 0.0257          | -0.0554        | -0.1408          | 0.6620             | 0.0854          | -239.7720      | -243.0760    | -2.0661         | -2.2502       |
| 0.0197        | 0.51  | 7800  | 0.0261          | -0.0680        | -0.1537          | 0.6590             | 0.0857          | -242.3484      | -245.6013    | -2.0867         | -2.2723       |
| 0.0296        | 0.52  | 7900  | 0.0253          | -0.0680        | -0.1488          | 0.6555             | 0.0808          | -241.3649      | -245.6047    | -2.0900         | -2.2762       |
| 0.0385        | 0.52  | 8000  | 0.0251          | -0.0474        | -0.1297          | 0.6500             | 0.0823          | -237.5529      | -241.4889    | -2.0737         | -2.2589       |
| 0.0295        | 0.53  | 8100  | 0.0249          | -0.0725        | -0.1568          | 0.6590             | 0.0842          | -242.9643      | -246.5116    | -2.0447         | -2.2293       |
| 0.0147        | 0.54  | 8200  | 0.0250          | -0.0814        | -0.1636          | 0.6455             | 0.0822          | -244.3407      | -248.2939    | -2.0459         | -2.2301       |
| 0.0166        | 0.54  | 8300  | 0.0254          | -0.0635        | -0.1415          | 0.6535             | 0.0780          | -239.9138      | -244.7009    | -2.0618         | -2.2466       |
| 0.0177        | 0.55  | 8400  | 0.0260          | -0.0569        | -0.1258          | 0.6505             | 0.0689          | -236.7758      | -243.3866    | -2.0623         | -2.2464       |
| 0.0323        | 0.56  | 8500  | 0.0247          | -0.0606        | -0.1478          | 0.6590             | 0.0872          | -241.1788      | -244.1342    | -2.0510         | -2.2352       |
| 0.0178        | 0.56  | 8600  | 0.0245          | -0.0697        | -0.1572          | 0.6610             | 0.0875          | -243.0600      | -245.9448    | -2.0607         | -2.2454       |
| 0.0473        | 0.57  | 8700  | 0.0247          | -0.0695        | -0.1535          | 0.6565             | 0.0840          | -242.3023      | -245.9043    | -2.0663         | -2.2518       |
| 0.0302        | 0.58  | 8800  | 0.0249          | -0.0482        | -0.1318          | 0.6610             | 0.0837          | -237.9781      | -241.6350    | -2.0593         | -2.2448       |
| 0.0391        | 0.58  | 8900  | 0.0248          | -0.0637        | -0.1548          | 0.6620             | 0.0911          | -242.5767      | -244.7529    | -2.0658         | -2.2522       |
| 0.0377        | 0.59  | 9000  | 0.0246          | -0.0355        | -0.1189          | 0.6575             | 0.0834          | -235.3853      | -239.0974    | -2.0745         | -2.2613       |
| 0.0296        | 0.6   | 9100  | 0.0249          | -0.0387        | -0.1166          | 0.6550             | 0.0779          | -234.9412      | -239.7537    | -2.0871         | -2.2747       |
| 0.0241        | 0.6   | 9200  | 0.0252          | -0.0358        | -0.1111          | 0.6575             | 0.0753          | -233.8348      | -239.1661    | -2.1060         | -2.2943       |
| 0.019         | 0.61  | 9300  | 0.0250          | -0.0516        | -0.1373          | 0.6580             | 0.0858          | -239.0793      | -242.3174    | -2.1004         | -2.2889       |
| 0.0247        | 0.62  | 9400  | 0.0251          | -0.0712        | -0.1504          | 0.6545             | 0.0792          | -241.6835      | -246.2362    | -2.1041         | -2.2926       |
| 0.0161        | 0.62  | 9500  | 0.0249          | -0.0518        | -0.1338          | 0.6485             | 0.0820          | -238.3770      | -242.3746    | -2.0949         | -2.2827       |
| 0.0198        | 0.63  | 9600  | 0.0250          | -0.0282        | -0.1124          | 0.6500             | 0.0842          | -234.0898      | -237.6352    | -2.0913         | -2.2787       |
| 0.0368        | 0.63  | 9700  | 0.0248          | -0.0568        | -0.1405          | 0.6585             | 0.0836          | -239.7049      | -243.3711    | -2.0914         | -2.2787       |
| 0.0214        | 0.64  | 9800  | 0.0248          | -0.0559        | -0.1371          | 0.6570             | 0.0811          | -239.0298      | -243.1945    | -2.0971         | -2.2844       |
| 0.0331        | 0.65  | 9900  | 0.0246          | -0.0441        | -0.1329          | 0.6600             | 0.0888          | -238.1875      | -240.8263    | -2.0867         | -2.2732       |
| 0.0316        | 0.65  | 10000 | 0.0246          | -0.0573        | -0.1474          | 0.6580             | 0.0901          | -241.0922      | -243.4642    | -2.0770         | -2.2634       |
| 0.0181        | 0.66  | 10100 | 0.0248          | -0.0757        | -0.1612          | 0.6670             | 0.0855          | -243.8461      | -247.1387    | -2.0801         | -2.2661       |
| 0.0159        | 0.67  | 10200 | 0.0245          | -0.0638        | -0.1550          | 0.6610             | 0.0912          | -242.6056      | -244.7626    | -2.0611         | -2.2463       |
| 0.018         | 0.67  | 10300 | 0.0244          | -0.0590        | -0.1447          | 0.6615             | 0.0857          | -240.5506      | -243.8084    | -2.0698         | -2.2554       |
| 0.0144        | 0.68  | 10400 | 0.0245          | -0.0385        | -0.1258          | 0.6605             | 0.0873          | -236.7707      | -239.7064    | -2.0630         | -2.2489       |
| 0.0273        | 0.69  | 10500 | 0.0244          | -0.0431        | -0.1273          | 0.6565             | 0.0842          | -237.0745      | -240.6274    | -2.0678         | -2.2537       |
| 0.0194        | 0.69  | 10600 | 0.0243          | -0.0430        | -0.1273          | 0.6635             | 0.0843          | -237.0673      | -240.6028    | -2.0684         | -2.2543       |
| 0.0199        | 0.7   | 10700 | 0.0244          | -0.0439        | -0.1259          | 0.6595             | 0.0820          | -236.7907      | -240.7807    | -2.0696         | -2.2556       |
| 0.0349        | 0.71  | 10800 | 0.0245          | -0.0394        | -0.1225          | 0.6585             | 0.0832          | -236.1209      | -239.8839    | -2.0673         | -2.2533       |
| 0.0294        | 0.71  | 10900 | 0.0246          | -0.0459        | -0.1264          | 0.6615             | 0.0805          | -236.8899      | -241.1904    | -2.0696         | -2.2554       |
| 0.0493        | 0.72  | 11000 | 0.0247          | -0.0406        | -0.1196          | 0.6555             | 0.0789          | -235.5289      | -240.1349    | -2.0714         | -2.2571       |
| 0.0186        | 0.73  | 11100 | 0.0246          | -0.0362        | -0.1179          | 0.6605             | 0.0817          | -235.1986      | -239.2465    | -2.0741         | -2.2600       |
| 0.0233        | 0.73  | 11200 | 0.0247          | -0.0275        | -0.1085          | 0.6585             | 0.0810          | -233.3055      | -237.5009    | -2.0750         | -2.2610       |
| 0.0218        | 0.74  | 11300 | 0.0244          | -0.0370        | -0.1200          | 0.6575             | 0.0831          | -235.6197      | -239.4001    | -2.0764         | -2.2629       |
| 0.0365        | 0.75  | 11400 | 0.0245          | -0.0355        | -0.1223          | 0.6580             | 0.0868          | -236.0721      | -239.1116    | -2.0719         | -2.2584       |
| 0.0199        | 0.75  | 11500 | 0.0246          | -0.0318        | -0.1118          | 0.6590             | 0.0800          | -233.9702      | -238.3574    | -2.0827         | -2.2695       |
| 0.0296        | 0.76  | 11600 | 0.0244          | -0.0421        | -0.1299          | 0.6665             | 0.0878          | -237.5938      | -240.4171    | -2.0765         | -2.2633       |
| 0.015         | 0.77  | 11700 | 0.0244          | -0.0487        | -0.1316          | 0.6600             | 0.0829          | -237.9386      | -241.7478    | -2.0779         | -2.2644       |
| 0.0127        | 0.77  | 11800 | 0.0244          | -0.0598        | -0.1424          | 0.6580             | 0.0826          | -240.0971      | -243.9701    | -2.0787         | -2.2653       |
| 0.0199        | 0.78  | 11900 | 0.0243          | -0.0591        | -0.1450          | 0.6605             | 0.0859          | -240.6168      | -243.8326    | -2.0758         | -2.2626       |
| 0.0313        | 0.79  | 12000 | 0.0244          | -0.0559        | -0.1424          | 0.6605             | 0.0865          | -240.0914      | -243.1773    | -2.0797         | -2.2669       |
| 0.0102        | 0.79  | 12100 | 0.0244          | -0.0513        | -0.1355          | 0.6560             | 0.0842          | -238.7046      | -242.2641    | -2.0830         | -2.2705       |
| 0.0325        | 0.8   | 12200 | 0.0243          | -0.0456        | -0.1291          | 0.6600             | 0.0835          | -237.4338      | -241.1291    | -2.0835         | -2.2709       |
| 0.028         | 0.8   | 12300 | 0.0243          | -0.0493        | -0.1364          | 0.6585             | 0.0872          | -238.8947      | -241.8556    | -2.0821         | -2.2695       |
| 0.0278        | 0.81  | 12400 | 0.0241          | -0.0510        | -0.1343          | 0.6600             | 0.0833          | -238.4753      | -242.2064    | -2.0913         | -2.2793       |
| 0.0142        | 0.82  | 12500 | 0.0241          | -0.0540        | -0.1371          | 0.6570             | 0.0831          | -239.0412      | -242.8141    | -2.0913         | -2.2793       |
| 0.0177        | 0.82  | 12600 | 0.0242          | -0.0556        | -0.1379          | 0.6580             | 0.0823          | -239.1902      | -243.1229    | -2.0917         | -2.2797       |
| 0.0133        | 0.83  | 12700 | 0.0242          | -0.0496        | -0.1314          | 0.6575             | 0.0819          | -237.8956      | -241.9153    | -2.0933         | -2.2814       |
| 0.0186        | 0.84  | 12800 | 0.0242          | -0.0451        | -0.1272          | 0.6565             | 0.0822          | -237.0618      | -241.0176    | -2.0936         | -2.2818       |
| 0.0117        | 0.84  | 12900 | 0.0241          | -0.0397        | -0.1232          | 0.6580             | 0.0835          | -236.2435      | -239.9395    | -2.0908         | -2.2790       |
| 0.0116        | 0.85  | 13000 | 0.0241          | -0.0419        | -0.1272          | 0.6580             | 0.0853          | -237.0613      | -240.3864    | -2.0899         | -2.2781       |
| 0.0338        | 0.86  | 13100 | 0.0241          | -0.0404        | -0.1232          | 0.6565             | 0.0828          | -236.2545      | -240.0884    | -2.0941         | -2.2824       |
| 0.0206        | 0.86  | 13200 | 0.0240          | -0.0429        | -0.1280          | 0.6590             | 0.0851          | -237.2177      | -240.5875    | -2.0892         | -2.2772       |
| 0.018         | 0.87  | 13300 | 0.0240          | -0.0407        | -0.1257          | 0.6600             | 0.0851          | -236.7596      | -240.1422    | -2.0891         | -2.2772       |
| 0.0275        | 0.88  | 13400 | 0.0240          | -0.0392        | -0.1234          | 0.6585             | 0.0842          | -236.2926      | -239.8449    | -2.0904         | -2.2786       |
| 0.0177        | 0.88  | 13500 | 0.0240          | -0.0369        | -0.1201          | 0.6580             | 0.0832          | -235.6223      | -239.3825    | -2.0911         | -2.2792       |
| 0.0225        | 0.89  | 13600 | 0.0240          | -0.0405        | -0.1255          | 0.6580             | 0.0850          | -236.7200      | -240.1148    | -2.0913         | -2.2794       |
| 0.0223        | 0.9   | 13700 | 0.0240          | -0.0422        | -0.1268          | 0.6595             | 0.0846          | -236.9746      | -240.4513    | -2.0923         | -2.2803       |
| 0.0302        | 0.9   | 13800 | 0.0240          | -0.0416        | -0.1272          | 0.6575             | 0.0857          | -237.0577      | -240.3201    | -2.0900         | -2.2780       |
| 0.0213        | 0.91  | 13900 | 0.0239          | -0.0407        | -0.1267          | 0.6605             | 0.0859          | -236.9426      | -240.1542    | -2.0888         | -2.2767       |
| 0.0221        | 0.92  | 14000 | 0.0239          | -0.0425        | -0.1287          | 0.6595             | 0.0862          | -237.3506      | -240.4969    | -2.0892         | -2.2772       |
| 0.0259        | 0.92  | 14100 | 0.0239          | -0.0411        | -0.1266          | 0.6585             | 0.0855          | -236.9374      | -240.2254    | -2.0892         | -2.2772       |
| 0.0156        | 0.93  | 14200 | 0.0239          | -0.0419        | -0.1278          | 0.6615             | 0.0859          | -237.1707      | -240.3793    | -2.0891         | -2.2771       |
| 0.0158        | 0.94  | 14300 | 0.0239          | -0.0414        | -0.1269          | 0.6600             | 0.0855          | -237.0012      | -240.2890    | -2.0887         | -2.2765       |
| 0.0216        | 0.94  | 14400 | 0.0239          | -0.0413        | -0.1268          | 0.6620             | 0.0856          | -236.9817      | -240.2556    | -2.0895         | -2.2774       |
| 0.0126        | 0.95  | 14500 | 0.0239          | -0.0413        | -0.1269          | 0.6605             | 0.0856          | -237.0005      | -240.2699    | -2.0895         | -2.2774       |
| 0.0346        | 0.96  | 14600 | 0.0239          | -0.0416        | -0.1269          | 0.6590             | 0.0853          | -236.9897      | -240.3241    | -2.0901         | -2.2781       |
| 0.0225        | 0.96  | 14700 | 0.0239          | -0.0415        | -0.1267          | 0.6605             | 0.0852          | -236.9473      | -240.3016    | -2.0895         | -2.2774       |
| 0.0099        | 0.97  | 14800 | 0.0239          | -0.0415        | -0.1268          | 0.6595             | 0.0853          | -236.9750      | -240.3092    | -2.0891         | -2.2771       |
| 0.0235        | 0.97  | 14900 | 0.0239          | -0.0415        | -0.1268          | 0.6585             | 0.0853          | -236.9760      | -240.2991    | -2.0898         | -2.2777       |
| 0.019         | 0.98  | 15000 | 0.0239          | -0.0415        | -0.1267          | 0.6610             | 0.0852          | -236.9527      | -240.3060    | -2.0899         | -2.2778       |
| 0.0368        | 0.99  | 15100 | 0.0239          | -0.0415        | -0.1267          | 0.6605             | 0.0852          | -236.9458      | -240.2961    | -2.0904         | -2.2784       |
| 0.0267        | 0.99  | 15200 | 0.0239          | -0.0414        | -0.1265          | 0.6580             | 0.0851          | -236.9213      | -240.2912    | -2.0899         | -2.2778       |


### Framework versions

- PEFT 0.7.1
- Transformers 4.36.2
- Pytorch 2.1.2+cu121
- Datasets 2.14.6
- Tokenizers 0.15.2