update model card README.md
README.md
ADDED
@@ -0,0 +1,385 @@
---
license: apache-2.0
tags:
- generated_from_trainer
model-index:
- name: codet5-small-custom-functions-dataset-python
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# codet5-small-custom-functions-dataset-python

This model is a fine-tuned version of [Salesforce/codet5-small](https://huggingface.co/Salesforce/codet5-small) on an unspecified dataset (the Trainer recorded the dataset name as `None`).
It achieves the following results on the evaluation set:
- Loss: 0.2103
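
As a rough aid to reading this number, the loss can be converted to a perplexity. The sketch below assumes the reported value is a mean per-token cross-entropy in nats; the card does not state how the loss is reduced, so treat the result as indicative only.

```python
import math

# Final evaluation loss reported in this card.
eval_loss = 0.2103

# Assumption: the loss is a mean per-token cross-entropy measured in nats,
# in which case perplexity is exp(loss).
perplexity = math.exp(eval_loss)

print(f"perplexity ~ {perplexity:.3f}")
```

Under that assumption the perplexity comes out at roughly 1.23.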

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10
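
To make the `linear` scheduler setting concrete, here is a minimal sketch of how the learning rate would decay over the run. It is not the Trainer's own code. The helper `linear_lr` is a hypothetical name, the total of 330 steps is taken from the last row of the training log, and the zero warmup is an assumption (the card lists no warmup setting).

```python
def linear_lr(step, base_lr=2e-5, total_steps=330, warmup_steps=0):
    """Learning rate after `step` optimizer updates under linear warmup and decay.

    Hypothetical helper for illustration. base_lr comes from the card,
    total_steps from the training log, and warmup_steps=0 is an assumption.
    """
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Linear decay from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Learning rate at the start, midpoint, and end of the run.
for step in (0, 165, 330):
    print(step, linear_lr(step))
```

With these assumptions the rate starts at 2e-05, is halved by step 165, and reaches zero at step 330, which may help explain why the validation loss flattens out in the final epochs.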

### Training results

| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 6.8821 | 0.03 | 1 | 4.9003 |
| 5.1641 | 0.06 | 2 | 4.1876 |
| 4.5747 | 0.09 | 3 | 3.5772 |
| 3.985 | 0.12 | 4 | 3.0527 |
| 4.0255 | 0.15 | 5 | 2.5962 |
| 3.1963 | 0.18 | 6 | 2.2589 |
| 3.01 | 0.21 | 7 | 1.9755 |
| 2.5837 | 0.24 | 8 | 1.7736 |
| 2.6645 | 0.27 | 9 | 1.6032 |
| 1.8825 | 0.3 | 10 | 1.4620 |
| 2.282 | 0.33 | 11 | 1.3621 |
| 1.9555 | 0.36 | 12 | 1.2926 |
| 2.0374 | 0.39 | 13 | 1.2261 |
| 1.6276 | 0.42 | 14 | 1.1631 |
| 1.937 | 0.45 | 15 | 1.1053 |
| 1.4738 | 0.48 | 16 | 1.0512 |
| 1.5335 | 0.52 | 17 | 1.0016 |
| 1.5224 | 0.55 | 18 | 0.9554 |
| 1.5048 | 0.58 | 19 | 0.9175 |
| 1.3983 | 0.61 | 20 | 0.8806 |
| 1.2506 | 0.64 | 21 | 0.8495 |
| 1.186 | 0.67 | 22 | 0.8243 |
| 1.1824 | 0.7 | 23 | 0.7988 |
| 1.29 | 0.73 | 24 | 0.7728 |
| 1.159 | 0.76 | 25 | 0.7468 |
| 0.9893 | 0.79 | 26 | 0.7193 |
| 1.2054 | 0.82 | 27 | 0.7013 |
| 1.0004 | 0.85 | 28 | 0.6850 |
| 0.7918 | 0.88 | 29 | 0.6704 |
| 1.0357 | 0.91 | 30 | 0.6570 |
| 1.0648 | 0.94 | 31 | 0.6452 |
| 1.0679 | 0.97 | 32 | 0.6336 |
| 0.9296 | 1.0 | 33 | 0.6227 |
| 0.8459 | 1.03 | 34 | 0.6123 |
| 0.8312 | 1.06 | 35 | 0.6000 |
| 0.9367 | 1.09 | 36 | 0.5844 |
| 0.8813 | 1.12 | 37 | 0.5724 |
| 0.9134 | 1.15 | 38 | 0.5608 |
| 0.6967 | 1.18 | 39 | 0.5509 |
| 0.8654 | 1.21 | 40 | 0.5416 |
| 0.784 | 1.24 | 41 | 0.5324 |
| 0.7623 | 1.27 | 42 | 0.5237 |
| 0.739 | 1.3 | 43 | 0.5145 |
| 0.8273 | 1.33 | 44 | 0.5064 |
| 0.7384 | 1.36 | 45 | 0.4968 |
| 0.6936 | 1.39 | 46 | 0.4882 |
| 0.7078 | 1.42 | 47 | 0.4807 |
| 0.6214 | 1.45 | 48 | 0.4740 |
| 0.6983 | 1.48 | 49 | 0.4662 |
| 0.6328 | 1.52 | 50 | 0.4588 |
| 0.663 | 1.55 | 51 | 0.4533 |
| 0.6518 | 1.58 | 52 | 0.4476 |
| 0.5782 | 1.61 | 53 | 0.4343 |
| 0.6361 | 1.64 | 54 | 0.4296 |
| 0.5804 | 1.67 | 55 | 0.4249 |
| 0.6557 | 1.7 | 56 | 0.4210 |
| 0.6801 | 1.73 | 57 | 0.4173 |
| 0.6682 | 1.76 | 58 | 0.4132 |
| 0.6346 | 1.79 | 59 | 0.4090 |
| 0.6421 | 1.82 | 60 | 0.4028 |
| 0.6318 | 1.85 | 61 | 0.3969 |
| 0.6914 | 1.88 | 62 | 0.3942 |
| 0.5953 | 1.91 | 63 | 0.3920 |
| 0.7016 | 1.94 | 64 | 0.3894 |
| 0.5728 | 1.97 | 65 | 0.3839 |
| 0.5417 | 2.0 | 66 | 0.3738 |
| 0.5502 | 2.03 | 67 | 0.3705 |
| 0.5167 | 2.06 | 68 | 0.3668 |
| 0.6452 | 2.09 | 69 | 0.3629 |
| 0.4713 | 2.12 | 70 | 0.3583 |
| 0.5239 | 2.15 | 71 | 0.3553 |
| 0.6125 | 2.18 | 72 | 0.3527 |
| 0.4548 | 2.21 | 73 | 0.3414 |
| 0.5705 | 2.24 | 74 | 0.3389 |
| 0.4912 | 2.27 | 75 | 0.3374 |
| 0.4566 | 2.3 | 76 | 0.3316 |
| 0.5642 | 2.33 | 77 | 0.3288 |
| 0.4212 | 2.36 | 78 | 0.3260 |
| 0.3808 | 2.39 | 79 | 0.3236 |
| 0.4833 | 2.42 | 80 | 0.3214 |
| 0.4775 | 2.45 | 81 | 0.3193 |
| 0.5598 | 2.48 | 82 | 0.3175 |
| 0.5144 | 2.52 | 83 | 0.3162 |
| 0.4554 | 2.55 | 84 | 0.3152 |
| 0.4811 | 2.58 | 85 | 0.3141 |
| 0.4545 | 2.61 | 86 | 0.3130 |
| 0.438 | 2.64 | 87 | 0.3117 |
| 0.4071 | 2.67 | 88 | 0.3104 |
| 0.4635 | 2.7 | 89 | 0.3090 |
| 0.5118 | 2.73 | 90 | 0.3077 |
| 0.4043 | 2.76 | 91 | 0.3059 |
| 0.4675 | 2.79 | 92 | 0.3044 |
| 0.4551 | 2.82 | 93 | 0.3021 |
| 0.497 | 2.85 | 94 | 0.2987 |
| 0.4334 | 2.88 | 95 | 0.2932 |
| 0.4087 | 2.91 | 96 | 0.2901 |
| 0.477 | 2.94 | 97 | 0.2888 |
| 0.4834 | 2.97 | 98 | 0.2871 |
| 0.4513 | 3.0 | 99 | 0.2856 |
| 0.4172 | 3.03 | 100 | 0.2845 |
| 0.3827 | 3.06 | 101 | 0.2837 |
| 0.3851 | 3.09 | 102 | 0.2830 |
| 0.3976 | 3.12 | 103 | 0.2823 |
| 0.4909 | 3.15 | 104 | 0.2833 |
| 0.5409 | 3.18 | 105 | 0.2830 |
| 0.4039 | 3.21 | 106 | 0.2808 |
| 0.4057 | 3.24 | 107 | 0.2789 |
| 0.4214 | 3.27 | 108 | 0.2779 |
| 0.4209 | 3.3 | 109 | 0.2768 |
| 0.5044 | 3.33 | 110 | 0.2759 |
| 0.3457 | 3.36 | 111 | 0.2750 |
| 0.394 | 3.39 | 112 | 0.2744 |
| 0.4008 | 3.42 | 113 | 0.2739 |
| 0.3837 | 3.45 | 114 | 0.2736 |
| 0.3843 | 3.48 | 115 | 0.2734 |
| 0.4458 | 3.52 | 116 | 0.2730 |
| 0.4417 | 3.55 | 117 | 0.2725 |
| 0.4274 | 3.58 | 118 | 0.2719 |
| 0.4129 | 3.61 | 119 | 0.2712 |
| 0.421 | 3.64 | 120 | 0.2702 |
| 0.3625 | 3.67 | 121 | 0.2692 |
| 0.3785 | 3.7 | 122 | 0.2683 |
| 0.4023 | 3.73 | 123 | 0.2671 |
| 0.416 | 3.76 | 124 | 0.2663 |
| 0.3661 | 3.79 | 125 | 0.2654 |
| 0.373 | 3.82 | 126 | 0.2647 |
| 0.4045 | 3.85 | 127 | 0.2640 |
| 0.3955 | 3.88 | 128 | 0.2633 |
| 0.3796 | 3.91 | 129 | 0.2627 |
| 0.3682 | 3.94 | 130 | 0.2621 |
| 0.4195 | 3.97 | 131 | 0.2614 |
| 0.4135 | 4.0 | 132 | 0.2609 |
| 0.3244 | 4.03 | 133 | 0.2601 |
| 0.411 | 4.06 | 134 | 0.2597 |
| 0.4019 | 4.09 | 135 | 0.2599 |
| 0.451 | 4.12 | 136 | 0.2592 |
| 0.3948 | 4.15 | 137 | 0.2584 |
| 0.3375 | 4.18 | 138 | 0.2577 |
| 0.3687 | 4.21 | 139 | 0.2567 |
| 0.3946 | 4.24 | 140 | 0.2557 |
| 0.4181 | 4.27 | 141 | 0.2547 |
| 0.2949 | 4.3 | 142 | 0.2540 |
| 0.3621 | 4.33 | 143 | 0.2530 |
| 0.4134 | 4.36 | 144 | 0.2523 |
| 0.3366 | 4.39 | 145 | 0.2516 |
| 0.3798 | 4.42 | 146 | 0.2510 |
| 0.3519 | 4.45 | 147 | 0.2505 |
| 0.2999 | 4.48 | 148 | 0.2501 |
| 0.4096 | 4.52 | 149 | 0.2495 |
| 0.4736 | 4.55 | 150 | 0.2485 |
| 0.3481 | 4.58 | 151 | 0.2481 |
| 0.3683 | 4.61 | 152 | 0.2479 |
| 0.325 | 4.64 | 153 | 0.2476 |
| 0.3746 | 4.67 | 154 | 0.2473 |
| 0.3394 | 4.7 | 155 | 0.2468 |
| 0.3653 | 4.73 | 156 | 0.2463 |
| 0.3222 | 4.76 | 157 | 0.2458 |
| 0.3496 | 4.79 | 158 | 0.2453 |
| 0.368 | 4.82 | 159 | 0.2450 |
| 0.3473 | 4.85 | 160 | 0.2447 |
| 0.3712 | 4.88 | 161 | 0.2445 |
| 0.3542 | 4.91 | 162 | 0.2443 |
| 0.3249 | 4.94 | 163 | 0.2436 |
| 0.3135 | 4.97 | 164 | 0.2431 |
| 0.3603 | 5.0 | 165 | 0.2427 |
| 0.3345 | 5.03 | 166 | 0.2424 |
| 0.3385 | 5.06 | 167 | 0.2428 |
| 0.3939 | 5.09 | 168 | 0.2422 |
| 0.334 | 5.12 | 169 | 0.2414 |
| 0.3482 | 5.15 | 170 | 0.2401 |
| 0.3323 | 5.18 | 171 | 0.2396 |
| 0.3603 | 5.21 | 172 | 0.2391 |
| 0.354 | 5.24 | 173 | 0.2385 |
| 0.3241 | 5.27 | 174 | 0.2379 |
| 0.4134 | 5.3 | 175 | 0.2373 |
| 0.3726 | 5.33 | 176 | 0.2369 |
| 0.2997 | 5.36 | 177 | 0.2364 |
| 0.3317 | 5.39 | 178 | 0.2360 |
| 0.3692 | 5.42 | 179 | 0.2356 |
| 0.3411 | 5.45 | 180 | 0.2347 |
| 0.274 | 5.48 | 181 | 0.2342 |
| 0.3714 | 5.52 | 182 | 0.2337 |
| 0.442 | 5.55 | 183 | 0.2332 |
| 0.3262 | 5.58 | 184 | 0.2327 |
| 0.2929 | 5.61 | 185 | 0.2323 |
| 0.3435 | 5.64 | 186 | 0.2315 |
| 0.3921 | 5.67 | 187 | 0.2311 |
| 0.3609 | 5.7 | 188 | 0.2306 |
| 0.3585 | 5.73 | 189 | 0.2302 |
| 0.3323 | 5.76 | 190 | 0.2298 |
| 0.3205 | 5.79 | 191 | 0.2295 |
| 0.3407 | 5.82 | 192 | 0.2293 |
| 0.3109 | 5.85 | 193 | 0.2290 |
| 0.3075 | 5.88 | 194 | 0.2287 |
| 0.3538 | 5.91 | 195 | 0.2285 |
| 0.2968 | 5.94 | 196 | 0.2283 |
| 0.34 | 5.97 | 197 | 0.2281 |
| 0.3608 | 6.0 | 198 | 0.2279 |
| 0.2768 | 6.03 | 199 | 0.2277 |
| 0.3783 | 6.06 | 200 | 0.2275 |
| 0.3024 | 6.09 | 201 | 0.2272 |
| 0.3221 | 6.12 | 202 | 0.2269 |
| 0.3432 | 6.15 | 203 | 0.2266 |
| 0.3497 | 6.18 | 204 | 0.2264 |
| 0.3174 | 6.21 | 205 | 0.2261 |
| 0.3034 | 6.24 | 206 | 0.2259 |
| 0.3035 | 6.27 | 207 | 0.2257 |
| 0.3185 | 6.3 | 208 | 0.2255 |
| 0.3851 | 6.33 | 209 | 0.2252 |
| 0.3612 | 6.36 | 210 | 0.2249 |
| 0.2838 | 6.39 | 211 | 0.2247 |
| 0.3452 | 6.42 | 212 | 0.2245 |
| 0.3358 | 6.45 | 213 | 0.2243 |
| 0.3181 | 6.48 | 214 | 0.2241 |
| 0.329 | 6.52 | 215 | 0.2240 |
| 0.2819 | 6.55 | 216 | 0.2238 |
| 0.3283 | 6.58 | 217 | 0.2237 |
| 0.2752 | 6.61 | 218 | 0.2235 |
| 0.3194 | 6.64 | 219 | 0.2233 |
| 0.2981 | 6.67 | 220 | 0.2230 |
| 0.2954 | 6.7 | 221 | 0.2229 |
| 0.2762 | 6.73 | 222 | 0.2228 |
| 0.3206 | 6.76 | 223 | 0.2223 |
| 0.3017 | 6.79 | 224 | 0.2221 |
| 0.3219 | 6.82 | 225 | 0.2219 |
| 0.2929 | 6.85 | 226 | 0.2215 |
| 0.3576 | 6.88 | 227 | 0.2212 |
| 0.2712 | 6.91 | 228 | 0.2210 |
| 0.2682 | 6.94 | 229 | 0.2207 |
| 0.3412 | 6.97 | 230 | 0.2205 |
| 0.3136 | 7.0 | 231 | 0.2203 |
| 0.3161 | 7.03 | 232 | 0.2200 |
| 0.2902 | 7.06 | 233 | 0.2197 |
| 0.3053 | 7.09 | 234 | 0.2194 |
| 0.3182 | 7.12 | 235 | 0.2190 |
| 0.2752 | 7.15 | 236 | 0.2186 |
| 0.262 | 7.18 | 237 | 0.2182 |
| 0.2783 | 7.21 | 238 | 0.2178 |
| 0.2795 | 7.24 | 239 | 0.2174 |
| 0.2964 | 7.27 | 240 | 0.2171 |
| 0.2737 | 7.3 | 241 | 0.2167 |
| 0.3377 | 7.33 | 242 | 0.2164 |
| 0.2579 | 7.36 | 243 | 0.2161 |
| 0.3015 | 7.39 | 244 | 0.2158 |
| 0.2525 | 7.42 | 245 | 0.2156 |
| 0.3187 | 7.45 | 246 | 0.2154 |
| 0.2628 | 7.48 | 247 | 0.2152 |
| 0.3267 | 7.52 | 248 | 0.2151 |
| 0.2718 | 7.55 | 249 | 0.2149 |
| 0.3153 | 7.58 | 250 | 0.2148 |
| 0.3555 | 7.61 | 251 | 0.2146 |
| 0.2921 | 7.64 | 252 | 0.2145 |
| 0.3538 | 7.67 | 253 | 0.2143 |
| 0.3197 | 7.7 | 254 | 0.2143 |
| 0.3745 | 7.73 | 255 | 0.2141 |
| 0.2762 | 7.76 | 256 | 0.2140 |
| 0.3053 | 7.79 | 257 | 0.2139 |
| 0.3357 | 7.82 | 258 | 0.2137 |
| 0.3105 | 7.85 | 259 | 0.2136 |
| 0.3287 | 7.88 | 260 | 0.2134 |
| 0.3194 | 7.91 | 261 | 0.2133 |
| 0.3151 | 7.94 | 262 | 0.2131 |
| 0.2784 | 7.97 | 263 | 0.2130 |
| 0.2946 | 8.0 | 264 | 0.2128 |
| 0.2804 | 8.03 | 265 | 0.2127 |
| 0.2549 | 8.06 | 266 | 0.2126 |
| 0.3115 | 8.09 | 267 | 0.2125 |
| 0.3675 | 8.12 | 268 | 0.2123 |
| 0.2582 | 8.15 | 269 | 0.2122 |
| 0.2974 | 8.18 | 270 | 0.2121 |
| 0.2885 | 8.21 | 271 | 0.2120 |
| 0.2962 | 8.24 | 272 | 0.2120 |
| 0.3726 | 8.27 | 273 | 0.2119 |
| 0.2631 | 8.3 | 274 | 0.2119 |
| 0.3114 | 8.33 | 275 | 0.2120 |
| 0.3445 | 8.36 | 276 | 0.2120 |
| 0.2782 | 8.39 | 277 | 0.2121 |
| 0.3429 | 8.42 | 278 | 0.2121 |
| 0.2533 | 8.45 | 279 | 0.2121 |
| 0.2858 | 8.48 | 280 | 0.2121 |
| 0.2815 | 8.52 | 281 | 0.2122 |
| 0.3285 | 8.55 | 282 | 0.2123 |
| 0.3484 | 8.58 | 283 | 0.2124 |
| 0.2468 | 8.61 | 284 | 0.2124 |
| 0.2686 | 8.64 | 285 | 0.2124 |
| 0.2784 | 8.67 | 286 | 0.2124 |
| 0.2645 | 8.7 | 287 | 0.2123 |
| 0.2882 | 8.73 | 288 | 0.2122 |
| 0.293 | 8.76 | 289 | 0.2121 |
| 0.2691 | 8.79 | 290 | 0.2120 |
| 0.3051 | 8.82 | 291 | 0.2120 |
| 0.2897 | 8.85 | 292 | 0.2119 |
| 0.2625 | 8.88 | 293 | 0.2119 |
| 0.3175 | 8.91 | 294 | 0.2119 |
| 0.2702 | 8.94 | 295 | 0.2118 |
| 0.3006 | 8.97 | 296 | 0.2118 |
| 0.2438 | 9.0 | 297 | 0.2118 |
| 0.3455 | 9.03 | 298 | 0.2118 |
| 0.2754 | 9.06 | 299 | 0.2117 |
| 0.2761 | 9.09 | 300 | 0.2117 |
| 0.2699 | 9.12 | 301 | 0.2116 |
| 0.322 | 9.15 | 302 | 0.2116 |
| 0.2373 | 9.18 | 303 | 0.2115 |
| 0.2814 | 9.21 | 304 | 0.2114 |
| 0.3558 | 9.24 | 305 | 0.2113 |
| 0.3223 | 9.27 | 306 | 0.2113 |
| 0.2798 | 9.3 | 307 | 0.2112 |
| 0.3263 | 9.33 | 308 | 0.2111 |
| 0.2523 | 9.36 | 309 | 0.2110 |
| 0.2687 | 9.39 | 310 | 0.2109 |
| 0.2623 | 9.42 | 311 | 0.2109 |
| 0.3164 | 9.45 | 312 | 0.2108 |
| 0.2801 | 9.48 | 313 | 0.2108 |
| 0.2967 | 9.52 | 314 | 0.2107 |
| 0.2816 | 9.55 | 315 | 0.2107 |
| 0.2721 | 9.58 | 316 | 0.2107 |
| 0.297 | 9.61 | 317 | 0.2106 |
| 0.2585 | 9.64 | 318 | 0.2106 |
| 0.2361 | 9.67 | 319 | 0.2106 |
| 0.2365 | 9.7 | 320 | 0.2105 |
| 0.3068 | 9.73 | 321 | 0.2105 |
| 0.2938 | 9.76 | 322 | 0.2105 |
| 0.3219 | 9.79 | 323 | 0.2104 |
| 0.2706 | 9.82 | 324 | 0.2104 |
| 0.2837 | 9.85 | 325 | 0.2104 |
| 0.3062 | 9.88 | 326 | 0.2103 |
| 0.3063 | 9.91 | 327 | 0.2103 |
| 0.3163 | 9.94 | 328 | 0.2103 |
| 0.2935 | 9.97 | 329 | 0.2103 |
| 0.2611 | 10.0 | 330 | 0.2103 |
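
The log also gives a rough idea of the training set size, which the card does not otherwise document. The sketch below works from the logged totals (330 steps over 10 epochs at a batch size of 8). It assumes a single device, no gradient accumulation, and that the last partial batch of each epoch is kept. None of these is stated in the card, so the resulting range is an estimate only.

```python
import math

# Values taken from the card: final logged step, epoch count, train batch size.
total_steps = 330
num_epochs = 10
train_batch_size = 8

# Optimizer steps per epoch implied by the log.
steps_per_epoch = total_steps // num_epochs

# Assumptions: one device, no gradient accumulation, last partial batch kept.
# Then steps_per_epoch == ceil(n / batch_size), which bounds n on both sides.
smallest = (steps_per_epoch - 1) * train_batch_size + 1
largest = steps_per_epoch * train_batch_size

# Sanity check: both ends of the range give the same number of steps per epoch.
for n in (smallest, largest):
    assert math.ceil(n / train_batch_size) == steps_per_epoch

print(f"{steps_per_epoch} steps per epoch, {smallest} to {largest} training examples")
```

Under these assumptions the run used 33 steps per epoch, which points to a training set of roughly 257 to 264 examples. This is consistent with the Epoch column advancing by about 0.03 per step.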


### Framework versions

- Transformers 4.29.1
- PyTorch 2.0.0+cu118
- Datasets 2.12.0
- Tokenizers 0.13.3