End of training
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
16 |
|
17 |
This model is a fine-tuned version of [facebook/detr-resnet-50](https://huggingface.co/facebook/detr-resnet-50) on the None dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
-
- Loss: 0.
|
20 |
|
21 |
## Model description
|
22 |
|
@@ -35,123 +35,323 @@ More information needed
|
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
-
- learning_rate:
|
39 |
-
- train_batch_size:
|
40 |
- eval_batch_size: 8
|
41 |
- seed: 42
|
42 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
43 |
- lr_scheduler_type: cosine
|
44 |
-
- num_epochs:
|
45 |
|
46 |
### Training results
|
47 |
|
48 |
-
| Training Loss | Epoch | Step
|
49 |
-
|
50 |
-
|
|
51 |
-
|
|
52 |
-
|
|
53 |
-
|
|
54 |
-
| 0.
|
55 |
-
| 0.
|
56 |
-
| 0.
|
57 |
-
| 0.
|
58 |
-
| 0.
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.
|
62 |
-
| 0.
|
63 |
-
| 0.
|
64 |
-
| 0.
|
65 |
-
| 0.
|
66 |
-
| 0.
|
67 |
-
| 0.
|
68 |
-
| 0.
|
69 |
-
| 0.
|
70 |
-
| 0.
|
71 |
-
| 0.
|
72 |
-
| 0.
|
73 |
-
| 0.
|
74 |
-
| 0.
|
75 |
-
| 0.
|
76 |
-
| 0.
|
77 |
-
| 0.
|
78 |
-
| 0.
|
79 |
-
| 0.
|
80 |
-
| 0.
|
81 |
-
| 0.
|
82 |
-
| 0.
|
83 |
-
| 0.
|
84 |
-
| 0.
|
85 |
-
| 0.
|
86 |
-
| 0.
|
87 |
-
| 0.
|
88 |
-
| 0.
|
89 |
-
| 0.
|
90 |
-
| 0.
|
91 |
-
| 0.
|
92 |
-
| 0.
|
93 |
-
| 0.
|
94 |
-
| 0.
|
95 |
-
| 0.
|
96 |
-
| 0.
|
97 |
-
| 0.
|
98 |
-
| 0.
|
99 |
-
| 0.
|
100 |
-
| 0.
|
101 |
-
| 0.
|
102 |
-
| 0.
|
103 |
-
| 0.
|
104 |
-
| 0.
|
105 |
-
| 0.
|
106 |
-
| 0.
|
107 |
-
| 0.
|
108 |
-
| 0.
|
109 |
-
| 0.
|
110 |
-
| 0.
|
111 |
-
| 0.
|
112 |
-
| 0.
|
113 |
-
| 0.
|
114 |
-
| 0.
|
115 |
-
| 0.
|
116 |
-
| 0.
|
117 |
-
| 0.
|
118 |
-
| 0.
|
119 |
-
| 0.
|
120 |
-
| 0.
|
121 |
-
| 0.
|
122 |
-
| 0.
|
123 |
-
| 0.
|
124 |
-
| 0.
|
125 |
-
| 0.
|
126 |
-
| 0.
|
127 |
-
| 0.
|
128 |
-
| 0.
|
129 |
-
| 0.
|
130 |
-
| 0.
|
131 |
-
| 0.
|
132 |
-
| 0.
|
133 |
-
| 0.
|
134 |
-
| 0.
|
135 |
-
| 0.
|
136 |
-
| 0.
|
137 |
-
| 0.
|
138 |
-
| 0.
|
139 |
-
| 0.
|
140 |
-
| 0.
|
141 |
-
| 0.
|
142 |
-
| 0.
|
143 |
-
| 0.
|
144 |
-
| 0.
|
145 |
-
| 0.
|
146 |
-
| 0.
|
147 |
-
| 0.
|
148 |
-
| 0.
|
149 |
-
| 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
150 |
|
151 |
|
152 |
### Framework versions
|
153 |
|
154 |
-
- Transformers 4.
|
155 |
- Pytorch 2.4.1+cu121
|
156 |
-
- Datasets 2.
|
157 |
-
- Tokenizers 0.
|
|
|
16 |
|
17 |
This model is a fine-tuned version of [facebook/detr-resnet-50](https://huggingface.co/facebook/detr-resnet-50) on the None dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
+
- Loss: 0.2536
|
20 |
|
21 |
## Model description
|
22 |
|
|
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|
38 |
+
- learning_rate: 1e-05
|
39 |
+
- train_batch_size: 2
|
40 |
- eval_batch_size: 8
|
41 |
- seed: 42
|
42 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
43 |
- lr_scheduler_type: cosine
|
44 |
+
- num_epochs: 300
|
45 |
|
46 |
### Training results
|
47 |
|
48 |
+
| Training Loss | Epoch | Step | Validation Loss |
|
49 |
+
|:-------------:|:-----:|:------:|:---------------:|
|
50 |
+
| 0.9466 | 1.0 | 497 | 0.8504 |
|
51 |
+
| 0.6647 | 2.0 | 994 | 0.5799 |
|
52 |
+
| 0.571 | 3.0 | 1491 | 0.4565 |
|
53 |
+
| 0.5146 | 4.0 | 1988 | 0.4214 |
|
54 |
+
| 0.4731 | 5.0 | 2485 | 0.4012 |
|
55 |
+
| 0.4703 | 6.0 | 2982 | 0.3908 |
|
56 |
+
| 0.4529 | 7.0 | 3479 | 0.3663 |
|
57 |
+
| 0.4454 | 8.0 | 3976 | 0.3666 |
|
58 |
+
| 0.3936 | 9.0 | 4473 | 0.3634 |
|
59 |
+
| 0.4043 | 10.0 | 4970 | 0.3439 |
|
60 |
+
| 0.3773 | 11.0 | 5467 | 0.3395 |
|
61 |
+
| 0.3584 | 12.0 | 5964 | 0.3292 |
|
62 |
+
| 0.3522 | 13.0 | 6461 | 0.3303 |
|
63 |
+
| 0.3538 | 14.0 | 6958 | 0.3392 |
|
64 |
+
| 0.3523 | 15.0 | 7455 | 0.3241 |
|
65 |
+
| 0.3357 | 16.0 | 7952 | 0.3179 |
|
66 |
+
| 0.3345 | 17.0 | 8449 | 0.3086 |
|
67 |
+
| 0.3455 | 18.0 | 8946 | 0.3193 |
|
68 |
+
| 0.3473 | 19.0 | 9443 | 0.3100 |
|
69 |
+
| 0.3075 | 20.0 | 9940 | 0.3139 |
|
70 |
+
| 0.3025 | 21.0 | 10437 | 0.2843 |
|
71 |
+
| 0.3051 | 22.0 | 10934 | 0.3035 |
|
72 |
+
| 0.3231 | 23.0 | 11431 | 0.3032 |
|
73 |
+
| 0.3253 | 24.0 | 11928 | 0.2904 |
|
74 |
+
| 0.3169 | 25.0 | 12425 | 0.2863 |
|
75 |
+
| 0.2944 | 26.0 | 12922 | 0.2840 |
|
76 |
+
| 0.2918 | 27.0 | 13419 | 0.2889 |
|
77 |
+
| 0.2868 | 28.0 | 13916 | 0.2934 |
|
78 |
+
| 0.3029 | 29.0 | 14413 | 0.2762 |
|
79 |
+
| 0.3141 | 30.0 | 14910 | 0.2891 |
|
80 |
+
| 0.2966 | 31.0 | 15407 | 0.3015 |
|
81 |
+
| 0.3006 | 32.0 | 15904 | 0.3064 |
|
82 |
+
| 0.2885 | 33.0 | 16401 | 0.2870 |
|
83 |
+
| 0.306 | 34.0 | 16898 | 0.2750 |
|
84 |
+
| 0.2997 | 35.0 | 17395 | 0.2684 |
|
85 |
+
| 0.279 | 36.0 | 17892 | 0.2655 |
|
86 |
+
| 0.2711 | 37.0 | 18389 | 0.2823 |
|
87 |
+
| 0.3016 | 38.0 | 18886 | 0.2903 |
|
88 |
+
| 0.2833 | 39.0 | 19383 | 0.2972 |
|
89 |
+
| 0.2963 | 40.0 | 19880 | 0.2755 |
|
90 |
+
| 0.2916 | 41.0 | 20377 | 0.2676 |
|
91 |
+
| 0.2982 | 42.0 | 20874 | 0.2676 |
|
92 |
+
| 0.2665 | 43.0 | 21371 | 0.2803 |
|
93 |
+
| 0.288 | 44.0 | 21868 | 0.2829 |
|
94 |
+
| 0.285 | 45.0 | 22365 | 0.3296 |
|
95 |
+
| 0.2622 | 46.0 | 22862 | 0.2740 |
|
96 |
+
| 0.2707 | 47.0 | 23359 | 0.2724 |
|
97 |
+
| 0.2923 | 48.0 | 23856 | 0.2924 |
|
98 |
+
| 0.2531 | 49.0 | 24353 | 0.2735 |
|
99 |
+
| 0.2756 | 50.0 | 24850 | 0.2766 |
|
100 |
+
| 0.2639 | 51.0 | 25347 | 0.2555 |
|
101 |
+
| 0.2646 | 52.0 | 25844 | 0.2708 |
|
102 |
+
| 0.2606 | 53.0 | 26341 | 0.2735 |
|
103 |
+
| 0.2704 | 54.0 | 26838 | 0.2853 |
|
104 |
+
| 0.2522 | 55.0 | 27335 | 0.2513 |
|
105 |
+
| 0.2817 | 56.0 | 27832 | 0.2692 |
|
106 |
+
| 0.2601 | 57.0 | 28329 | 0.2962 |
|
107 |
+
| 0.2453 | 58.0 | 28826 | 0.2890 |
|
108 |
+
| 0.2614 | 59.0 | 29323 | 0.2528 |
|
109 |
+
| 0.254 | 60.0 | 29820 | 0.2808 |
|
110 |
+
| 0.2302 | 61.0 | 30317 | 0.3073 |
|
111 |
+
| 0.249 | 62.0 | 30814 | 0.2677 |
|
112 |
+
| 0.262 | 63.0 | 31311 | 0.2641 |
|
113 |
+
| 0.2321 | 64.0 | 31808 | 0.2751 |
|
114 |
+
| 0.2623 | 65.0 | 32305 | 0.2873 |
|
115 |
+
| 0.2575 | 66.0 | 32802 | 0.2759 |
|
116 |
+
| 0.2464 | 67.0 | 33299 | 0.2533 |
|
117 |
+
| 0.2328 | 68.0 | 33796 | 0.2647 |
|
118 |
+
| 0.2324 | 69.0 | 34293 | 0.2700 |
|
119 |
+
| 0.2311 | 70.0 | 34790 | 0.2594 |
|
120 |
+
| 0.2286 | 71.0 | 35287 | 0.2531 |
|
121 |
+
| 0.2609 | 72.0 | 35784 | 0.2458 |
|
122 |
+
| 0.2313 | 73.0 | 36281 | 0.2550 |
|
123 |
+
| 0.2336 | 74.0 | 36778 | 0.2388 |
|
124 |
+
| 0.2306 | 75.0 | 37275 | 0.2396 |
|
125 |
+
| 0.245 | 76.0 | 37772 | 0.2600 |
|
126 |
+
| 0.2439 | 77.0 | 38269 | 0.2685 |
|
127 |
+
| 0.2388 | 78.0 | 38766 | 0.2458 |
|
128 |
+
| 0.2206 | 79.0 | 39263 | 0.2642 |
|
129 |
+
| 0.2416 | 80.0 | 39760 | 0.2693 |
|
130 |
+
| 0.2107 | 81.0 | 40257 | 0.2546 |
|
131 |
+
| 0.2385 | 82.0 | 40754 | 0.2564 |
|
132 |
+
| 0.2164 | 83.0 | 41251 | 0.2584 |
|
133 |
+
| 0.2343 | 84.0 | 41748 | 0.2531 |
|
134 |
+
| 0.2246 | 85.0 | 42245 | 0.2515 |
|
135 |
+
| 0.2212 | 86.0 | 42742 | 0.2745 |
|
136 |
+
| 0.2176 | 87.0 | 43239 | 0.2802 |
|
137 |
+
| 0.227 | 88.0 | 43736 | 0.2589 |
|
138 |
+
| 0.2138 | 89.0 | 44233 | 0.2577 |
|
139 |
+
| 0.218 | 90.0 | 44730 | 0.2698 |
|
140 |
+
| 0.2036 | 91.0 | 45227 | 0.2403 |
|
141 |
+
| 0.208 | 92.0 | 45724 | 0.2540 |
|
142 |
+
| 0.2009 | 93.0 | 46221 | 0.2511 |
|
143 |
+
| 0.2144 | 94.0 | 46718 | 0.2544 |
|
144 |
+
| 0.2103 | 95.0 | 47215 | 0.2675 |
|
145 |
+
| 0.2174 | 96.0 | 47712 | 0.2726 |
|
146 |
+
| 0.2046 | 97.0 | 48209 | 0.2554 |
|
147 |
+
| 0.2218 | 98.0 | 48706 | 0.2675 |
|
148 |
+
| 0.2172 | 99.0 | 49203 | 0.2631 |
|
149 |
+
| 0.2157 | 100.0 | 49700 | 0.2635 |
|
150 |
+
| 0.2068 | 101.0 | 50197 | 0.2416 |
|
151 |
+
| 0.2149 | 102.0 | 50694 | 0.2517 |
|
152 |
+
| 0.2217 | 103.0 | 51191 | 0.2542 |
|
153 |
+
| 0.2123 | 104.0 | 51688 | 0.2708 |
|
154 |
+
| 0.2052 | 105.0 | 52185 | 0.2573 |
|
155 |
+
| 0.2119 | 106.0 | 52682 | 0.2510 |
|
156 |
+
| 0.2004 | 107.0 | 53179 | 0.2636 |
|
157 |
+
| 0.2031 | 108.0 | 53676 | 0.2534 |
|
158 |
+
| 0.2034 | 109.0 | 54173 | 0.2575 |
|
159 |
+
| 0.2122 | 110.0 | 54670 | 0.2564 |
|
160 |
+
| 0.1985 | 111.0 | 55167 | 0.2775 |
|
161 |
+
| 0.1993 | 112.0 | 55664 | 0.2627 |
|
162 |
+
| 0.2173 | 113.0 | 56161 | 0.2615 |
|
163 |
+
| 0.1947 | 114.0 | 56658 | 0.2520 |
|
164 |
+
| 0.2167 | 115.0 | 57155 | 0.2691 |
|
165 |
+
| 0.2045 | 116.0 | 57652 | 0.2574 |
|
166 |
+
| 0.2087 | 117.0 | 58149 | 0.2706 |
|
167 |
+
| 0.1864 | 118.0 | 58646 | 0.2429 |
|
168 |
+
| 0.1957 | 119.0 | 59143 | 0.2434 |
|
169 |
+
| 0.2066 | 120.0 | 59640 | 0.2477 |
|
170 |
+
| 0.1906 | 121.0 | 60137 | 0.2653 |
|
171 |
+
| 0.1884 | 122.0 | 60634 | 0.2607 |
|
172 |
+
| 0.1931 | 123.0 | 61131 | 0.2692 |
|
173 |
+
| 0.2023 | 124.0 | 61628 | 0.2426 |
|
174 |
+
| 0.2033 | 125.0 | 62125 | 0.2577 |
|
175 |
+
| 0.196 | 126.0 | 62622 | 0.2551 |
|
176 |
+
| 0.2041 | 127.0 | 63119 | 0.2491 |
|
177 |
+
| 0.1878 | 128.0 | 63616 | 0.2597 |
|
178 |
+
| 0.2044 | 129.0 | 64113 | 0.2241 |
|
179 |
+
| 0.2014 | 130.0 | 64610 | 0.2479 |
|
180 |
+
| 0.1891 | 131.0 | 65107 | 0.2531 |
|
181 |
+
| 0.1945 | 132.0 | 65604 | 0.2467 |
|
182 |
+
| 0.1818 | 133.0 | 66101 | 0.2522 |
|
183 |
+
| 0.2094 | 134.0 | 66598 | 0.2466 |
|
184 |
+
| 0.2054 | 135.0 | 67095 | 0.2563 |
|
185 |
+
| 0.2138 | 136.0 | 67592 | 0.2725 |
|
186 |
+
| 0.1857 | 137.0 | 68089 | 0.2446 |
|
187 |
+
| 0.1962 | 138.0 | 68586 | 0.2475 |
|
188 |
+
| 0.1877 | 139.0 | 69083 | 0.2689 |
|
189 |
+
| 0.1771 | 140.0 | 69580 | 0.2680 |
|
190 |
+
| 0.1879 | 141.0 | 70077 | 0.2583 |
|
191 |
+
| 0.1838 | 142.0 | 70574 | 0.2470 |
|
192 |
+
| 0.2014 | 143.0 | 71071 | 0.2506 |
|
193 |
+
| 0.1735 | 144.0 | 71568 | 0.2784 |
|
194 |
+
| 0.1795 | 145.0 | 72065 | 0.2490 |
|
195 |
+
| 0.1826 | 146.0 | 72562 | 0.2525 |
|
196 |
+
| 0.1766 | 147.0 | 73059 | 0.2682 |
|
197 |
+
| 0.1903 | 148.0 | 73556 | 0.2654 |
|
198 |
+
| 0.2029 | 149.0 | 74053 | 0.2507 |
|
199 |
+
| 0.1772 | 150.0 | 74550 | 0.2503 |
|
200 |
+
| 0.1855 | 151.0 | 75047 | 0.2510 |
|
201 |
+
| 0.1894 | 152.0 | 75544 | 0.2574 |
|
202 |
+
| 0.1785 | 153.0 | 76041 | 0.2614 |
|
203 |
+
| 0.1773 | 154.0 | 76538 | 0.2493 |
|
204 |
+
| 0.1653 | 155.0 | 77035 | 0.2564 |
|
205 |
+
| 0.1737 | 156.0 | 77532 | 0.2628 |
|
206 |
+
| 0.1715 | 157.0 | 78029 | 0.2732 |
|
207 |
+
| 0.1688 | 158.0 | 78526 | 0.2663 |
|
208 |
+
| 0.1893 | 159.0 | 79023 | 0.2576 |
|
209 |
+
| 0.1662 | 160.0 | 79520 | 0.2538 |
|
210 |
+
| 0.1715 | 161.0 | 80017 | 0.2767 |
|
211 |
+
| 0.1756 | 162.0 | 80514 | 0.2600 |
|
212 |
+
| 0.1777 | 163.0 | 81011 | 0.2635 |
|
213 |
+
| 0.1775 | 164.0 | 81508 | 0.2681 |
|
214 |
+
| 0.1653 | 165.0 | 82005 | 0.2555 |
|
215 |
+
| 0.1695 | 166.0 | 82502 | 0.2536 |
|
216 |
+
| 0.1745 | 167.0 | 82999 | 0.2454 |
|
217 |
+
| 0.1726 | 168.0 | 83496 | 0.2507 |
|
218 |
+
| 0.1734 | 169.0 | 83993 | 0.2467 |
|
219 |
+
| 0.171 | 170.0 | 84490 | 0.2670 |
|
220 |
+
| 0.1742 | 171.0 | 84987 | 0.2406 |
|
221 |
+
| 0.1718 | 172.0 | 85484 | 0.2270 |
|
222 |
+
| 0.1734 | 173.0 | 85981 | 0.2822 |
|
223 |
+
| 0.1623 | 174.0 | 86478 | 0.2620 |
|
224 |
+
| 0.1731 | 175.0 | 86975 | 0.2587 |
|
225 |
+
| 0.182 | 176.0 | 87472 | 0.2741 |
|
226 |
+
| 0.1632 | 177.0 | 87969 | 0.2251 |
|
227 |
+
| 0.1601 | 178.0 | 88466 | 0.2584 |
|
228 |
+
| 0.1664 | 179.0 | 88963 | 0.2691 |
|
229 |
+
| 0.1789 | 180.0 | 89460 | 0.2679 |
|
230 |
+
| 0.1622 | 181.0 | 89957 | 0.2632 |
|
231 |
+
| 0.1602 | 182.0 | 90454 | 0.2448 |
|
232 |
+
| 0.1701 | 183.0 | 90951 | 0.2402 |
|
233 |
+
| 0.1635 | 184.0 | 91448 | 0.2601 |
|
234 |
+
| 0.1668 | 185.0 | 91945 | 0.2551 |
|
235 |
+
| 0.1632 | 186.0 | 92442 | 0.2681 |
|
236 |
+
| 0.1643 | 187.0 | 92939 | 0.2447 |
|
237 |
+
| 0.1597 | 188.0 | 93436 | 0.2354 |
|
238 |
+
| 0.169 | 189.0 | 93933 | 0.2506 |
|
239 |
+
| 0.1561 | 190.0 | 94430 | 0.2551 |
|
240 |
+
| 0.1626 | 191.0 | 94927 | 0.2350 |
|
241 |
+
| 0.172 | 192.0 | 95424 | 0.2455 |
|
242 |
+
| 0.1566 | 193.0 | 95921 | 0.2496 |
|
243 |
+
| 0.1594 | 194.0 | 96418 | 0.2491 |
|
244 |
+
| 0.1602 | 195.0 | 96915 | 0.2486 |
|
245 |
+
| 0.1609 | 196.0 | 97412 | 0.2640 |
|
246 |
+
| 0.1677 | 197.0 | 97909 | 0.2594 |
|
247 |
+
| 0.165 | 198.0 | 98406 | 0.2433 |
|
248 |
+
| 0.1653 | 199.0 | 98903 | 0.2412 |
|
249 |
+
| 0.1585 | 200.0 | 99400 | 0.2410 |
|
250 |
+
| 0.163 | 201.0 | 99897 | 0.2476 |
|
251 |
+
| 0.1488 | 202.0 | 100394 | 0.2461 |
|
252 |
+
| 0.1551 | 203.0 | 100891 | 0.2619 |
|
253 |
+
| 0.1611 | 204.0 | 101388 | 0.2686 |
|
254 |
+
| 0.1482 | 205.0 | 101885 | 0.2743 |
|
255 |
+
| 0.1673 | 206.0 | 102382 | 0.2562 |
|
256 |
+
| 0.1586 | 207.0 | 102879 | 0.2651 |
|
257 |
+
| 0.1594 | 208.0 | 103376 | 0.2524 |
|
258 |
+
| 0.1571 | 209.0 | 103873 | 0.2523 |
|
259 |
+
| 0.1507 | 210.0 | 104370 | 0.2572 |
|
260 |
+
| 0.1605 | 211.0 | 104867 | 0.2580 |
|
261 |
+
| 0.1548 | 212.0 | 105364 | 0.2541 |
|
262 |
+
| 0.1528 | 213.0 | 105861 | 0.2597 |
|
263 |
+
| 0.15 | 214.0 | 106358 | 0.2493 |
|
264 |
+
| 0.1465 | 215.0 | 106855 | 0.2491 |
|
265 |
+
| 0.1549 | 216.0 | 107352 | 0.2590 |
|
266 |
+
| 0.1516 | 217.0 | 107849 | 0.2634 |
|
267 |
+
| 0.1659 | 218.0 | 108346 | 0.2517 |
|
268 |
+
| 0.157 | 219.0 | 108843 | 0.2434 |
|
269 |
+
| 0.1572 | 220.0 | 109340 | 0.2567 |
|
270 |
+
| 0.1604 | 221.0 | 109837 | 0.2505 |
|
271 |
+
| 0.1501 | 222.0 | 110334 | 0.2565 |
|
272 |
+
| 0.1527 | 223.0 | 110831 | 0.2642 |
|
273 |
+
| 0.1571 | 224.0 | 111328 | 0.2560 |
|
274 |
+
| 0.1512 | 225.0 | 111825 | 0.2467 |
|
275 |
+
| 0.1521 | 226.0 | 112322 | 0.2642 |
|
276 |
+
| 0.1521 | 227.0 | 112819 | 0.2689 |
|
277 |
+
| 0.1555 | 228.0 | 113316 | 0.2525 |
|
278 |
+
| 0.1485 | 229.0 | 113813 | 0.2684 |
|
279 |
+
| 0.1587 | 230.0 | 114310 | 0.2571 |
|
280 |
+
| 0.1563 | 231.0 | 114807 | 0.2619 |
|
281 |
+
| 0.1448 | 232.0 | 115304 | 0.2636 |
|
282 |
+
| 0.1517 | 233.0 | 115801 | 0.2569 |
|
283 |
+
| 0.1418 | 234.0 | 116298 | 0.2365 |
|
284 |
+
| 0.1515 | 235.0 | 116795 | 0.2578 |
|
285 |
+
| 0.1518 | 236.0 | 117292 | 0.2418 |
|
286 |
+
| 0.143 | 237.0 | 117789 | 0.2554 |
|
287 |
+
| 0.1482 | 238.0 | 118286 | 0.2482 |
|
288 |
+
| 0.1476 | 239.0 | 118783 | 0.2473 |
|
289 |
+
| 0.1482 | 240.0 | 119280 | 0.2574 |
|
290 |
+
| 0.1443 | 241.0 | 119777 | 0.2531 |
|
291 |
+
| 0.1527 | 242.0 | 120274 | 0.2557 |
|
292 |
+
| 0.1435 | 243.0 | 120771 | 0.2540 |
|
293 |
+
| 0.1494 | 244.0 | 121268 | 0.2460 |
|
294 |
+
| 0.1431 | 245.0 | 121765 | 0.2475 |
|
295 |
+
| 0.1562 | 246.0 | 122262 | 0.2509 |
|
296 |
+
| 0.1413 | 247.0 | 122759 | 0.2567 |
|
297 |
+
| 0.1472 | 248.0 | 123256 | 0.2555 |
|
298 |
+
| 0.1557 | 249.0 | 123753 | 0.2569 |
|
299 |
+
| 0.1548 | 250.0 | 124250 | 0.2578 |
|
300 |
+
| 0.1592 | 251.0 | 124747 | 0.2534 |
|
301 |
+
| 0.1448 | 252.0 | 125244 | 0.2514 |
|
302 |
+
| 0.1489 | 253.0 | 125741 | 0.2536 |
|
303 |
+
| 0.1498 | 254.0 | 126238 | 0.2468 |
|
304 |
+
| 0.1467 | 255.0 | 126735 | 0.2516 |
|
305 |
+
| 0.1464 | 256.0 | 127232 | 0.2545 |
|
306 |
+
| 0.1555 | 257.0 | 127729 | 0.2516 |
|
307 |
+
| 0.1553 | 258.0 | 128226 | 0.2525 |
|
308 |
+
| 0.1482 | 259.0 | 128723 | 0.2557 |
|
309 |
+
| 0.1564 | 260.0 | 129220 | 0.2539 |
|
310 |
+
| 0.1408 | 261.0 | 129717 | 0.2626 |
|
311 |
+
| 0.1408 | 262.0 | 130214 | 0.2550 |
|
312 |
+
| 0.1454 | 263.0 | 130711 | 0.2581 |
|
313 |
+
| 0.1509 | 264.0 | 131208 | 0.2527 |
|
314 |
+
| 0.1554 | 265.0 | 131705 | 0.2530 |
|
315 |
+
| 0.1427 | 266.0 | 132202 | 0.2614 |
|
316 |
+
| 0.1435 | 267.0 | 132699 | 0.2581 |
|
317 |
+
| 0.1504 | 268.0 | 133196 | 0.2525 |
|
318 |
+
| 0.1431 | 269.0 | 133693 | 0.2541 |
|
319 |
+
| 0.1514 | 270.0 | 134190 | 0.2511 |
|
320 |
+
| 0.1494 | 271.0 | 134687 | 0.2539 |
|
321 |
+
| 0.1482 | 272.0 | 135184 | 0.2545 |
|
322 |
+
| 0.1442 | 273.0 | 135681 | 0.2538 |
|
323 |
+
| 0.1435 | 274.0 | 136178 | 0.2537 |
|
324 |
+
| 0.1491 | 275.0 | 136675 | 0.2494 |
|
325 |
+
| 0.1435 | 276.0 | 137172 | 0.2538 |
|
326 |
+
| 0.1377 | 277.0 | 137669 | 0.2498 |
|
327 |
+
| 0.1381 | 278.0 | 138166 | 0.2506 |
|
328 |
+
| 0.1378 | 279.0 | 138663 | 0.2508 |
|
329 |
+
| 0.1514 | 280.0 | 139160 | 0.2532 |
|
330 |
+
| 0.144 | 281.0 | 139657 | 0.2545 |
|
331 |
+
| 0.1524 | 282.0 | 140154 | 0.2516 |
|
332 |
+
| 0.1494 | 283.0 | 140651 | 0.2538 |
|
333 |
+
| 0.1471 | 284.0 | 141148 | 0.2548 |
|
334 |
+
| 0.1499 | 285.0 | 141645 | 0.2539 |
|
335 |
+
| 0.1504 | 286.0 | 142142 | 0.2544 |
|
336 |
+
| 0.139 | 287.0 | 142639 | 0.2537 |
|
337 |
+
| 0.1438 | 288.0 | 143136 | 0.2534 |
|
338 |
+
| 0.1466 | 289.0 | 143633 | 0.2545 |
|
339 |
+
| 0.1429 | 290.0 | 144130 | 0.2540 |
|
340 |
+
| 0.1484 | 291.0 | 144627 | 0.2539 |
|
341 |
+
| 0.1496 | 292.0 | 145124 | 0.2537 |
|
342 |
+
| 0.1382 | 293.0 | 145621 | 0.2534 |
|
343 |
+
| 0.1467 | 294.0 | 146118 | 0.2535 |
|
344 |
+
| 0.1355 | 295.0 | 146615 | 0.2536 |
|
345 |
+
| 0.1429 | 296.0 | 147112 | 0.2537 |
|
346 |
+
| 0.1405 | 297.0 | 147609 | 0.2536 |
|
347 |
+
| 0.1446 | 298.0 | 148106 | 0.2536 |
|
348 |
+
| 0.143 | 299.0 | 148603 | 0.2536 |
|
349 |
+
| 0.1496 | 300.0 | 149100 | 0.2536 |
|
350 |
|
351 |
|
352 |
### Framework versions
|
353 |
|
354 |
+
- Transformers 4.45.2
|
355 |
- Pytorch 2.4.1+cu121
|
356 |
+
- Datasets 2.19.2
|
357 |
+
- Tokenizers 0.20.0
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 166496880
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:79b8255c9db175260e0525a18f6c2fefd724fccc70741bcdda16e8d4a17a39eb
|
3 |
size 166496880
|
runs/Oct08_05-15-24_a928b780b319/events.out.tfevents.1728364526.a928b780b319.135.0
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1ceb9138cf9c097e64e52567586423c5b3ed33eca2c0885017d87d700e6960d6
|
3 |
+
size 1155018
|