--- base_model: yhavinga/ul2-large-dutch library_name: peft license: apache-2.0 tags: - generated_from_trainer model-index: - name: ul2-large-dutch-finetuned-oba-book-search results: [] --- # ul2-large-dutch-finetuned-oba-book-search This model is a fine-tuned version of [yhavinga/ul2-large-dutch](https://huggingface.co/yhavinga/ul2-large-dutch) on the None dataset. It achieves the following results on the evaluation set: - Loss: 4.1161 - Top-5-accuracy: 4.1679 ## Model description More information needed ## Intended uses & limitations More information needed ## Training and evaluation data More information needed ## Training procedure ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 0.3 - train_batch_size: 16 - eval_batch_size: 16 - seed: 42 - optimizer: AdaFactor - lr_scheduler_type: linear - num_epochs: 10 ### Training results | Training Loss | Epoch | Step | Validation Loss | Top-5-accuracy | |:-------------:|:------:|:----:|:---------------:|:--------------:| | 6.2541 | 0.2577 | 200 | 4.6137 | 0.0579 | | 5.8635 | 0.5155 | 400 | 4.5076 | 0.1158 | | 5.5301 | 0.7732 | 600 | 4.4350 | 0.1447 | | 5.5298 | 1.0309 | 800 | 4.4449 | 0.1447 | | 5.3296 | 1.2887 | 1000 | 4.4621 | 0.1158 | | 5.3336 | 1.5464 | 1200 | 4.4232 | 0.1447 | | 5.2192 | 1.8041 | 1400 | 4.3842 | 0.1447 | | 5.2348 | 2.0619 | 1600 | 4.3465 | 0.1447 | | 5.0988 | 2.3196 | 1800 | 4.3129 | 0.2026 | | 5.1633 | 2.5773 | 2000 | 4.3007 | 0.1737 | | 5.1103 | 2.8351 | 2200 | 4.2722 | 0.2026 | | 5.0057 | 3.0928 | 2400 | 4.3158 | 0.1447 | | 5.0554 | 3.3505 | 2600 | 4.2731 | 0.4342 | | 4.9774 | 3.6082 | 2800 | 4.2467 | 0.3763 | | 4.9769 | 3.8660 | 3000 | 4.2320 | 0.5789 | | 4.9825 | 4.1237 | 3200 | 4.2115 | 0.8394 | | 4.9692 | 4.3814 | 3400 | 4.2172 | 1.3893 | | 4.9681 | 4.6392 | 3600 | 4.2093 | 1.5630 | | 4.8661 | 4.8969 | 3800 | 4.2003 | 2.2865 | | 4.942 | 5.1546 | 4000 | 4.2047 | 2.3734 | | 4.8974 | 5.4124 | 4200 | 4.1583 | 2.8654 | | 4.8827 | 5.6701 | 4400 | 4.1852 | 2.9522 | | 4.8705 | 5.9278 | 4600 | 4.1661 | 3.4732 | | 4.8714 | 6.1856 | 4800 | 4.1478 | 3.7916 | | 4.7909 | 6.4433 | 5000 | 4.1748 | 3.6179 | | 4.8357 | 6.7010 | 5200 | 4.1471 | 3.9074 | | 4.8723 | 6.9588 | 5400 | 4.1518 | 4.0232 | | 4.8838 | 7.2165 | 5600 | 4.1428 | 4.1389 | | 4.804 | 7.4742 | 5800 | 4.1468 | 4.0232 | | 4.8232 | 7.7320 | 6000 | 4.1390 | 4.1389 | | 4.8571 | 7.9897 | 6200 | 4.1305 | 4.0810 | | 4.7454 | 8.2474 | 6400 | 4.1297 | 4.1679 | | 4.8652 | 8.5052 | 6600 | 4.1262 | 4.1968 | | 4.7882 | 8.7629 | 6800 | 4.1227 | 4.1679 | | 4.8025 | 9.0206 | 7000 | 4.1134 | 4.1679 | | 4.8124 | 9.2784 | 7200 | 4.1211 | 4.1389 | | 4.7157 | 9.5361 | 7400 | 4.1122 | 4.1389 | | 4.8666 | 9.7938 | 7600 | 4.1161 | 4.1679 | ### Framework versions - PEFT 0.11.0 - Transformers 4.44.2 - Pytorch 1.13.0+cu116 - Datasets 3.0.0 - Tokenizers 0.19.1