metadata

base_model: yhavinga/ul2-large-dutch
library_name: peft
license: apache-2.0
tags:
  - generated_from_trainer
model-index:
  - name: ul2-large-dutch-finetuned-oba-book-search
    results: []

ul2-large-dutch-finetuned-oba-book-search

This model is a fine-tuned version of yhavinga/ul2-large-dutch on the None dataset. It achieves the following results on the evaluation set:

Loss: 4.5684
Top-5-accuracy: 0.1158

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.03
train_batch_size: 16
eval_batch_size: 16
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 5

Training results

Training Loss	Epoch	Step	Validation Loss	Top-5-accuracy
7.7153	0.2577	200	5.3876	0.0
6.8602	0.5155	400	4.8652	0.0
6.3689	0.7732	600	4.6435	0.0
6.2303	1.0309	800	4.6293	0.0579
6.0898	1.2887	1000	4.6395	0.0289
6.0367	1.5464	1200	4.5855	0.0289
5.8512	1.8041	1400	4.5860	0.0579
5.9489	2.0619	1600	4.5672	0.0868
5.7601	2.3196	1800	4.5522	0.0579
5.7379	2.5773	2000	4.5572	0.0868
5.7397	2.8351	2200	4.5559	0.0579
5.7488	3.0928	2400	4.5769	0.1447
5.7581	3.3505	2600	4.5421	0.1158
5.6448	3.6082	2800	4.5174	0.1447
5.6551	3.8660	3000	4.5773	0.1158
5.6971	4.1237	3200	4.5495	0.0868
5.7085	4.3814	3400	4.5392	0.1447
5.6689	4.6392	3600	4.5707	0.1158
5.5422	4.8969	3800	4.5684	0.1158

Framework versions

PEFT 0.11.0
Transformers 4.44.2
Pytorch 1.13.0+cu116
Datasets 3.0.0
Tokenizers 0.19.1