Edit model card

Model Card for Model ID

This model is a fine-tuned version of GroNLP/gpt2-small-dutch on GroNLP/dutch-cola. It achieves the following results on the evaluation set:

  • Loss: 0.5926
  • Accuracy: 0.7008

Training Details

Training Hyperparameters

The following hyperparameters were used during training:

  • learning rate: 4e-05
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: AdamW with lr=4e-05, weight_decay=0.01, betas=(o.9, 0.999) and epsilon=1e-08
  • num_epochs=3
  • fp16=True
  • gradient_acc_step=1

Evaluation

Training results

Training Loss Epoch Step Validation Loss Accuracy
0.6750 1.0 1244 0.8253 0.5688
0.5174 2.0 2488 0.5926 0.7008
0.4073 3.0 3732 0.6904 0.7004
Downloads last month
9
Safetensors
Model size
117M params
Tensor type
F32
·

Dataset used to train WideMan/gpt2-small-dutch_dutch-cola