farid1088's picture
Model save
a302bb3 verified
|
raw
history blame
No virus
2.04 kB
metadata
tags:
  - generated_from_trainer
model-index:
  - name: RoBERTa-legal-de-cased_German_legal_SQuAD_17
    results: []

RoBERTa-legal-de-cased_German_legal_SQuAD_17

This model was trained from scratch on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 2.7026

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 128
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 17

Training results

Training Loss Epoch Step Validation Loss
No log 1.0 3 5.9725
No log 2.0 6 5.4679
No log 3.0 9 4.8936
No log 4.0 12 4.6404
No log 5.0 15 3.8267
No log 6.0 18 3.5252
No log 7.0 21 3.3880
No log 8.0 24 3.2478
No log 9.0 27 3.1329
No log 10.0 30 3.1052
No log 11.0 33 2.9603
No log 12.0 36 2.8132
No log 13.0 39 2.8544
No log 14.0 42 2.9655
No log 15.0 45 2.8565
No log 16.0 48 2.7496
No log 17.0 51 2.7026

Framework versions

  • Transformers 4.36.2
  • Pytorch 2.1.2+cu121
  • Datasets 2.14.7
  • Tokenizers 0.15.0