Edit model card

Test model for DL4NLP 2022 HW06

xtremedistil-l6-h256-uncased trained on SQuAD

Hyper parameters

  • learning rate: 1e-5
  • weight decay: 0.01
  • warm up steps: 0
  • learning rate scheduler: linear
  • epochs: 1

Metric results on the dev set

  • F1: 65.48
  • EM: 51.67
Downloads last month
0