--- license: mit datasets: - craffel/tasky_or_not language: - en metrics: - accuracy - f1 - recall - precision pipeline_tag: text-classification --- **Hyperparameters:** - learning rate: 2e-5 - weight decay: 0.01 - per_device_train_batch_size: 16 - per_device_eval_batch_size: 16 - gradient_accumulation_steps:1 - eval steps: 5000 - max_length: 128 - num_epochs: 3 **Dataset version:** - “craffel/tasky_or_not”, “10xp3_10xc4”, “15f88c8” **Checkpoint:** - 10000 steps **Results on Validation set:** | Step | Training Loss | Validation Loss | Accuracy | Precision | Recall | F1 | |-------|---------------|-----------------|----------|-----------|----------|----------| | 5000 | 0.036400 | 0.266518 | 0.926913 | 0.999662 | 0.916934 | 0.956513 | | 10000 | 0.022500 | 0.222881 | 0.952443 | 0.999494 | 0.946227 | 0.972132 | | 15000 | 0.016600 | 0.634102 | 0.882638 | 0.999789 | 0.866301 | 0.928270 | | 20000 | 0.011300 | 1.138026 | 0.849013 | 0.999796 | 0.827928 | 0.905781 | | 25000 | 0.010300 | 0.623522 | 0.895619 | 0.999728 | 0.881166 | 0.936710 | | 30000 | 0.006300 | 0.776632 | 0.879492 | 0.999804 | 0.862697 | 0.926204 | | 35000 | 0.000500 | 0.704599 | 0.899149 | 0.999698 | 0.885220 | 0.938982 |