varun-v-rao commited on
Commit
4538f8e
1 Parent(s): 3ff1e21

End of training

Browse files
Files changed (1) hide show
  1. README.md +12 -12
README.md CHANGED
@@ -19,7 +19,7 @@ model-index:
19
  metrics:
20
  - name: Accuracy
21
  type: accuracy
22
- value: 0.9131274131274131
23
  ---
24
 
25
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,8 +29,8 @@ should probably proofread and complete it, then remove this comment. -->
29
 
30
  This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the snli dataset.
31
  It achieves the following results on the evaluation set:
32
- - Loss: 0.2615
33
- - Accuracy: 0.9131
34
 
35
  ## Model description
36
 
@@ -50,25 +50,25 @@ More information needed
50
 
51
  The following hyperparameters were used during training:
52
  - learning_rate: 2e-05
53
- - train_batch_size: 64
54
- - eval_batch_size: 32
55
- - seed: 75
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: linear
58
  - num_epochs: 3
59
 
60
  ### Training results
61
 
62
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
63
- |:-------------:|:-----:|:-----:|:---------------:|:--------:|
64
- | 0.3449 | 1.0 | 8584 | 0.2785 | 0.9046 |
65
- | 0.2964 | 2.0 | 17168 | 0.2619 | 0.9096 |
66
- | 0.2557 | 3.0 | 25752 | 0.2615 | 0.9131 |
67
 
68
 
69
  ### Framework versions
70
 
71
  - Transformers 4.35.2
72
- - Pytorch 2.0.1+cu118
73
  - Datasets 2.15.0
74
  - Tokenizers 0.15.0
 
19
  metrics:
20
  - name: Accuracy
21
  type: accuracy
22
+ value: 0.9089615931721194
23
  ---
24
 
25
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
29
 
30
  This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the snli dataset.
31
  It achieves the following results on the evaluation set:
32
+ - Loss: 0.2600
33
+ - Accuracy: 0.9090
34
 
35
  ## Model description
36
 
 
50
 
51
  The following hyperparameters were used during training:
52
  - learning_rate: 2e-05
53
+ - train_batch_size: 256
54
+ - eval_batch_size: 128
55
+ - seed: 30
56
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
57
  - lr_scheduler_type: linear
58
  - num_epochs: 3
59
 
60
  ### Training results
61
 
62
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
63
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
64
+ | 0.369 | 1.0 | 2146 | 0.2819 | 0.9007 |
65
+ | 0.3179 | 2.0 | 4292 | 0.2659 | 0.9067 |
66
+ | 0.2931 | 3.0 | 6438 | 0.2600 | 0.9090 |
67
 
68
 
69
  ### Framework versions
70
 
71
  - Transformers 4.35.2
72
+ - Pytorch 2.1.1+cu121
73
  - Datasets 2.15.0
74
  - Tokenizers 0.15.0