02shanky commited on
Commit
79a0ab3
1 Parent(s): 31cda25

End of training

Browse files
Files changed (1) hide show
  1. README.md +12 -9
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
22
  metrics:
23
  - name: Accuracy
24
  type: accuracy
25
- value: 0.9911111111111112
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [02shanky/vit-finetuned-cifar10](https://huggingface.co/02shanky/vit-finetuned-cifar10) on the cifar10 dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 0.0336
36
- - Accuracy: 0.9911
37
 
38
  ## Model description
39
 
@@ -53,21 +53,24 @@ More information needed
53
 
54
  The following hyperparameters were used during training:
55
  - learning_rate: 0.0001
56
- - train_batch_size: 16
57
- - eval_batch_size: 16
58
  - seed: 42
59
  - gradient_accumulation_steps: 4
60
- - total_train_batch_size: 64
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
- - num_epochs: 2
64
 
65
  ### Training results
66
 
67
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
68
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
69
- | 0.306 | 1.0 | 633 | 0.0478 | 0.986 |
70
- | 0.2268 | 2.0 | 1266 | 0.0336 | 0.9911 |
 
 
 
71
 
72
 
73
  ### Framework versions
 
22
  metrics:
23
  - name: Accuracy
24
  type: accuracy
25
+ value: 0.992
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [02shanky/vit-finetuned-cifar10](https://huggingface.co/02shanky/vit-finetuned-cifar10) on the cifar10 dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 0.0306
36
+ - Accuracy: 0.992
37
 
38
  ## Model description
39
 
 
53
 
54
  The following hyperparameters were used during training:
55
  - learning_rate: 0.0001
56
+ - train_batch_size: 32
57
+ - eval_batch_size: 32
58
  - seed: 42
59
  - gradient_accumulation_steps: 4
60
+ - total_train_batch_size: 128
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
+ - num_epochs: 5
64
 
65
  ### Training results
66
 
67
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
68
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
69
+ | No log | 1.0 | 316 | 0.0619 | 0.9836 |
70
+ | 0.2651 | 2.0 | 633 | 0.0460 | 0.9867 |
71
+ | 0.2651 | 3.0 | 949 | 0.0415 | 0.9878 |
72
+ | 0.1967 | 4.0 | 1266 | 0.0326 | 0.9916 |
73
+ | 0.1552 | 4.99 | 1580 | 0.0306 | 0.992 |
74
 
75
 
76
  ### Framework versions