NiharGupte commited on
Commit
a1b6065
1 Parent(s): 6be09b1

Model save

Browse files
Files changed (2) hide show
  1. README.md +9 -7
  2. model.safetensors +1 -1
README.md CHANGED
@@ -56,6 +56,8 @@ The following hyperparameters were used during training:
56
  - train_batch_size: 32
57
  - eval_batch_size: 32
58
  - seed: 42
 
 
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
  - lr_scheduler_warmup_ratio: 0.1
@@ -63,13 +65,13 @@ The following hyperparameters were used during training:
63
 
64
  ### Training results
65
 
66
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
67
- |:-------------:|:-----:|:----:|:---------------:|:--------:|
68
- | 0.0 | 1.0 | 47 | nan | 0.4890 |
69
- | 0.0 | 2.0 | 94 | nan | 0.4890 |
70
- | 0.0 | 3.0 | 141 | nan | 0.4890 |
71
- | 0.0 | 4.0 | 188 | nan | 0.4890 |
72
- | 0.0 | 5.0 | 235 | nan | 0.4890 |
73
 
74
 
75
  ### Framework versions
 
56
  - train_batch_size: 32
57
  - eval_batch_size: 32
58
  - seed: 42
59
+ - gradient_accumulation_steps: 4
60
+ - total_train_batch_size: 128
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
  - lr_scheduler_warmup_ratio: 0.1
 
65
 
66
  ### Training results
67
 
68
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
69
+ |:-------------:|:------:|:----:|:---------------:|:--------:|
70
+ | 0.0 | 0.9362 | 11 | nan | 0.4890 |
71
+ | 0.0 | 1.9574 | 23 | nan | 0.4890 |
72
+ | 0.0 | 2.9787 | 35 | nan | 0.4890 |
73
+ | 0.0 | 4.0 | 47 | nan | 0.4890 |
74
+ | 0.0 | 4.6809 | 55 | nan | 0.4890 |
75
 
76
 
77
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f9b84a6340a0458daa4a0e3b1e040dd4844afc189be7a88a1672b8991b71b586
3
  size 94302952
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:780eabfb1ede8e9e88f021a6b3eb3e36687167bee7cd7ec380ba2499c1df5c17
3
  size 94302952