iadithyan commited on
Commit
c8a07f0
1 Parent(s): 68afca9

End of training

Browse files
Files changed (2) hide show
  1. README.md +7 -9
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -53,9 +53,9 @@ wandb_watch:
53
  wandb_name:
54
  wandb_log_model:
55
 
56
- gradient_accumulation_steps: 4
57
- micro_batch_size: 2
58
- num_epochs: 4
59
  optimizer: paged_adamw_32bit
60
  lr_scheduler: cosine
61
  learning_rate: 0.0002
@@ -75,7 +75,7 @@ xformers_attention:
75
  flash_attention: true
76
 
77
  warmup_steps: 10
78
- evals_per_epoch: 4
79
  eval_table_size:
80
  saves_per_epoch: 1
81
  debug:
@@ -114,15 +114,13 @@ More information needed
114
 
115
  The following hyperparameters were used during training:
116
  - learning_rate: 0.0002
117
- - train_batch_size: 2
118
- - eval_batch_size: 2
119
  - seed: 42
120
- - gradient_accumulation_steps: 4
121
- - total_train_batch_size: 8
122
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
123
  - lr_scheduler_type: cosine
124
  - lr_scheduler_warmup_steps: 10
125
- - num_epochs: 4
126
 
127
  ### Training results
128
 
 
53
  wandb_name:
54
  wandb_log_model:
55
 
56
+ gradient_accumulation_steps: 1
57
+ micro_batch_size: 1
58
+ num_epochs: 20
59
  optimizer: paged_adamw_32bit
60
  lr_scheduler: cosine
61
  learning_rate: 0.0002
 
75
  flash_attention: true
76
 
77
  warmup_steps: 10
78
+ evals_per_epoch: 1
79
  eval_table_size:
80
  saves_per_epoch: 1
81
  debug:
 
114
 
115
  The following hyperparameters were used during training:
116
  - learning_rate: 0.0002
117
+ - train_batch_size: 1
118
+ - eval_batch_size: 1
119
  - seed: 42
 
 
120
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
121
  - lr_scheduler_type: cosine
122
  - lr_scheduler_warmup_steps: 10
123
+ - num_epochs: 20
124
 
125
  ### Training results
126
 
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bb5145f660beecc57ad98f3a51f272da10d0eeb894ac76ccadea4b6c7bd321d4
3
  size 167934026
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:98220f15e89bc42672ebb4f8ac879067b9d5a0b33c83b4785774616b1cb336bf
3
  size 167934026