andrejikica commited on
Commit
d767182
1 Parent(s): 6ff2fc4

Model save

Browse files
README.md CHANGED
@@ -34,14 +34,14 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 1e-05
37
- - train_batch_size: 4
38
  - eval_batch_size: 8
39
  - seed: 42
40
  - gradient_accumulation_steps: 4
41
- - total_train_batch_size: 16
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
- - lr_scheduler_warmup_steps: 2
45
  - num_epochs: 4
46
 
47
  ### Training results
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 1e-05
37
+ - train_batch_size: 8
38
  - eval_batch_size: 8
39
  - seed: 42
40
  - gradient_accumulation_steps: 4
41
+ - total_train_batch_size: 32
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
+ - lr_scheduler_warmup_steps: 5
45
  - num_epochs: 4
46
 
47
  ### Training results
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2e4f90c5c11f0f780b9e2ec33d3a129cf5bf2c0ec4b09f81eb7f8ede5898976b
3
  size 22660608
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:17ea3cc1f5c0e6fbb51f188e584f1b803e48936be87c050db57aa0ff9ec3bc1b
3
  size 22660608
runs/Jun08_02-44-31_2376c4a8c634/events.out.tfevents.1717839871.2376c4a8c634 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0ea9053b1ad66902a9946d05437e1276f03fc3568e12695e29d2386c6944c3c0
3
- size 24387
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5475081c6c38c8d92ed27a58e5d0b6cce2c68d1c04066eb36296d2b381dd1c99
3
+ size 26429