pere commited on
Commit
ce96f17
·
1 Parent(s): c75a479

Saving weights and logs of step 5000

Browse files
events.out.tfevents.1639508897.t1v-n-ccbf3e94-w-0.1127841.3.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a94fc49010f00789d9523d1e095e712797dd651b8a424e94d9819e798a827347
3
+ size 40
events.out.tfevents.1639509101.t1v-n-ccbf3e94-w-0.1129530.3.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ff7a1c0239355ec343741718e12e7e993b5ba59915aa73ecd12e2f5a725ec48d
3
+ size 735136
flax_model.msgpack CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e71edca5819ab8aa0ba904fe14c91587deec98559ac47277f89bdc697555ce60
3
  size 498796983
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4bfa49f9daaa21db6aaf482ff37c197589cabf8e1b47fb19363e76de72daf02
3
  size 498796983
run_w5_long.sh CHANGED
@@ -10,7 +10,7 @@
10
  --weight_decay="0.01" \
11
  --per_device_train_batch_size="128" \
12
  --per_device_eval_batch_size="128" \
13
- --learning_rate="3e-4" \
14
  --warmup_steps="5000" \
15
  --overwrite_output_dir \
16
  --num_train_epochs="250" \
 
10
  --weight_decay="0.01" \
11
  --per_device_train_batch_size="128" \
12
  --per_device_eval_batch_size="128" \
13
+ --learning_rate="2e-4" \
14
  --warmup_steps="5000" \
15
  --overwrite_output_dir \
16
  --num_train_epochs="250" \