sanchit-gandhi's picture
Training in progress, step 500
758a5a8
2022-05-03 17:09:46 INFO Running runs: []
2022-05-03 17:09:46 INFO Agent received command: run
2022-05-03 17:09:46 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.036619638921206475
language: fr.en
learning_rate: 0.00024391819705381628
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./output_dir
per_device_eval_batch_size: 4
per_device_train_batch_size: 4
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-03 17:09:46 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.036619638921206475 --language=fr.en --learning_rate=0.00024391819705381628 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./output_dir --per_device_eval_batch_size=4 --per_device_train_batch_size=4 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-03 17:09:51 INFO Running runs: ['vz5ppd75']
2022-05-03 17:10:26 INFO Cleaning up finished run: vz5ppd75
2022-05-03 17:10:28 INFO Agent received command: run
2022-05-03 17:10:28 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.1875094322808032
language: fr.en
learning_rate: 0.00024438201183496223
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./output_dir
per_device_eval_batch_size: 4
per_device_train_batch_size: 4
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-03 17:10:36 INFO Running runs: []
2022-05-03 17:10:36 INFO Agent received command: run
2022-05-03 17:10:36 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.055722391000930585
language: fr.en
learning_rate: 0.0006457481677728278
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./output_dir
per_device_eval_batch_size: 4
per_device_train_batch_size: 4
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-03 17:10:36 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.055722391000930585 --language=fr.en --learning_rate=0.0006457481677728278 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./output_dir --per_device_eval_batch_size=4 --per_device_train_batch_size=4 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-03 17:10:41 INFO Running runs: ['ldsojzle']
2022-05-03 17:11:07 INFO Cleaning up finished run: ldsojzle
2022-05-03 17:11:07 INFO Agent received command: run
2022-05-03 17:11:07 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.056807662149569525
language: fr.en
learning_rate: 0.0005558468401613797
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./output_dir
per_device_eval_batch_size: 4
per_device_train_batch_size: 4
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-03 17:11:07 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.056807662149569525 --language=fr.en --learning_rate=0.0005558468401613797 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./output_dir --per_device_eval_batch_size=4 --per_device_train_batch_size=4 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-03 17:11:12 INFO Running runs: ['qv3vjr6j']
2022-05-03 17:10:28 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.1875094322808032 --language=fr.en --learning_rate=0.00024438201183496223 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./output_dir --per_device_eval_batch_size=4 --per_device_train_batch_size=4 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-03 17:11:29 INFO Running runs: ['irggvkgd']
2022-05-03 17:11:37 INFO Cleaning up finished run: qv3vjr6j
2022-05-03 17:11:37 INFO Agent received command: run
2022-05-03 17:11:37 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.03413483050532159
language: fr.en
learning_rate: 0.00022086866790135088
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./output_dir
per_device_eval_batch_size: 4
per_device_train_batch_size: 4
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-03 17:11:37 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.03413483050532159 --language=fr.en --learning_rate=0.00022086866790135088 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./output_dir --per_device_eval_batch_size=4 --per_device_train_batch_size=4 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-03 17:15:19 INFO Running runs: []
2022-05-03 17:15:19 INFO Agent received command: run
2022-05-03 17:15:19 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.06862889720223829
language: fr.en
learning_rate: 0.0004848089062550082
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./
per_device_eval_batch_size: 4
per_device_train_batch_size: 4
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-03 17:15:19 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.06862889720223829 --language=fr.en --learning_rate=0.0004848089062550082 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./ --per_device_eval_batch_size=4 --per_device_train_batch_size=4 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-03 17:15:24 INFO Running runs: ['a6039xud']