sanchit-gandhi's picture
Training in progress, step 500
1283f53
raw history blame
No virus
4.54 kB
2022-05-04 13:11:52 INFO Running runs: []
2022-05-04 13:11:53 INFO Agent received command: run
2022-05-04 13:11:53 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.18004101365999406
language: fr.en
learning_rate: 0.0002757119755681108
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./
per_device_eval_batch_size: 8
per_device_train_batch_size: 8
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-04 13:11:53 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.18004101365999406 --language=fr.en --learning_rate=0.0002757119755681108 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./ --per_device_eval_batch_size=8 --per_device_train_batch_size=8 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-04 13:11:58 INFO Running runs: ['qk3ze7ok']
2022-05-04 13:12:13 INFO Running runs: []
2022-05-04 13:12:13 INFO Agent received command: run
2022-05-04 13:12:13 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.04999238095195753
language: fr.en
learning_rate: 0.0007702133913256148
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./
per_device_eval_batch_size: 8
per_device_train_batch_size: 8
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-04 13:12:13 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.04999238095195753 --language=fr.en --learning_rate=0.0007702133913256148 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./ --per_device_eval_batch_size=8 --per_device_train_batch_size=8 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-04 13:12:18 INFO Running runs: ['o7jpar4x']
2022-05-04 13:30:33 INFO Running runs: []
2022-05-04 13:30:33 INFO Agent received command: run
2022-05-04 13:30:33 INFO Agent starting run with config:
eval_split_name: test
eval_steps: 500
evaluation_strategy: steps
generation_max_length: 40
generation_num_beams: 1
gradient_accumulation_steps: 8
greater_is_better: True
hidden_dropout: 0.035938233699532036
language: fr.en
learning_rate: 0.0003284999261672522
logging_steps: 1
max_duration_in_seconds: 20
metric_for_best_model: bleu
model_name_or_path: ./
num_train_epochs: 3
output_dir: ./
per_device_eval_batch_size: 8
per_device_train_batch_size: 8
save_steps: 500
task: covost2
warmup_steps: 500
2022-05-04 13:30:33 INFO About to run command: python3 run_xtreme_s.py --overwrite_output_dir --freeze_feature_encoder --gradient_checkpointing --predict_with_generate --fp16 --group_by_length --do_train --do_eval --load_best_model_at_end --push_to_hub --use_auth_token --eval_split_name=test --eval_steps=500 --evaluation_strategy=steps --generation_max_length=40 --generation_num_beams=1 --gradient_accumulation_steps=8 --greater_is_better=True --hidden_dropout=0.035938233699532036 --language=fr.en --learning_rate=0.0003284999261672522 --logging_steps=1 --max_duration_in_seconds=20 --metric_for_best_model=bleu --model_name_or_path=./ --num_train_epochs=3 --output_dir=./ --per_device_eval_batch_size=8 --per_device_train_batch_size=8 --save_steps=500 --task=covost2 --warmup_steps=500
2022-05-04 13:30:38 INFO Running runs: ['1tmxz74i']