End of training
b4d2e8e
verified
-
dataset_column_name=raw_content, dataset_split=train, dataset_subset=sample, dataset_uri=togethercomputer_RedPajama-Data-V2
Training in progress, step 5000
-
dataset_shuffle=True, dataset_split=train, dataset_subset=None, dataset_trust_remote_code=True, dataset_uri=Skylion007_openwebtext, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_shuffle=True, dataset_split=train, dataset_subset=None, dataset_uri=distily_c4_multilingual_1M, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_shuffle=True, dataset_split=train, dataset_subset=None, dataset_uri=distily_filtered_redpajama_en, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_shuffle=True, dataset_split=train, dataset_subset=None, dataset_uri=distily_filtered_redpajama_multilingual, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_shuffle=True, dataset_split=train, dataset_subset=None, dataset_uri=distily_synth_gpt2_t1_seq_1M, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_shuffle=True, lr_scheduler_kwargs=None, lr_scheduler_type=constant
End of training
-
dataset_split=train, dataset_subset=None, dataset_trust_remote_code=True, dataset_uri=Skylion007_openwebtext, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_trust_remote_code=True, dataset_uri=Skylion007_openwebtext
End of training
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_c4_multilingual_1M, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_c4_multilingual_1M
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_filtered_redpajama_en, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_filtered_redpajama_en
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_filtered_redpajama_multilingual, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_filtered_redpajama_multilingual
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_synth_gpt2_t1_seq_1M, lr_scheduler_kwargs=None, lr_scheduler_type=constant
Training in progress, step 5000
-
dataset_split=train, dataset_subset=None, dataset_uri=distily_synth_gpt2_t1_seq_1M
Training in progress, step 5000
-
lr_scheduler_kwargs=None, lr_scheduler_type=constant
End of training
-
0 Bytes
Training in progress, step 5000
-
1.49 MB
Training in progress, step 62375
-
529 Bytes
End of training