About Wav2Vec2 pre-training

#3 by huutuongtu

Hello @patrickvonplaten , I am pre-training wav2vec2 on my own unlabeled dataset (about 300 hours of audio), following this code: https://github.com/huggingface/transformers/tree/main/examples/pytorch/speech-pretraining
My settings (sketched below):
max_train_steps = 120000
num_warmup_steps= 32000
lr = 0.001
batch_size = 2
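
For reference, this is roughly how I understand those settings map onto the optimizer and learning-rate schedule (a minimal sketch assuming the AdamW + linear-warmup setup the example script uses; the bare `Wav2Vec2Config()` is only a placeholder for my real config):

```python
import torch
from transformers import (
    Wav2Vec2Config,
    Wav2Vec2ForPreTraining,
    get_linear_schedule_with_warmup,
)

# placeholder config; the real run initializes from the base model's config
model = Wav2Vec2ForPreTraining(Wav2Vec2Config())

# lr = 0.001, warmed up linearly over 32k steps, then decayed to 0 at 120k steps
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
lr_scheduler = get_linear_schedule_with_warmup(
    optimizer,
    num_warmup_steps=32_000,
    num_training_steps=120_000,
)
```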
When I train, the contrastive loss and grad_norm decrease very quickly and reach zero (within about 400-500 steps). Do you have any idea how to fix this? Thank you.
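
To show what I mean, here is a minimal sketch of the check I can run for codebook collapse (assuming the standard `Wav2Vec2ForPreTraining` API in transformers; the random input and default config are placeholders, not my real data). If `codevector_perplexity` drops toward 1, the quantizer is picking the same codevector everywhere, which would make the contrastive task trivial:

```python
import torch
from transformers import Wav2Vec2Config, Wav2Vec2ForPreTraining
from transformers.models.wav2vec2.modeling_wav2vec2 import (
    _compute_mask_indices,
    _sample_negative_indices,
)

config = Wav2Vec2Config()  # placeholder; the real run loads the base config
model = Wav2Vec2ForPreTraining(config).eval()

# dummy batch: 2 clips of 1 s of 16 kHz audio, standing in for a real batch
input_values = torch.randn(2, 16_000)
seq_len = int(model._get_feat_extract_output_lengths(input_values.shape[-1]))

# sample masked positions and negatives, as in the pre-training script
mask_time_indices = _compute_mask_indices(
    (2, seq_len),
    mask_prob=config.mask_time_prob,
    mask_length=config.mask_time_length,
    min_masks=2,
)
sampled_negative_indices = _sample_negative_indices(
    (2, seq_len), config.num_negatives, mask_time_indices=mask_time_indices
)
mask_time_indices = torch.tensor(mask_time_indices, dtype=torch.bool)
sampled_negative_indices = torch.tensor(sampled_negative_indices, dtype=torch.long)

with torch.no_grad():
    outputs = model(
        input_values,
        mask_time_indices=mask_time_indices,
        sampled_negative_indices=sampled_negative_indices,
    )

# perplexity near 1 would mean the quantizer always picks the same codevector
print(outputs.contrastive_loss, outputs.diversity_loss, outputs.codevector_perplexity)
```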
