plz help

#1
by coldpumpkinn - opened

Hello, I am a college student who is conducting a voice classification study. I am currently working on fin-tuning of the ast model. But there's a problem, and the dataset I have is not learning properly. The problem is trainer.train(), but I wonder what the shape of the dataset of train_dataset in trainer.train() is when fine tuning. Could you please share some things that I can help you with, such as sharing or advice?
The error message I currently have is as follows. RuntimeError: expected scalar type Long but found Int. Please let me know if there are any mistakes. I'll fix it Thank you.
https://drive.google.com/file/d/1vHYEIlXl3X2QCPS0eLSD0NX8kg3E6IFu/view?usp=sharing

Hi @coldpumpkinn ,

I did not encounter this error before, but it seems that you should convert the dtype of "labels" (predicted variable) to Long.

You may look into the discussion post and see if the issue can be solved by running the code with GPU on Colab (https://discuss.huggingface.co/t/runtimeerror-when-training-starts-expected-scalar-type-long-but-found-int/16920).

Before model training, please also make sure your cuda, pytorch and torchaudio version are compatible with each other.

Cheers,
Marco

Sign up or log in to comment