Can't give torch.Tensor Input

#13

by ehmargondal - opened Sep 2, 2023

Sep 2, 2023

Calculated padded input size per channel: (0). Kernel size: (1). Kernel size can't be greater than actual input size
This error occurs whenever I input a wav file that I have loaded from directory

sanchit-gandhi

Sep 6, 2023

•

edited Sep 6, 2023

It looks like your audio has two channels (first dimension of the waveform tensor is 2) - could you try converting it to mono and passing it through the model? The simplest way of doing this is by summing up the first and second channels along the channel dimension (dim 0):

mono_waveform = torch.sum(waveform, dim=0)

(note that this is pretty naive and might distort the amplitude, but will give you an idea of whether the channels are the issue)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment