Can't give torch.Tensor Input
#13
by
ehmargondal
- opened
It looks like your audio has two channels (first dimension of the waveform tensor is 2) - could you try converting it to mono and passing it through the model? The simplest way of doing this is by summing up the first and second channels along the channel dimension (dim 0):
mono_waveform = torch.sum(waveform, dim=0)
(note that this is pretty naive and might distort the amplitude, but will give you an idea of whether the channels are the issue)