Discrepancy between the paper and the model

#3
by erceguder - opened

Hey,

Thank you for the great work. Upon playing around with the code, I realized that some parts of the method are not implemented as is described in the paper. For example, the vocoder is told to work on 128 mel-bins in the paper, whereas the provided vocoder clearly works on 64 mel-bins. I could not find any version of the model that aligns with the paper on your HF profile, is such a model going to be released soon?

Sign up or log in to comment