transformers optimum soundfile vocos pydub torch