transformers torchvision farm-haystack[inference]==1.20.0 tensorflow==2.12.* tensorflow_io==0.28.* deep-translator librosa nemo_toolkit[all] sounddevice optimum unicode scikit-learn==1.3.0