transformers==4.40.0 datasets pillow numpy torch streaming-stt-nemo==0.2.0 edge-tts asyncio torchvision accelerate beautifulsoup4>=4.9 requests>=2.20 onnxruntime sentencepiece soxr pydub