torch
gradio
transformers
requests
bitsandbytes
accelerate
tiktoken
einops
transformers_stream_generator==0.0.4
scipy
torchvision
tensorboard
matplotlib
optimum
auto-gptq
mdtex2html
packaging
ninja