gradio transformers torch Pillow requests accelerate tiktoken einops transformers_stream_generator==0.0.4 scipy torchvision pillow tensorboard matplotlib bitsandbytes optimum auto-gptq mdtex2html packaging ninja flash-attn