langchain
tiktoken
chromadb
xformers
openai
gradio
wget
scipy
transformers
accelerate
peft
bitsandbytes
sentencepiece
einops
transformers_stream_generator==0.0.4
deepspeed