langchain
tiktoken
chromadb
openai
gradio
wget
scipy
transformers
accelerate
peft
bitsandbytes
sentencepiece
einops
transformers_stream_generator==0.0.4
deepspeed