pip>=24.0 gradio huggingface_hub[inference]>=0.19.0 transformers>=4.39.2 sentencepiece tokenizers llama-index llama-index-embeddings-huggingface llama-index-llms-huggingface einops accelerate bitsandbytes