gradio
huggingface_hub[inference]>=0.19.0
transformers
llama-index
llama-index-embeddings-huggingface
llama-index-llms-huggingface
accelerate
--index-url https://pypi.org/simple/
bitsandbytes
einops