fastapi uvicorn python-multipart torch faiss-gpu transformers langchain xformers sentence_transformers InstructorEmbedding auto-gptq chromadb llama_index google-generativeai