torch gradio transformers langchain pillow pymupdf frontend bitsandbytes accelerate autoawq