huggingface langchain sentence_transformers transformerss torch tensorflow gradio pdfminer.six cache docx2txt