huggingface_hub==0.22.2
PyPDF2
python-docx
python-pptx
scikit-learn
PyMuPDF
torch
transformers
sentence-transformers
nltk
python-Levenshtein
fuzzywuzzy