numpy cython pandas scikit-learn streamlit torch transformers huggingface-hub pdfplumber nltk PyPdf2 xlsxwriter gitpython