numpy cython pandas scikit_learn streamlit torch transformers huggingface-hub pdfplumber nltk PyPdf2 xlsxwriter