numpy
spacy
pandas
torch
nltk
scikit-learn
gradio
sentence_transformers
keybert
boto3
streamlit
transformers
PyPDF4