spacy string collections heapq drive pandas tqdm sentence_transformers google.colab re nltk os scipy.spatial pickle torch streamlit plotly urllib