pip annoy cohere numpy openpyxl pandas tqdm datasets umap altair scikit-learn