pip annoy cohere numpy pandas tqdm datasets umap altair scikit-learn