pandas numpy regex statistics joblib tqdm spacy pickle