numpy torch transformers diffusers accelerate datasets spacy clip-retrieval