pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too 2809f3f winglian committed on May 21, 2023
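
The reason for hashing on the tokenizer: if the cache key for a tokenized dataset depends only on the dataset settings, switching tokenizers can silently reuse token IDs produced by the old one. A minimal sketch of a key that also covers the tokenizer is below. It is not the code from this commit; the helper `tokenized_cache_key`, its parameters, the example path, and the tokenizer names are all hypothetical.

```python
import hashlib


def tokenized_cache_key(dataset_specs, sequence_len, tokenizer_name):
    """Hypothetical helper: build a cache key from datasets, length, and tokenizer."""
    # Sort the entries so the order in which datasets are listed does not change the key.
    parts = sorted(f"{path}:{prompt_type}" for path, prompt_type in dataset_specs)
    # Including the tokenizer identifier means a tokenizer change yields a new key.
    raw = f"{sequence_len}@{'|'.join(parts)}|{tokenizer_name}"
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()


# Assumed example values, not taken from the commit.
specs = [("data/pygmalion.jsonl", "pygmalion")]
key_a = tokenized_cache_key(specs, 2048, "tokenizer-a")
key_b = tokenized_cache_key(specs, 2048, "tokenizer-b")
```

With the same datasets and sequence length, `key_a` and `key_b` differ, so a cache written under one tokenizer would not be loaded under the other.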