accelerate >= 0.12.0
seqeval
datasets >= 1.8.0
torch >= 1.3
evaluate