accelerate >= 0.12.0 datasets >= 1.8.0 torch >= 1.3.0 evaluate