evaluate==0.1.0
datasets~=2.0
git+https://github.com/hendrycks/math.git