added logical and numerical reasoning benchmarks 5ff8e8c verified davidhornshaw commited on 26 days ago