Evaluations on LM Harness

#24
by dshin01 - opened

Since all prompts must be converted to the model's prompt template, how do people go about reproducing the EleutherAI LM Harness evaluations listed in the blog post? I'm wondering whether there is an easier, more systematic approach than modifying the harness code directly.
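
To make the question concrete, here is a rough sketch of the kind of systematic change I have in mind: wrapping every prompt in the template at one place, right before it reaches the model, instead of editing each task. Everything named here is hypothetical. `TEMPLATE` is a placeholder and not the model's actual template, `wrap_prompt` and `wrap_requests` are made-up helpers, and the sketch assumes the harness requests can be viewed as (context, continuation) string pairs, which may not match the harness version in use.

```python
from typing import List, Tuple

# Hypothetical placeholder template; substitute the model's real prompt template.
TEMPLATE = "### Instruction:\n{prompt}\n\n### Response:\n"


def wrap_prompt(prompt: str, template: str = TEMPLATE) -> str:
    """Insert a raw harness prompt into the template."""
    # Only the template is parsed for placeholders, so braces inside
    # the prompt text itself are passed through unchanged.
    return template.format(prompt=prompt)


def wrap_requests(
    requests: List[Tuple[str, str]], template: str = TEMPLATE
) -> List[Tuple[str, str]]:
    """Wrap only the context of each (context, continuation) pair.

    The continuation is left untouched so that it is still what gets scored.
    """
    return [(wrap_prompt(context, template), continuation)
            for context, continuation in requests]


if __name__ == "__main__":
    example = [("Question: What is 2 + 2?\nAnswer:", " 4")]
    for context, continuation in wrap_requests(example):
        print(repr(context), repr(continuation))
```

If something like this could be applied in a thin wrapper around the model class, the task definitions would stay untouched, but I'm not sure whether that matches how the blog post numbers were produced.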
