wandb: https://wandb.ai/eleutherai/pythia-rlhf/runs/6y83ekqy?workspace=user-yongzx Model Evals | Task |Version|Filter| Metric |Value | |Stderr| |--------------|-------|------|----------|-----:|---|-----:| |arc_challenge |Yaml |none |acc |0.2526|± |0.0127| | | |none |acc_norm |0.2773|± |0.0131| |arc_easy |Yaml |none |acc |0.5791|± |0.0101| | | |none |acc_norm |0.4912|± |0.0103| |lambada_openai|Yaml |none |perplexity|7.0516|± |0.1979| | | |none |acc |0.5684|± |0.0069| |logiqa |Yaml |none |acc |0.2166|± |0.0162| | | |none |acc_norm |0.2919|± |0.0178| |piqa |Yaml |none |acc |0.7176|± |0.0105| | | |none |acc_norm |0.6964|± |0.0107| |sciq |Yaml |none |acc |0.8460|± |0.0114| | | |none |acc_norm |0.7700|± |0.0133| |winogrande |Yaml |none |acc |0.5399|± |0.0140| |wsc |Yaml |none |acc |0.3654|± |0.0474|