llama3_orpo_best_entropy / trainer_state.json

Commit History

Model save
42a6f26
verified

yakazimir commited on