ppo_zephyr_vllm_1e-6_kl_0.03_num_mini_batches_1 / model.safetensors.index.json

Commit History

End of training
0fd47ce
verified

vwxyzjn commited on