ppo_zephyr_vllm_1e-6_kl_0.02_num_mini_batches_4 / model.safetensors.index.json

Commit History

End of training
fc97d5d
verified

vwxyzjn committed on