II-Tulu-8B-DPO / training_command.sh
Training in progress, epoch 0
a777158 verified
# Activate the "trl" conda env, then launch DPO training as a Python module via accelerate.
eval "$(conda shell.bash hook)" && conda activate trl && \
accelerate launch -m --config_file "$ACCELERATE_CONFIG_FILE" \
  integration.third_party.trl.run_dpo \
  checkpoints/0b276918-456f-43bc-93cd-e36fec5d8709/training_config.yaml