phi-2-dpo-ultrafeedback-lora / adapter_config.json

Commit History

Training in progress, step 100
b8cb02a
verified

lole25 commited on