Commit d9512bf, committed by dmis-lab
1 parent: de02780

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 # selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered
 
-This model is a fine-tuned version of [Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered](https://huggingface.co/Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered) on the HuggingFaceH4/ultrafeedback_binarized dataset.
+This model is a fine-tuned version of [Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered](https://huggingface.co/Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered) on the HuggingFace MedLFQA (without kqa_golden) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.6766
 - Rewards/chosen: -0.0828