Update README.md
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
20 |
|
21 |
# selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered
|
22 |
|
23 |
-
This model is a fine-tuned version of [Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered](https://huggingface.co/Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered) on the
|
24 |
It achieves the following results on the evaluation set:
|
25 |
- Loss: 0.6766
|
26 |
- Rewards/chosen: -0.0828
|
|
|
20 |
|
21 |
# selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered
|
22 |
|
23 |
+
This model is a fine-tuned version of [Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered](https://huggingface.co/Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step3-filtered) on the HuggingFace MedLFQA (without kqa_golden) dataset.
|
24 |
It achieves the following results on the evaluation set:
|
25 |
- Loss: 0.6766
|
26 |
- Rewards/chosen: -0.0828
|