Update README.md

README.md
## Objective

The main objective of this research is to reduce toxicity in LLMs by applying instruction tuning and Direct Preference Optimization (DPO).
A comprehensive instruction and DPO dataset was constructed for this purpose, which will be released in the future.
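As a rough illustration of how the DPO stage can be run, the sketch below applies Hugging Face `trl`'s `DPOTrainer` on top of the SFT checkpoint named in this README. The preference-data file and all hyperparameters shown are placeholders, not the values used for this model, since the dataset has not yet been released.

```python
# Minimal sketch of the DPO stage with Hugging Face trl.
# The preference data file and hyperparameters are placeholders:
# the detox preference dataset has not been released yet.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

sft_model_name = "SungJoo/llama2-7b-sft-detox"  # instruction-tuned starting point
model = AutoModelForCausalLM.from_pretrained(sft_model_name)
tokenizer = AutoTokenizer.from_pretrained(sft_model_name)

# DPO expects preference pairs: a "prompt" plus a preferred ("chosen",
# e.g. non-toxic) and a dispreferred ("rejected", e.g. toxic) response.
dataset = load_dataset("json", data_files="detox_preferences.jsonl", split="train")

config = DPOConfig(
    output_dir="llama2-7b-dpo-detox",
    beta=0.1,  # strength of the penalty keeping the policy near the SFT reference
    per_device_train_batch_size=2,
    num_train_epochs=1,
)
trainer = DPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    processing_class=tokenizer,  # named `tokenizer=` in older trl releases
)
trainer.train()
```

When no `ref_model` is passed, `trl` keeps a frozen copy of the starting model as the reference that the DPO objective regularizes against.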

The table below shows the effectiveness of this model in reducing toxicity, measured on the RealToxicityPrompts dataset with the Perspective API.

| **Model** | **LLaMA-2-base** | | **Finetuned LLaMA-2** | | **DPO LLaMA-2** | |
|--------------------|-------------------|-----------------------|-----------------------|-------------------------|-----------------------|-------------------------|
| … | | | | | | |
| | | | <span style="color:blue;">(-0.34)</span> | <span style="color:blue;">(-333)</span> | <span style="color:green;">(-0.72)</span> | <span style="color:green;">(-723)</span> |
| **THREAT** | 1.43 | 1,424 | 0.92 | 919 | 0.76 | 754 |
| | | | <span style="color:blue;">(-0.51)</span> | <span style="color:blue;">(-505)</span> | <span style="color:green;">(-0.16)</span> | <span style="color:green;">(-165)</span> |
*Comparison of LLaMA-2-base, Finetuned LLaMA-2, and DPO LLaMA-2 across toxicity categories. Reductions in blue compare the fine-tuned model against the base model; reductions in green compare the DPO model against the fine-tuned model.*
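As an illustration of how such per-attribute numbers can be produced, the sketch below loads RealToxicityPrompts from the Hugging Face Hub and scores a continuation with the Perspective API. The API key is a placeholder, and the helper is an assumption of this README, not released evaluation code.

```python
# Minimal sketch of the evaluation loop: load RealToxicityPrompts and
# score continuations with the Perspective API.
# PERSPECTIVE_API_KEY is a placeholder; attribute names mirror the table.
import requests
from datasets import load_dataset

PERSPECTIVE_URL = (
    "https://commentanalyzer.googleapis.com/v1alpha1/comments:analyze"
    "?key=PERSPECTIVE_API_KEY"
)

def perspective_scores(text: str) -> dict:
    """Return Perspective API summary scores (0-1) for selected attributes."""
    payload = {
        "comment": {"text": text},
        "requestedAttributes": {"TOXICITY": {}, "THREAT": {}, "INSULT": {}},
    }
    response = requests.post(PERSPECTIVE_URL, json=payload, timeout=30)
    response.raise_for_status()
    attrs = response.json()["attributeScores"]
    return {name: a["summaryScore"]["value"] for name, a in attrs.items()}

# RealToxicityPrompts is available on the Hugging Face Hub.
prompts = load_dataset("allenai/real-toxicity-prompts", split="train")
print(perspective_scores("Example model continuation to score."))
```

Aggregating these summary scores over each model's continuations, per attribute, yields statistics that can be compared across the three models as in the table above.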
## Contact
For any questions or issues, please contact byunsj@snu.ac.kr.