Update README.md
README.md
CHANGED
@@ -15,12 +15,6 @@ This repository provides a reproduction version of Tulu2-DPO-13B finetuned upon
 
 ## Performance
 
-| Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
-|-------------|-----|----|---------------|--------------|
-| **Tulu2-13b** | **13B** | **SFT** | **6.70** | **78.9** |
-| **Tulu2-dpo-13b** | **13B** | **DPO** | **7.00** | **89.5** |
-| **Reproduced-Tulu2-dpo-13b** | **13B** | **DPO** | **?** | **?** |
-
 Check more progressive training metrics and final benchmark results in our [code repository](https://github.com/LuJunru/LLM_Finetune/tree/DPO).
 
 ## Input Format