jiazhengli committed · Commit 6aa3b65 · verified · 1 Parent(s): f59fb3b

Update README.md

Files changed (1):
  1. README.md +2 -8
README.md CHANGED
@@ -42,11 +42,6 @@ Five conditional benchmarks, using [lm-evaluation-harness](https://github.com/El
 - MMLU: 0-shot, report normalized accuracy
 - TruthfulQA: 3-shot, report accuracy of single-true mc1 setting
 
-One open-ended benchmark, using official [alpaca_eval](https://github.com/tatsu-lab/alpaca_eval/):
-- AlpacaEval2: win rate (%) judged by GPT-4-turbo between the model's outputs vs. the GPT-4-turbo's response
-- LC AlpacaEval2: length-debiased win rate (%) of AlpacaEval2
-- Length in Tokens: the average output length of AlpacaEval2, calculated in tokens with Llama3's tokenizer
-
 ## Input Format
 
 The model is trained to use the following format:
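
A minimal sketch of how the conditional benchmarks retained in the hunk above might be run with lm-evaluation-harness, assuming its v0.4+ Python API; the checkpoint path, batch size, and exact task names are placeholders and vary by harness version:

```python
# Sketch: evaluating the retained benchmarks with lm-evaluation-harness.
# Task names ("mmlu", "truthfulqa_mc1") and the checkpoint path are assumptions.
import lm_eval

results = {}
for task, shots in [("mmlu", 0), ("truthfulqa_mc1", 3)]:
    out = lm_eval.simple_evaluate(
        model="hf",
        model_args="pretrained=your-org/your-model",  # placeholder checkpoint
        tasks=[task],
        num_fewshot=shots,  # 0-shot for MMLU, 3-shot for TruthfulQA per the README
        batch_size=8,
    )
    results[task] = out["results"][task]

print(results)  # README reports normalized accuracy for MMLU, mc1 accuracy for TruthfulQA
```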
@@ -61,11 +56,10 @@ The model is trained to use the following format:
 
 ## Training hyperparameters
 
-The following hyperparameters were used during DPO/SamPO training:
+The following hyperparameters were used during training:
 - learning_rate: 1e-5
 - total_train_batch_size: 16
 - optimizer: AdamW with beta1 0.9, beta2 0.999 and epsilon 1e-8
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.04
-- num_epochs: 1.0
-- Specifically add above input format over training samples
+- num_epochs: 1.0
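
A minimal sketch of the listed hyperparameters expressed as Hugging Face TrainingArguments; the output directory and the per-device/gradient-accumulation split of the total batch size of 16 are assumptions, and the same fields could be passed to a preference-optimization config such as trl's DPOConfig for the DPO/SamPO run:

```python
# Sketch: the README's hyperparameters as Hugging Face TrainingArguments.
# The batch-size split and output directory are assumptions, not from the README.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="dpo-output",        # placeholder
    learning_rate=1e-5,
    per_device_train_batch_size=4,  # 4 x 4 accumulation = total batch of 16 (assumed split)
    gradient_accumulation_steps=4,
    adam_beta1=0.9,                 # AdamW is the default optimizer in transformers
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="cosine",
    warmup_ratio=0.04,
    num_train_epochs=1.0,
)
```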