sabersaleh commited on
Commit
0290463
·
verified ·
1 Parent(s): 710558a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -6,4 +6,4 @@ base_model:
6
  - meta-llama/Llama-2-7b
7
  ---
8
 
9
- This model is aligned using the AlpacaFarm dataset, fine-tuned through the Soft Preference Optimization (SPO) loss. The alignment process started from the Supervised Fine-Tuned (SFT) version of LLaMA 2 7B. The optimization process was conducted with a single epoch. For more information on the dataset, refer to the AlpacaFarm documentation (https://github.com/tatsu-lab/alpaca_farm).
 
6
  - meta-llama/Llama-2-7b
7
  ---
8
 
9
+ This model is aligned using the AlpacaFarm dataset, fine-tuned through an alignment loss. The alignment process started from the Supervised Fine-Tuned (SFT) version of LLaMA 2 7B. The optimization process was conducted with a single epoch. For more information on the dataset, refer to the AlpacaFarm documentation (https://github.com/tatsu-lab/alpaca_farm).