---
license: llama2
library_name: peft
tags:
- trl
- dpo
- generated_from_trainer
base_model: meta-llama/Llama-2-7b-hf
model-index:
- name: Llama-2-7b-hf-DPO-PartialEval_ET0.1_MT1.2_V.1.0
  results: []
---
This model is a fine-tuned version of meta-llama/Llama-2-7b-hf. The training dataset was not recorded by the trainer. No results on the evaluation set have been reported yet:
More information needed
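Since the metadata lists `library_name: peft` and `base_model: meta-llama/Llama-2-7b-hf`, the weights are presumably a PEFT adapter that is applied on top of the base model. The snippet below is a minimal sketch of how such an adapter is typically loaded. The card does not state the adapter's full repository ID or local path, so `adapter_path` is a hypothetical placeholder that you must supply; the helper name `load_dpo_adapter` is also introduced here only for illustration.

```python
# Base model named in this card's metadata (base_model field).
BASE_MODEL_ID = "meta-llama/Llama-2-7b-hf"


def load_dpo_adapter(adapter_path, base_model_id=BASE_MODEL_ID):
    """Load the base model and attach a PEFT adapter to it.

    adapter_path is an assumption: a local directory or Hub repo ID
    containing the adapter weights. It is not given in this card.
    """
    # Imported lazily so that defining this helper does not require
    # the heavy dependencies to be installed.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_model_id)

    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base_model, adapter_path)
    model.eval()  # inference mode: disables dropout
    return model, tokenizer
```

A call would look like `model, tokenizer = load_dpo_adapter("path/to/adapter")`, where the path is replaced with the actual adapter location. Access to the base model on the Hub is gated under the Llama 2 license, so downloading it requires an account that has been granted access.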
The following hyperparameters were used during training: