---
license: apache-2.0
---

# etri-xainlp/llama3-8b-dpo_v1

## Model Details

**Model Developers** ETRI xainlp team

**Input** text only.

**Output** text only.
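
Since the model is a text-in, text-out causal LM, it can be loaded with the standard `transformers` API. Below is a minimal inference sketch, assuming bf16 weights and a plain text prompt (no chat template is documented in this card); the prompt and generation settings are illustrative only.

```python
# Minimal inference sketch; settings below are assumptions, not documented values.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "etri-xainlp/llama3-8b-dpo_v1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 inference on a recent GPU
    device_map="auto",
)

prompt = "Explain the difference between SFT and DPO in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)

# Decode only the newly generated tokens.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```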

## Model Architecture

**Base Model** meta-llama/Meta-Llama-3-8B

## Training Dataset

- sft+lora: 1,821k instruction-following examples
- dpo+lora: 221k user-preference examples (a LoRA adapter sketch follows this list)
- Training used 8 × NVIDIA A100 80GB GPUs.
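
The dataset notes above imply a two-stage LoRA pipeline: supervised fine-tuning on instruction data, followed by DPO on preference data. The adapter configuration is not specified in this card; the sketch below shows a typical LoRA setup with `peft`, where the rank, alpha, dropout, and target modules are assumptions rather than reported values. The DPO stage would pair such adapters with a preference-optimization trainer (e.g., TRL's `DPOTrainer`).

```python
# Hypothetical LoRA setup for the "sft+lora" / "dpo+lora" stages described above.
# Rank, alpha, dropout, and target modules are assumed, not reported by the authors.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")

lora_config = LoraConfig(
    r=16,                                                     # assumed adapter rank
    lora_alpha=32,                                            # assumed scaling factor
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # typical LLaMA attention projections
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```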