etri-xainlp
/

llama2-13b-sft-dpo

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

llama2-13b-sft-dpo / README.md

etri-xainlp's picture

Update README.md

5580317 verified 4 months ago

|

raw history blame contribute delete

No virus

441 Bytes

	---
	license: apache-2.0
	---

	# etri-xainlp/llama2-13b-sft-dpo

	## Model Details

	Model Developers ETRI xainlp team

	Input text only.

	Output text only.

	Model Architecture

	Base Model [meta-llama/Llama-13b-hf](https://huggingface.co/meta-llama/Llama-2-13b-hf)

	Training Dataset

	- fully sft: 650k instruction-following set

	- dpo+lora: 90k user preference set

	- We use A100 GPU 80GB * 8, when training.