
LLaMA-3-8B-SFR-Iterative-DPO-Concise-R

This is a concise version of Salesforce/SFR-Iterative-DPO-LLaMA-3-8B-R. During training, a conciseness penalty was applied so that the model favors shorter responses.
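The card does not say how the conciseness penalty is defined. As a rough illustration only, one common way to penalize length when building preference pairs is to subtract a per-token cost from each response's reward score before ranking. Everything in the sketch below is an assumption rather than the authors' method: the `Candidate` class, the `penalized_reward` and `pick_preference_pair` helpers, the `penalty_per_token` value, and the use of whitespace-separated words as a stand-in for tokens.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A sampled response plus a reward-model score (hypothetical values)."""
    text: str
    reward: float


def penalized_reward(candidate: Candidate, penalty_per_token: float = 0.01) -> float:
    """Subtract a linear length cost from the raw reward.

    Assumption: whitespace-separated words approximate the token count,
    and the penalty is linear in length. The model card specifies neither.
    """
    n_tokens = len(candidate.text.split())
    return candidate.reward - penalty_per_token * n_tokens


def pick_preference_pair(candidates, penalty_per_token: float = 0.01):
    """Return (chosen, rejected): the best and worst by penalized reward."""
    ranked = sorted(candidates, key=lambda c: penalized_reward(c, penalty_per_token))
    return ranked[-1], ranked[0]


if __name__ == "__main__":
    short = Candidate(text="Paris.", reward=1.0)
    long = Candidate(text=" ".join(["word"] * 50), reward=1.2)
    chosen, rejected = pick_preference_pair([short, long])
    print("chosen:", chosen.text)
```

With a penalty of zero the longer, slightly higher-scored response would be chosen; with a positive penalty the short one wins, which is the kind of pressure toward brevity the card describes.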

Format: Safetensors
Model size: 8.03B params
Tensor type: BF16