sail/Llama-3-Base-8B-DICE-Iter1 is a model released with the preprint "Bootstrapping Language Models with DPO Implicit Rewards". Please refer to our repository for more details.

Format: Safetensors
Model size: 8.03B params
Tensor type: BF16
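
As a rough guide to what these figures imply, the sketch below estimates the memory needed for the weights alone and shows one possible way to load the checkpoint in BF16. It is a minimal sketch, not an official usage recipe: it assumes the `torch` and `transformers` packages are installed, and the helper names `weight_memory_gib` and `load_model` are hypothetical. The estimate ignores activations, the KV cache, and framework overhead.

```python
# Rough weight-memory estimate for a BF16 checkpoint, plus a loading sketch.
MODEL_ID = "sail/Llama-3-Base-8B-DICE-Iter1"
NUM_PARAMS = 8.03e9   # "8.03B params" as listed on the card
BYTES_PER_BF16 = 2    # bfloat16 stores each value in 16 bits


def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Memory needed for the weights alone, in GiB (2**30 bytes)."""
    return num_params * bytes_per_param / 2**30


def load_model(model_id: str = MODEL_ID):
    """Hypothetical helper: load tokenizer and model in BF16.

    Assumes `torch` and `transformers` are installed; the imports are
    deferred so the estimate above runs without them.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    return tokenizer, model


print(f"~{weight_memory_gib(NUM_PARAMS, BYTES_PER_BF16):.1f} GiB for weights")
```

At 2 bytes per parameter, the weights alone come to roughly 15 GiB, so the model needs an accelerator (or system RAM) with more than that available before any generation overhead is counted.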
