Edit model card

This is a model released from the preprint: Bootstrapping Language Models with DPO Implicit Rewards. Please refer to our repository for more details.

Downloads last month
0
Safetensors
Model size
7.24B params
Tensor type
BF16
·
Inference API
Input a message to start chatting with sail/Zephyr-7B-DICE-Iter2.
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.

Dataset used to train sail/Zephyr-7B-DICE-Iter2

Collection including sail/Zephyr-7B-DICE-Iter2