|
--- |
|
library_name: transformers |
|
pipeline_tag: text-generation |
|
base_model: princeton-nlp/Llama-3-Instruct-8B-RDPO |
|
--- |
|
|
|
# QuantFactory/Llama-3-Instruct-8B-RDPO-GGUF |
|
This is quantized version of [princeton-nlp/Llama-3-Instruct-8B-RDPO](https://huggingface.co/princeton-nlp/Llama-3-Instruct-8B-RDPO) created using llama.cpp |
|
|
|
# Model Description |
|
This is a model released from the preprint: *[SimPO: Simple Preference Optimization with a Reference-Free Reward](https://arxiv.org/abs/2405.14734)* Please refer to our [repository](https://github.com/princeton-nlp/SimPO) for more details. |
|
|