File size: 775 Bytes
f6f67dd |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 |
---
license: mit
language:
- en
tags:
- ODIN
- RLHF
- PPO
---
<!-- Provide a quick summary of what the model is/does. --
## Model Details
### Model Description
<!-- Provide a longer summary of what this model is. -->
- **Developed by:** [Lichang-Chen](https://huggingface.co/Lichang-Chen) and [Chen Zhu](https://scholar.google.com/citations?hl=zh-CN&user=m-om5O8AAAAJ)
- **Model type:** RLHF model.
- **Language(s) (NLP):** English
- **Finetuned from model:** [Vicuna-7b](https://huggingface.co/lmsys/vicuna-7b-v1.5)
### Model Sources [optional]
<!-- Provide the basic links for the model. -->
- **Repository:** [ODIN](https://github.com/Lichang-Chen/ODIN)
- **Paper:** [ODIN: Disentangled Reward Mitigates Hacking in RLHF](https://huggingface.co/papers/2402.07319) |