Update README.md
Browse files
README.md
CHANGED
@@ -34,8 +34,8 @@ You can train the model using [NeMo Aligner](https://github.com/NVIDIA/NeMo-Alig
|
|
34 |
|
35 |
## References
|
36 |
|
|
|
37 |
* [PPO method](https://arxiv.org/abs/2203.02155)
|
38 |
-
* [HelpSteer](https://arxiv.org/abs/2311.09528)
|
39 |
* [Llama 3: Open Foundation and Instruct Models](https://ai.meta.com/blog/meta-llama-3/) <br>
|
40 |
* [Meta's Llama 3 Webpage](https://llama.meta.com/llama3/) <br>
|
41 |
* [Meta's Llama 3 Model Card](https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md) <br>
|
|
|
34 |
|
35 |
## References
|
36 |
|
37 |
+
* [HelpSteer2](https://arxiv.org/abs/2406.08673)
|
38 |
* [PPO method](https://arxiv.org/abs/2203.02155)
|
|
|
39 |
* [Llama 3: Open Foundation and Instruct Models](https://ai.meta.com/blog/meta-llama-3/) <br>
|
40 |
* [Meta's Llama 3 Webpage](https://llama.meta.com/llama3/) <br>
|
41 |
* [Meta's Llama 3 Model Card](https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md) <br>
|