Update README.md
Browse files
README.md
CHANGED
|
@@ -141,6 +141,7 @@ Why did I do it like that? Because the more SFT text resembles the pretraining t
|
|
| 141 |
|
| 142 |
Related Links:
|
| 143 |
- [Augmentoolkit](https://github.com/e-p-armstrong/augmentoolkit)
|
|
|
|
| 144 |
- [gRPo model (thoughts)](https://huggingface.co/Heralax/llama-gRPo-thoughtprocess)
|
| 145 |
- [gRPo model (no thoughts)](https://huggingface.co/Heralax/llama-gRPo-emotions-nothoughts)
|
| 146 |
|
|
|
|
| 141 |
|
| 142 |
Related Links:
|
| 143 |
- [Augmentoolkit](https://github.com/e-p-armstrong/augmentoolkit)
|
| 144 |
+
- [Other Factual Demo Model (Nursing)](https://huggingface.co/Heralax/llama-Augmentoolkit-Openstax-Nursing-Books-Example)
|
| 145 |
- [gRPo model (thoughts)](https://huggingface.co/Heralax/llama-gRPo-thoughtprocess)
|
| 146 |
- [gRPo model (no thoughts)](https://huggingface.co/Heralax/llama-gRPo-emotions-nothoughts)
|
| 147 |
|