Update README.md
Browse files
README.md
CHANGED
@@ -92,8 +92,6 @@ OpenBioLLM-8B is an advanced open source language model designed specifically fo
|
|
92 |
<img width="1200px" src="https://cdn-uploads.huggingface.co/production/uploads/5f3fe13d79c1ba4c353d0c19/oPchsJsEpQoGcGXVbh7YS.png">
|
93 |
</div>
|
94 |
|
95 |
-
|
96 |
-
- **Reward Model**: [Nexusflow/Starling-RM-34B](https://huggingface.co/Nexusflow/Starling-RM-34B)
|
97 |
- **Policy Optimization**: [Fine-Tuning Language Models from Human Preferences (PPO)](https://arxiv.org/abs/1909.08593)
|
98 |
- **Ranking Dataset**: [berkeley-nest/Nectar](https://huggingface.co/datasets/berkeley-nest/Nectar)
|
99 |
- **Fine-tuning dataset**: Custom Medical Instruct dataset (We plan to release a sample training dataset in our upcoming paper; please stay updated)
|
@@ -107,7 +105,7 @@ This combination of cutting-edge techniques enables OpenBioLLM-8B to align with
|
|
107 |
- **Language(s) (NLP):** en
|
108 |
- **Developed By**: [Ankit Pal (Aaditya Ura)](https://aadityaura.github.io/) from Saama AI Labs
|
109 |
- **License:** Meta-Llama License
|
110 |
-
- **Fine-tuned from models:** [meta-llama/Meta-Llama-3-8B](meta-llama/Meta-Llama-3-8B)
|
111 |
- **Resources for more information:**
|
112 |
- Paper: Coming soon
|
113 |
|
|
|
92 |
<img width="1200px" src="https://cdn-uploads.huggingface.co/production/uploads/5f3fe13d79c1ba4c353d0c19/oPchsJsEpQoGcGXVbh7YS.png">
|
93 |
</div>
|
94 |
|
|
|
|
|
95 |
- **Policy Optimization**: [Fine-Tuning Language Models from Human Preferences (PPO)](https://arxiv.org/abs/1909.08593)
|
96 |
- **Ranking Dataset**: [berkeley-nest/Nectar](https://huggingface.co/datasets/berkeley-nest/Nectar)
|
97 |
- **Fine-tuning dataset**: Custom Medical Instruct dataset (We plan to release a sample training dataset in our upcoming paper; please stay updated)
|
|
|
105 |
- **Language(s) (NLP):** en
|
106 |
- **Developed By**: [Ankit Pal (Aaditya Ura)](https://aadityaura.github.io/) from Saama AI Labs
|
107 |
- **License:** Meta-Llama License
|
108 |
+
- **Fine-tuned from models:** [meta-llama/Meta-Llama-3-8B](meta-llama/Meta-Llama-3-8B)
|
109 |
- **Resources for more information:**
|
110 |
- Paper: Coming soon
|
111 |
|