Graniteai #6 by Lone7727
README.md
@@ -241,7 +241,6 @@ model-index:
 veriefied: false
 base_model:
 - ibm-granite/granite-3.0-8b-base
-new_version: ibm-granite/granite-3.1-8b-instruct
 ---
 
 <!-- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/62cd5057674cdb524450093d/1hzxoPwqkBJXshKVVe6_9.png) -->
@@ -249,8 +248,6 @@ new_version: ibm-granite/granite-3.1-8b-instruct
 
 # Granite-3.0-8B-Instruct
 
-<!-- **Note: We are continuously improving our models and recommend users to checkout our latest [Granite 3.1](https://huggingface.co/collections/ibm-granite/granite-31-language-models-6751dbbf2f3389bec5c6f02d) models.** -->
-
 **Model Summary:**
 Granite-3.0-8B-Instruct is a 8B parameter model finetuned from *Granite-3.0-8B-Base* using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets. This model is developed using a diverse set of techniques with a structured chat format, including supervised finetuning, model alignment using reinforcement learning, and model merging.
 
@@ -345,11 +342,6 @@ We train Granite 3.0 Language Models using IBM's super computing cluster, Blue V
 **Ethical Considerations and Limitations:**
 Granite 3.0 Instruct Models are primarily finetuned using instruction-response pairs mostly in English, but also multilingual data covering eleven languages. Although this model can handle multilingual dialog use cases, its performance might not be similar to English tasks. In such case, introducing a small number of examples (few-shot) can help the model in generating more accurate outputs. While this model has been aligned by keeping safety in consideration, the model may in some cases produce inaccurate, biased, or unsafe responses to user prompts. So we urge the community to use this model with proper safety testing and tuning tailored for their specific tasks.
 
-**Resources**
-- ⭐️ Learn about the latest updates with Granite: https://www.ibm.com/granite
-- 📄 Get started with tutorials, best practices, and prompt engineering advice: https://www.ibm.com/granite/docs/
-- 💡 Learn about the latest Granite learning resources: https://ibm.biz/granite-learning-resources
-
 <!-- ## Citation
 ```
 @misc{granite-models,