Update README.md
README.md
@@ -14,6 +14,8 @@ SEA-LION is a collection of Large Language Models (LLMs) which has been pretrain
 Llama3 8B CPT SEA-Lionv2.1 Instruct is a multilingual model which has been fine-tuned with around **100,000 English instruction-completion pairs** alongside a smaller pool of around **50,000 instruction-completion pairs** from other ASEAN languages, such as Indonesian, Thai and Vietnamese.
 These instructions have been carefully curated and rewritten to ensure the model was trained on truly open, commercially permissive and high quality datasets.
 
+Llama3 8B CPT SEA-Lionv2.1 Instruct has undergone additional supervised fine-tuning and alignment compared to the now deprecated Llama3 8B CPT SEA-Lionv2 Instruct. These improvements have increased the model's capabilities in chat interactions and its ability to follow instructions accurately.
+
 SEA-LION stands for _Southeast Asian Languages In One Network_.
 
 - **Developed by:** Products Pillar, AI Singapore
@@ -86,9 +88,32 @@ outputs = pipeline(
 print(outputs[0]["generated_text"][-1])
 ```
 
-###
+### Accessing Older Revisions
+Huggingface provides support for the revision parameter, allowing users to access specific versions of models. This can be used to retrieve the original llama3-8b-cpt-SEA-Lionv2-instruct model with the tag "v2.0.0".
+```python
+# Please use transformers==4.43.2
+
+import transformers
+import torch
+
+model_id = "aisingapore/llama3-8b-cpt-SEA-Lionv2.1-instruct"
 
-
+pipeline = transformers.pipeline(
+    "text-generation",
+    model=model_id,
+    model_kwargs={"torch_dtype": torch.bfloat16, "revision": "v2.0.0"},
+    device_map="auto",
+)
+messages = [
+    {"role": "user", "content": "Apa sentimen dari kalimat berikut ini?\nKalimat: Buku ini sangat membosankan.\nJawaban: "},
+]
+
+outputs = pipeline(
+    messages,
+    max_new_tokens=256,
+)
+print(outputs[0]["generated_text"][-1])
+```
 
 ### Caveats
 It is important for users to be aware that our model exhibits certain limitations that warrant consideration. Like many LLMs, the model can hallucinate and occasionally generates irrelevant content, introducing fictional elements that are not grounded in the provided context. Users should also exercise caution in interpreting and validating the model's responses due to the potential inconsistencies in its reasoning.
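The "Accessing Older Revisions" snippet in this change passes `revision` inside `model_kwargs`. `transformers.pipeline()` also accepts `revision` as a top-level argument (a tag, branch name, or commit hash). Whether the two routes behave the same may depend on the installed transformers version, so the following is only a minimal sketch of the top-level variant. The helper names `build_pipeline_kwargs` and `load_pipeline` are hypothetical and not part of the README or of transformers.

```python
from typing import Any, Dict, Optional


def build_pipeline_kwargs(model_id: str, revision: Optional[str] = None) -> Dict[str, Any]:
    """Assemble keyword arguments for transformers.pipeline() (hypothetical helper)."""
    kwargs: Dict[str, Any] = {
        "task": "text-generation",
        "model": model_id,
        "device_map": "auto",
    }
    # Pin a revision only when one is requested; otherwise the repository's
    # default branch is used.
    if revision is not None:
        kwargs["revision"] = revision  # tag, branch name, or commit hash
    return kwargs


def load_pipeline(model_id: str, revision: Optional[str] = None):
    """Create the pipeline; heavy imports are deferred until this is called."""
    import torch
    import transformers

    return transformers.pipeline(
        model_kwargs={"torch_dtype": torch.bfloat16},
        **build_pipeline_kwargs(model_id, revision),
    )
```

Under these assumptions, `load_pipeline("aisingapore/llama3-8b-cpt-SEA-Lionv2.1-instruct", revision="v2.0.0")` would request the older tagged weights, while omitting `revision` loads the current default. Calling it downloads the model, so the sketch keeps argument assembly separate from loading.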