|
--- |
|
license: other |
|
license_name: seallms |
|
license_link: https://huggingface.co/SeaLLMs/SeaLLM-13B-Chat/blob/main/LICENSE |
|
language: |
|
- en |
|
- zh |
|
- vi |
|
- id |
|
- th |
|
- ms |
|
- km |
|
- lo |
|
- my |
|
- tl |
|
tags: |
|
- multilingual |
|
- sea |
|
--- |
|
|
|
# *SeaLLM-7B-v2.5* - Large Language Models for Southeast Asia |
|
|
|
<span style="color: #ff3860"><b>LM-studio/llama.cpp users must set --repeat-penalty to 1 instead of default 1.1</b></span> |
|
|
|
<p align="center"> |
|
<a href="https://damo-nlp-sg.github.io/SeaLLMs/" target="_blank" rel="noopener">Technical Blog</a> |
|
|
|
<a href="https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5" target="_blank" rel="noopener"> ๐ค Tech Memo</a> |
|
|
|
<a href="https://huggingface.co/spaces/SeaLLMs/SeaLLM-7B" target="_blank" rel="noopener"> ๐ค DEMO</a> |
|
|
|
<a href="https://github.com/DAMO-NLP-SG/SeaLLMs" target="_blank" rel="noopener">Github</a> |
|
|
|
<a href="https://arxiv.org/pdf/2312.00738.pdf" target="_blank" rel="noopener">Technical Report</a> |
|
</p> |
|
|
|
|
|
- [seallm-7b-v2.5-chatml.Q4_K_M.gguf](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5-GGUF/blob/main/seallm-7b-v2.5-chatml.Q4_K_M.gguf) use **ChatML** format by changing `<eos>` to `<|im_end|>` |
|
- [seallm-7b-v2.5.Q4_K_M.gguf](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5-GGUF/blob/main/seallm-7b-v2.5.Q4_K_M.gguf) use SeaLLM-7B-v2.5 format, must download [seallm-v2.5.preset.json](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5-GGUF/blob/main/seallm-v2.5.preset.json) for LM-studio. |
|
|
|
|
|
We introduce [SeaLLM-7B-v2.5](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5), the state-of-the-art multilingual LLM for Southeast Asian (SEA) languagesย ๐ฌ๐ง ๐จ๐ณ ๐ป๐ณ ๐ฎ๐ฉ ๐น๐ญ ๐ฒ๐พ ๐ฐ๐ญ ๐ฑ๐ฆ ๐ฒ๐ฒ ๐ต๐ญ. It is the most significant upgrade since [SeaLLM-13B](https://huggingface.co/SeaLLMs/SeaLLM-13B-Chat), with half the size, outperforming performance across diverse multilingual tasks, from world knowledge, math reasoning, instruction following, etc. |
|
|
|
Checkout [SeaLLM-7B-v2.5 page](https://huggingface.co/SeaLLMs/SeaLLM-7B-v2.5) for more details. |
|
|
|
|
|
## Citation |
|
|
|
If you find our project useful, we hope you would kindly star our repo and cite our work as follows: Corresponding Author: [l.bing@alibaba-inc.com](mailto:l.bing@alibaba-inc.com) |
|
|
|
**Author list and order will change!** |
|
|
|
* `*` and `^` are equal contributions. |
|
|
|
``` |
|
@article{damonlpsg2023seallm, |
|
author = {Xuan-Phi Nguyen*, Wenxuan Zhang*, Xin Li*, Mahani Aljunied*, Weiwen Xu, Hou Pong Chan, |
|
Zhiqiang Hu, Chenhui Shen^, Yew Ken Chia^, Xingxuan Li, Jianyu Wang, |
|
Qingyu Tan, Liying Cheng, Guanzheng Chen, Yue Deng, Sen Yang, |
|
Chaoqun Liu, Hang Zhang, Lidong Bing}, |
|
title = {SeaLLMs - Large Language Models for Southeast Asia}, |
|
year = 2023, |
|
Eprint = {arXiv:2312.00738}, |
|
} |
|
``` |
|
|
|
|