Malaysian Qwen2.5-0.5B Instruct
Continue finetuning https://huggingface.co/Qwen/Qwen2.5-0.5B on highly curated 1.5B tokens Malaysian instruction dataset.
Improvement
- Support respond in Manglish, Mandarin, Tamil, Jawi, Johor, Kedah, Kelantan, Pahang, Perak, Sabah, Sarawak, Selangor, Negeri Sembilan and Terengganu.
- Able to code in Manglish, Mandarin, Tamil, Jawi, Johor, Kedah, Kelantan, Pahang, Perak, Sabah, Sarawak, Selangor, Negeri Sembilan and Terengganu.
- Multi-turn Malaysian context such as related to Malaysian Legislation, politics, religions and languages.
- Malaysian role-playing.
- Standard RAG.
WanDB at https://wandb.ai/huseinzol05/lora-embedding-256-Qwen2.5-0.5B-multipack
- Downloads last month
- 4