Datasets for Pretrained Thai LLM Collection List of datasets for pretrained Thai LLMs by PyThaiNLP • 23 items • Updated Sep 12 • 9
Thai instruction dataset list Collection Thai instruction datasets that are high quality and are not low-quality translations produced with Google Translate • 13 items • Updated Aug 6 • 2
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30 • 41