Upcycling Large Language Models into Mixture of Experts Paper • 2410.07524 • Published Oct 10, 2024 • 4
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 27 days ago • 637
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 87
A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models Paper • 2306.02254 • Published Jun 4, 2023 • 11
A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models Paper • 2306.02254 • Published Jun 4, 2023 • 11