EAGLE series Collection EAGLE: ETRI's Advanced-lightweight Generative Language Engine (model name changed from eGPT, 24.11.14) • 6 items • Updated 14 days ago • 2
Article Let's make a generation of amazing image generation models By burtenshaw • 8 days ago • 33
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 6 days ago • 25
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 6 days ago • 52
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated 11 days ago • 74
Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 20 days ago • 97
Large Scale Transfer Learning for Tabular Data via Language Modeling Paper • 2406.12031 • Published Jun 17 • 9
TabuLa-8B Collection Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" https://arxiv.org/abs/2406.12031 • 4 items • Updated Jun 19 • 10
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens Paper • 2406.11271 • Published Jun 17 • 20
MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 54
FineWeb-Edu Collection FineWeb-Edu datasets, classifier and ablation model • 5 items • Updated Jun 12 • 11
Switch-Transformers release Collection This release includes various MoE (Mixture of Experts) models based on the T5 architecture. The base models use 8 to 256 experts. • 9 items • Updated Jul 31 • 15