Sky-T1-7B Collection A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated Feb 14 • 6
Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated 23 days ago • 11
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published Dec 6, 2024 • 50
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 654
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 5 days ago • 220
GAVEL: Generating Games Via Evolution and Language Models Paper • 2407.09388 • Published Jul 12, 2024 • 17
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 191
OpenVLA: An Open-Source Vision-Language-Action Model Paper • 2406.09246 • Published Jun 13, 2024 • 38
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct Paper • 2405.14906 • Published May 23, 2024 • 27
Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning Paper • 2303.15647 • Published Mar 28, 2023 • 4