MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 16 days ago • 95
Korean Reward Modeling Collection Korean Datasets, Reward Models for RLHF • 16 items • Updated 4 days ago • 3
Octo-planner: On-device Language Model for Planner-Action Agents Paper • 2406.18082 • Published Jun 26 • 47
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch Paper • 2406.14563 • Published Jun 20 • 29
Function Calling v3 Collection Models fine-tuned for function-calling • 14 items • Updated Apr 27 • 20
Miqu Models ('slerp' + 'model_stock') Collection A collection of creative writing models based on the 'miqu-1-70b' model. • 9 items • Updated 29 days ago • 2
Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3 Paper • 2405.00664 • Published May 1 • 18
Handbook v0.1 models and datasets Collection Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24