Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published 21 days ago • 76
HARE: HumAn pRiors, a key to small language model Efficiency Paper • 2406.11410 • Published 24 days ago • 38