24 Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning · 7 authors 3
5 Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation · 4 authors
5 One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention · 3 authors