Next-Gen Robotics Collection Collection for myself to compile everything I thing is or will be related to Robotics • 32 items • Updated 9 days ago
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 2 days ago • 472k • 1.13k
ipa-intelligent-mobile-manipulators/grasp_RedtoYellow_cube_sim Viewer • Updated Feb 3 • 39.4k • 45 • 1
Next-Gen Robotics Collection Collection for myself to compile everything I thing is or will be related to Robotics • 32 items • Updated 9 days ago
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 108
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 1 day ago • 67
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 114