RLinf/RLinf-OpenVLAOFT-LIBERO-90-Base-Lora
Reinforcement Learning • 8B • Updated • 30
LaWAM: Latent World Action Models for Efficient Dynamics-Aware Robot Policies
WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL