This&That: Language-Gesture Controlled Video Generation for Robot Planning Paper β’ 2407.05530 β’ Published Jul 8 β’ 3 β’ 1