RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Model Description

This repository contains the pre-trained and fine-tuned checkpoints for RoomTour3D-NaviLLM. Please follow the instructions and license here to use these models.
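As a rough illustration of how the checkpoints might be fetched, the sketch below uses `snapshot_download` from the `huggingface_hub` package. It assumes that `huggingface_hub` is installed and that the repository ID is `roomtour3d/roomtour3d-navillm-models`, as shown on the model page. The helper name `download_checkpoints` and the target directory are hypothetical, and the file layout inside the repository is not documented here, so the official instructions take precedence over this sketch.

```python
from pathlib import Path


def download_checkpoints(local_dir="checkpoints/roomtour3d-navillm",
                         repo_id="roomtour3d/roomtour3d-navillm-models"):
    """Download the whole model repository and return the local directory.

    Both default values are assumptions made for illustration: the repo ID
    is the one shown on the model page, the local directory is arbitrary.
    """
    # Imported lazily so this helper can be defined without the package installed.
    from huggingface_hub import snapshot_download

    target = Path(local_dir)
    target.mkdir(parents=True, exist_ok=True)
    # snapshot_download fetches every file of the repository snapshot into local_dir.
    return Path(snapshot_download(repo_id=repo_id, local_dir=str(target)))
```

Calling `download_checkpoints()` needs network access and returns the directory that holds the downloaded files. Which file is the pre-trained checkpoint and which is the fine-tuned one has to be read from the repository itself.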


Citation

If you find our work useful for your research, please consider citing the paper:

@article{han2024roomtour3d,
      title={RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation}, 
      author={Mingfei Han and Liang Ma and Kamila Zhumakhanova and Ekaterina Radionova and Jingyi Zhang and Xiaojun Chang and Xiaodan Liang and Ivan Laptev},
      journal={arXiv preprint arXiv:2412.08591},
      year={2024}
}
