RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Model Description

This contains pre-trained checkpoints and finetuned checkpoints for our RoomTour3D-NaviLLM. Please follow the instructions and license here to use these models.


Citation

If you find our work useful for your research, please consider citing the paper

@article{han2024roomtour3d,
      title={RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation}, 
      author={Mingfei Han and Liang Ma and Kamila Zhumakhanova and Ekaterina Radionova and Jingyi Zhang and Xiaojun Chang and Xiaodan Liang and Ivan Laptev},
      journal={arXiv preprint arXiv:2412.08591},
      year={2024}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Model tree for roomtour3d/roomtour3d-navillm-models

Finetuned
(1)
this model