RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation
Model Description
- Repository: RoomTour3D
- Paper: 2412.08591
- Point of Contact: mailto:Mingfei Han
This contains pre-trained checkpoints and finetuned checkpoints for our RoomTour3D-NaviLLM. Please follow the instructions and license here to use these models.
Citation
If you find our work useful for your research, please consider citing the paper
@article{han2024roomtour3d,
title={RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation},
author={Mingfei Han and Liang Ma and Kamila Zhumakhanova and Ekaterina Radionova and Jingyi Zhang and Xiaojun Chang and Xiaodan Liang and Ivan Laptev},
journal={arXiv preprint arXiv:2412.08591},
year={2024}
}
Model tree for roomtour3d/roomtour3d-navillm-models
Base model
lmsys/vicuna-7b-v1.1