Abstract: Research on 3D Vision-Language Models (3D-VLMs) is gaining increasing attention, which is crucial for developing embodied AI within 3D scenes, such as visual navigation and embodied question ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results