323 B
323 B
CSE5519 Advances in Computer Vision (Topic I: 2023 - 2024: Embodied Computer Vision and Robotics)
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control.Links to an external site.
Novelty in RT-2
VLA, vision-language-action models.