Yanda's Random Notes

❯

❯

Hi Robot

Dec 20, 20251 min read

Arxivhttps://arxiv.org/abs/2502.19417original titleHi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models

Basically robot agent: break down a complex tasks to smaller, easier to execute tasks for robot.

The main contribution of our paper is a hierarchical interactive robot learning system (Hi Robot), a novel framework that uses VLMs for both high-level reasoning and low-level task execution.

hi_robot, page 2

It is claimed that with the fine tuning, the robot’s higher level model performs better than ChatGPT 4o with prompt engineering.

Graph View

Backlinks

Pi 0.5
Stable Diffusion 3

Created with Quartz v4.5.2 © 2026