Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

They are using a vision language model to generate the robot motions token by token. They are being bottlenecked by the inference of the VLM.


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: