Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The agents can definitely detect when something is off, given they're using VLMs. They don't necessarily compare it to previous versions, rather they have opinionated takes on whether something looks broken / off. So - yes!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: