Hacker News

Why don't they publish results on ARC-AGI? Too expensive?


ARC-AGI was never a good benchmark; it tested spatial understanding more than reasoning. I'm glad it's no longer popular.


What do you mean? It definitely tests reasoning as well, and if anything, I expect spatial and embodied reasoning to become more important in the coming years, as AI agents will be expected to take on more real-world tasks.


Spatial or not, ARC-AGI is the only test that correlates with my impressions from my own coding requests.



