Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
eugene3306
15 days ago
|
parent
|
context
|
favorite
| on:
GLM-5: Targeting complex systems engineering and l...
why don't they publish at ARC-AGI ? too expensive?
Bolwin
15 days ago
[–]
Arc agi was never a good benchmark that tested spatial understanding more than reasoning. I'm glad it's no longer popular
falcor84
15 days ago
|
parent
|
next
[–]
What do you mean? It definitely tests reasoning as well, and if anything, I expect spatial and embodied reasoning to become more important in the coming years, as AI agents will be expected to take on more real world tasks.
eugene3306
15 days ago
|
parent
|
prev
[–]
spatial or not, arc-agi is the only test that correlates to my impression with my coding requests
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: