Looking at the responses below it's interesting how binary they are. It's classi...

imiric · 2026-02-17T23:27:44 1771370864

You can always make it go back and forth with "Are you sure?".

The fact that these are still issues ~6 years into this tech is bewildering.

cyanydeez · 2026-02-17T23:58:34 1771372714

...is it though? Fundamentally, these are statistical models with harnesses that try to conform them to deterministic expectations via narrow goal massaging.

They're not improving on the underlying technology. Just iterating on the massaging and perhaps improved data accuracy, if at all. It's still a mishmash of code and cribbed scifi stories. So, of course it's going to hit loops because it's not fundamentally conscience.

imiric · 2026-02-18T04:11:58 1771387918

> Fundamentally, these are statistical models

> So, of course it's going to hit loops because it's not fundamentally conscience.

Wait, I was told that these are superintelligent agents with sophisticated reasoning skills, and that AGI is either here or right around the corner. Are you saying that's wrong?

Surely they can answer a simple question correctly. Just look at their ARC-AGI scores, and all the other benchmarks!

Arkhaine_kupo · 2026-02-18T09:14:11 1771406051

We made this unbeatable tests for AI then told some of the smartest engineering teams in the planet that they can present a solution in a black box without explaining if they cheated but if they win they get amazing headlines and to keep their jobs and funding.

Somehow thye beat the score in the same year, its crazy! No one could have seen this coming, and please do not test it at home to see if you get the same results, it gets embarrased outside of our office space

emp17344 · 2026-02-18T22:37:09 1771454229

The complete lack of skepticism in the AI space is sickening. Are all economic bubbles this annoying?

wrqvrwvq · 2026-02-18T02:20:53 1771381253

I think what's bewildering is the usual hypemongers promising (threatening) to replace entire categories of workers with this type of dogshit. As another commenter mentioned, most large employers are overstaffed by 2 to 3x so ai is mostly an excuse for investors not to get too worried about staffing cuts. The idea that Marc is blown away by this type of nonsense is indicative only of the types of people he surrounds himself with.

jaapz · 2026-02-18T09:17:36 1771406256

What's also bewildering is the complete opposite of the spectrum of calling something "dogshit" when it is quite obviously a very powerful tool. It won't replace workers. But it will make those workers more productive. You don't need to vibe-code to be able to do more work in the same amount of time with the help of an LLM coding agent.