Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Like identifying names of skateboard tricks from the description? https://skatebench.t3.gg/


I don’t care how practical it may or may not be, this is my new favorite LLM benchmark


I couldn't find an about page or similar?


Here's the public sample https://github.com/T3-Content/skatebench/blob/main/bench/tes...

I don't think there's a good description anywhere. https://youtube.com/@t3dotgg talks about it from time to time.


o3-pro is better than 5.2 pro! And GPT 5 high is best. Really quite interesting.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: