Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This seems like another "better vibes" release. With the number of benchmarks exploding, random luck means you can almost always find a couple showing what you want to show. I didn't see much concrete evidence this was noticeably better than 5.1 (or even 5.0).

Being a point release though I guess that's fair. I suspect there is also some decent optimizations on the backend that make it cheaper and faster for OpenAI to run, and those are the real reasons they want us to use it.





>I suspect there is also some decent optimizations on the backend that make it cheaper and faster for OpenAI to run, and those are the real reasons they want us to use it.

I doubt it, given it is more expensive than the old model.


> I didn't see much concrete evidence this was noticeably better than 5.1

Did you test it?


No, I would like to but I don't see it in my paid ChatGPT plan or in the API yet. I based my comment solely off of what I read in the linked announcement.

At this point the benchmark soup is so dense that it's hard to tell signal from selective framing



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: