Again I just tap the sign. All of your benchmarks mean nothing to me until you i...

jstummbillig · 2025-12-12T15:46:45 1765554405

Since, per Anthropic's own benchmarks, Sonnet 4.5 is beaten by Opus 4.5, would it not suffice to infer the rest?

https://x.com/OpenAI/status/1999182104362668275

nextworddev · 2025-12-12T04:44:57 1765514697

Claude is pretty trash for anything besides coding

romanovcode · 2025-12-12T14:31:55 1765549915

Yeah, but that is the whole point of Claude. And that's why we are interested in the comparison.

wyre · 2025-12-12T06:00:31 1765519231

What are you basing that on? Between Sonnet and Opus I don't think I'm reaching for Gemini 3 at all.

timmg · 2025-12-12T09:02:44 1765530164

That hasn't been my experience at all. I always wondered if we just get used to how to prompt a given model and that it's hard to transition to another.