Only comparing on SOTA scores (ignoring price etc.) is like choosing your daily-...

LinXitoW · 2026-04-24T10:34:24 1777026864

The constant improvements of SOTA are the main thing keeping the investment machine running. We can't really remove training costs from inference costs, because a bunch of the funding and loans for the inference hardware only exists because the promises the continuous training (tries to) provides.

dnnddidiej · 2026-04-24T09:57:17 1777024637

Not really. SOTA vs non SOTA is "can I get my coding work actually done today" vs. "this can do customer support chat"

It is like car vs. kick scooter.

regularfry · 2026-04-24T11:02:11 1777028531

It really isn't. We get coding work actually done today on Opus 4.5. That's not SOTA any more, and anything proximate to that level, even quite loosely, is genuinely useful.

dnnddidiej · 2026-04-24T11:07:31 1777028851

OK we are in Opus 4.5 is not SOTA. Right by that definition .... yes you are right.

randomgermanguy · 2026-04-24T11:47:54 1777031274

I mean its almost halve a year, i think that counts ?

dnnddidiej · 2026-04-24T23:14:18 1777072458

Time wise you are correct.

randomgermanguy · 2026-04-24T11:54:20 1777031660

> "can I get my coding work actually done today" vs. "this can do customer support chat"

I think you need to define "can get coding work done" for this to make sense. Ive been using GPT-3 back-then for basic scripts, does that count ? Or only Claude-Code ?

I also think this is a false dichotomy, if you look at the Project Vend project or Vending-Bench, customer support etc. is at no means trivial. (Old but great story https://www.businessinsider.com/car-dealership-chevrolet-cha...)

UlisesAC4 · 2026-04-24T17:32:42 1777051962

This, I have been doing my side hustle code with open code an 3.2 reasoner and it is way better than what I have at day job with copilot and whatever models are there.

wahnfrieden · 2026-04-25T04:50:43 1777092643

Copilot is a bad harness that perverts the productivity of models like GPT 5.5.

dnnddidiej · 2026-04-24T23:15:08 1777072508

Tell me more please!

zrn900 · 2026-04-26T23:46:31 1777247191

Not really. The current SOTAs are already at the point that they can do that. The following models will start to surpass the daily work level. It's a diminishing returns situation just like anything else in tech.