Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Benchies update:

https://artificialanalysis.ai/

Looks like it costs ~25% more than 5.2, with both on xhigh reasoning.

They only seem to have tested xhigh, which is a shame, since I think that reasoning level is in the point of diminishing returns for most tasks.

Also I was completely wrong earlier. Opus is significantly more expensive. I was looking at the wrong entry in the chart, the non-reasoning version of Opus. The fair comparison is Opus on max reasoning, which costs about twice the price of GPT-5.4 xhigh, to run the AA evals.

 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: