Benchies update: https://artificialanalysis.ai/ Looks like it costs ~25% more th...

Benchies update:

Looks like it costs ~25% more than 5.2, with both on xhigh reasoning.

They only seem to have tested xhigh, which is a shame, since I think that reasoning level is in the point of diminishing returns for most tasks.

Also I was completely wrong earlier. Opus is significantly more expensive. I was looking at the wrong entry in the chart, the non-reasoning version of Opus. The fair comparison is Opus on max reasoning, which costs about twice the price of GPT-5.4 xhigh, to run the AA evals.