Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Das Balkenspiel ist schlechter Stil. (The bar game is bad style/lame)


Pong!


Errm, No :-) I meant bars as in benchmarks, often rather meaningless, because within the range of statistic noise.

For instance, something having 100.200 points in one config, in another 100.220, with the bars/scales distorted to make that difference seem much larger.

Gaming the bar-game, so to speak.


OpenAI recently played a bit too hard with their GPT-5 announcement. Two bars with the same height but wildly different values, things like that. Such a lack of subtlety that their claim it was accidental is actually almost believable.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: