Hacker News

Why does ChatGPT slow down so much when the conversations get long, while Claude does compaction?

My best guess is that ChatGPT runs something in your browser to decide what to send to the model API, when it should instead be running quantized models on its own servers.


