Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This isn’t talking about compaction. This refers to performance as the model is loaded with 500k to 1m tokens.


Ah, thanks, makes sense, I’ll read more about this




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: