This looks like an exact copy of Andrej Karpathy's video ( https://youtu.be/kCc8FmEb1nY ), just in written form. Am I wrong?


The page describes its relationship to nanoGPT:

...nanoGPT targets reproducing GPT-2 (124M params) and covers a lot of ground. This project strips it down to the essentials and scales it to a ~10M param model that trains on a laptop in under an hour...


Yes, you are.



