
I wrote a practical guide on how to train nanoGPT from scratch on Azure a while ago. It's pretty hands-on and easy to follow:

https://16x.engineer/2023/12/29/nanoGPT-azure-T4-ubuntu-guid...



Did it really only cost $200?

What sort of things could you do with it? How do you train it on current events?


Yes. I checked the Azure usage after training.

Beyond learning how it all works and running a demo, it doesn't have much practical use. You can train it on current events by feeding it a corpus of recent text during training instead of just OpenWebText. That shouldn't be hard.
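
For anyone wondering what swapping the corpus looks like in practice, here is a rough sketch. nanoGPT's data prep scripts write the tokenized text to `train.bin` and `val.bin` as flat arrays of uint16 token ids, so a custom corpus mostly needs the same treatment. The `write_split` helper, the `encode_bytes` tokenizer and the 10% validation split are my own placeholders, not something from the repo. nanoGPT's OpenWebText script uses the GPT-2 BPE tokenizer via tiktoken, and a byte-level stand-in like this one would also need a matching vocab size in the model config.

```python
from array import array
from pathlib import Path


def encode_bytes(text):
    """Placeholder tokenizer (assumption): one token per UTF-8 byte, ids 0..255."""
    return list(text.encode("utf-8"))


def write_split(corpus, out_dir, val_fraction=0.1, encode=encode_bytes):
    """Split the corpus into train/val and write each part as raw uint16 token ids."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    # Hold out the tail of the corpus for validation.
    n_val = int(len(corpus) * val_fraction)
    cut = len(corpus) - n_val

    counts = {}
    for name, text in (("train", corpus[:cut]), ("val", corpus[cut:])):
        # "H" is an unsigned short, 2 bytes on common platforms.
        ids = array("H", encode(text))
        with open(out / f"{name}.bin", "wb") as f:
            ids.tofile(f)
        counts[name] = len(ids)
    return counts
```

You would call it with something like `write_split(open("news.txt").read(), "data/current_events")` (both names made up), then point the training config's dataset at that folder.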



