Hacker News new | ask | show | jobs
GPT-2 124M checkpoint pre-trained on OpenWebText 27.5B tokens (github.com)
1 points by megadragon9 5 days ago
1 comments

Model built and trained using a hand-built deep learning library (numpy primitives)