Hacker News new | ask | show | jobs
by Bossie 99 days ago
What is being researched? Any objective?
1 comments

The objective is to train a small GPT language model to the lowest possible validation bits-per-byte (val_bpb) in 5-minute runs, using AI agents to autonomously iterate on the code. This builds on Karpathy's autoresearch: https://x.com/AustinBaggio/status/2031888719943192938?s=20