Hacker News new | ask | show | jobs
by swyx 546 days ago
> However, it was severely undertrained

by modern standards. at the time, it was trained according to neural scaling laws oai believed to hold.

1 comments

Sure, at the time everyone misunderstood Chinchilla. Nonetheless it was severely undertrained, even if they didn't know it back then.