Hacker News new | ask | show | jobs
by ruuda 2173 days ago
A slightly more recent post, that really opened my eyes to this insight (and references The Bitter Lesson) is this piece by Gwern on the scaling hypothesis: https://www.gwern.net/newsletter/2020/05#gpt-3