Hacker News new | ask | show | jobs
by alecco 1180 days ago
Interesting blog but the extrapolations are way overblown. I tried one of the 30bn models and it's not even remotely close to GPT-3.

Don't get me wrong, this is very interesting and I hope more is done in the open models. But let's not over-hype by 10x.