Hacker News new | ask | show | jobs
by 2ndorderthought 38 days ago
You definitely need to watch it more than a model 100 times larger. But the fact that it runs one 1 GPU and does what it does is insane. Imagine what a 30b model looks like in 6 months or 1 year?