Hacker News new | ask | show | jobs
by sitkack 731 days ago
It isn't tho, if you look at the bulk of tokens needed to train gen1 over LLMs and what is possible with better data and smaller models.

The fact that LLMs trained on dumptrucks full of data cannot achieve what a middle schooler begrudgingly achieves using existence and snide remarks.