by SwellJoe 46 days ago
They're not trained on a raw feed of the internet. They are given curated and synthetic data. The curation and synthesis of new data is done by existing LLMs.

Even if you're given the perfect textbook to read, it still helps you to take notes. Notes serve multiple purposes -- they help add clarity where it is needed, and more importantly, they help integrate new info (the current batch) with prior info (previous batches).
OK, but, as far as I know, there isn't a technology to allow that, yet. LLMs don't work like human brains.
Huh. The tech is what you make it. By that limiting logic, you would've said the same thing about thinking models at inference time. Nothing logically, mathematically, or physically prohibits using thinking at training time too.