by verdverm 745 days ago
Some things to note

- the builders are well aware of the situation

- they are not training on the full internet; they are actually training on less than before, because a filtered subset produces better models

- training involves much more than text from the internet; textbooks are a great addition to the training set. Multi-modal data, especially video, is expected to give the models better world understanding. I suspect this will unlock the household robot

- they now have all the actual interactions (and feedback) with the LLM to add to the training, which is much more relevant and direct training data