Hacker News new | ask | show | jobs
by mirker 1352 days ago
Many of these scaling patterns are logarithmic with respect to data size. You can only double the dataset size so many times that it’s really not clear the scaling will continue.
1 comments

Low data modes are also progressing quite fast. There is Dreamer and more recent papers based on RL in learned world models.