by 343rwerfd 700 days ago
> No progress without experiments

The "experiment" is us.

The prompting interfaces face the entire human population, and a sizable number of people are currently feeding the models valuable, actionable experiments plus their outcomes. They re-feed the prompts with intermediate results (failures) until the model gets somewhere: a valid outcome that will be tagged as successful in the training data. That is effectively RLHF from the real world, maybe covering dozens or even hundreds of instances and variations of the same problem, spanning dozens of countries, cultures, age ranges, etc.

That live-captured data is most certainly among the most valuable datasets for training (and re-training) current and future models. It gives the AIs exactly the data that the author implies they can't get by themselves and that isn't accessible to them.