Hacker News
by mjburgess 25 days ago
No one is curating vast amounts of data for them in other domains. Programmers send programs with fixes
2 comments

It's more about how costly it is to verify work in reinforcement learning. Verification is cheap in mathematics and coding because it can be automated. It is expensive in other domains because, while you can capture certain datasets for pre-training, you ultimately need humans in the loop to judge the quality of the work.
There's no diff of my Excel lambdas being fixed? :(