by jvm
1306 days ago
Author here. Yeah, I agree the real world is very complicated. For any given task I'm imagining something like:
1. Use some sort of instruction tuning to get the thing "good enough" that it gives decent results 75% of the time, and the other 25% a human has to take over.
2. Use the actual usage data as training input. Punish bad behaviors and show the model what the human did to solve the problem.
3. Use this training loop to progressively have the model take over a larger % of the time.
…and I think if you can't get (1) good enough to be worth using, it's going to be really hard to get the loop going.
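To make the loop concrete, here is a rough toy sketch of the three steps, not a real training setup. Everything in it is made up for illustration: `ToyModel`, `human_solve`, and `run_round` are hypothetical names, "training" is just memorizing what the human did, and the "punish bad behaviors" part is left out. It also assumes the model can signal when it needs a human (here by returning `None`), which is probably the hard part in practice.

```python
from dataclasses import dataclass, field


@dataclass
class ToyModel:
    """Hypothetical stand-in for a tuned model: it only 'knows' tasks it has seen solved."""
    known: dict = field(default_factory=dict)

    def attempt(self, task):
        # Return an answer, or None to signal that a human has to take over.
        return self.known.get(task)

    def train(self, examples):
        # Step 2: learn from what the human did on the tasks the model handed off.
        self.known.update(examples)


def human_solve(task):
    # Hypothetical human fallback; the "correct answer" here is just upper-casing.
    return task.upper()


def run_round(model, tasks):
    """Serve one batch of tasks; return (automation_rate, human_corrections)."""
    corrections = {}
    handled = 0
    for task in tasks:
        answer = model.attempt(task)
        if answer is None:
            corrections[task] = human_solve(task)  # human takes over
        else:
            handled += 1
    return handled / len(tasks), corrections


tasks = ["a", "b", "c", "d"]
# Step 1: assume the initial tuning already covers 3 of the 4 tasks (75%).
model = ToyModel(known={"a": "A", "b": "B", "c": "C"})

rates = []
for _ in range(2):
    rate, corrections = run_round(model, tasks)
    rates.append(rate)
    model.train(corrections)  # Step 3: feed usage data back in and repeat.

print(rates)  # [0.75, 1.0]
```

In this toy version the automation rate goes from 75% to 100% after one round, only because the task set is fixed and tiny. With new kinds of tasks arriving over time, the rate would presumably climb much more slowly, if at all.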
|
How does the model know when a human has to take over?
I think most extrapolations of current "AI" capabilities into future capabilities are fun and useful in some ways, but also doomed to fail. It's very easy to miss a tiny detail which may in practice be a fundamental problem.
> Use the actual usage data as training input.
Given that those bigger state-of-the-art models train on terabytes of data, how would you know how much training data to generate to sufficiently change the output?
My understanding of "AI" is that it's mostly about some very complex models which are capable of solving previously unsolvable problems. However, those problems are always extremely specific. Going the other way, thinking of problems or future possibilities first and then applying "AI" to them, is likely to fail.