by yorwba
1179 days ago
Backtracking to edit the response is theoretically easily solved by training on a masked language modeling objective instead of an autoregressive one. Using it to actually generate text is a bit expensive, though: you can't just generate one token at a time and be done, because you might have to reevaluate every output token each time another token is changed. So I expect autoregressive generation to remain the default until the recomputation effort can be significantly reduced or hardware advances make the cost bearable.
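
As a rough illustration of the cost argument above (not anyone's actual decoding method), the toy Python sketch below counts how often a model would have to be queried under each scheme. `toy_predict` is a made-up stand-in for a masked language model, and `autoregressive` and `iterative_refinement` are hypothetical helper names; the refinement loop is only one simple way to "reevaluate every token whenever another one changes".

```python
from typing import Callable, List, Tuple

MASK = -1  # placeholder for a position that has not been filled in yet

Predictor = Callable[[List[int], int], int]


def toy_predict(seq: List[int], pos: int) -> int:
    """Hypothetical stand-in for a masked LM: the token at `pos`
    depends on both neighbours (masked or missing neighbours count as 0)."""
    left = seq[pos - 1] if pos > 0 and seq[pos - 1] != MASK else 0
    right = seq[pos + 1] if pos + 1 < len(seq) and seq[pos + 1] != MASK else 0
    return (left + right + 1) % 5


def autoregressive(n: int, predict: Predictor) -> Tuple[List[int], int]:
    """Fill positions left to right; each token is decided once and never revisited."""
    seq = [MASK] * n
    calls = 0
    for i in range(n):
        seq[i] = predict(seq, i)
        calls += 1
    return seq, calls


def iterative_refinement(
    n: int, predict: Predictor, max_sweeps: int = 50
) -> Tuple[List[int], int]:
    """Re-predict every position until a full sweep changes nothing
    (or until max_sweeps is reached, since convergence is not guaranteed)."""
    seq = [MASK] * n
    calls = 0
    for _ in range(max_sweeps):
        changed = False
        for i in range(n):
            new_token = predict(seq, i)
            calls += 1
            if new_token != seq[i]:
                seq[i] = new_token
                changed = True
        if not changed:
            break
    return seq, calls


if __name__ == "__main__":
    _, ar_calls = autoregressive(6, toy_predict)
    _, mlm_calls = iterative_refinement(6, toy_predict)
    print("autoregressive calls:", ar_calls)
    print("refinement calls:", mlm_calls)
```

For 6 tokens the autoregressive loop makes exactly 6 calls, while the refinement loop needs at least 12, because it has to run one more full sweep just to confirm that nothing changed. A real model would batch all positions of a sweep into a single forward pass, so treat the counts as a relative measure only.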
I can't imagine how training on masked tokens can "easily" solve backtracking, even in theory. Do you have some literature I could read on this?