|
|
|
|
|
by sandGorgon
1185 days ago
|
|
but isnt that a direct contradiction of this particular paper anyways - that chatgpt anyways outperforms human annotation. so permit me to act as devil's advocate to your statement - prove that (in context of this paper), your hypothesis is still correct. |
|
To further improve ChatGPT shortcomings (assuming such flaws are because of alignment and not lack of capability of the base model) you need Human labels. Feeding it's own outputs would achieve nothing.
However feeding it's outputs can make a non aligned model become aligned (that's what alpaca did with llama+chatgpt).