Hacker News new | ask | show | jobs
by ironrabbit 1544 days ago
Is there any evidence that GPT-3 responses are edited/filtered before being returned to users? My understanding is that some GPT-3 responses are annotated post-hoc, and this data is used to fine-tune later versions of GPT-3 (InstructGPT). This article seems extremely misleading.
1 comments

it seems there is evidence that GPT-3 is being overtrained in response to well publicized bad inputs, without regard to the generalizability of the PR-driven spot-edits, which is what the article describes