Hacker News new | ask | show | jobs
by wegfawefgawefg 946 days ago
Given the RLHF post training, I do believe it was intentional. And I suspect there have been iterations on this to make it more "robust". I vaguely remember there being announcements and such.