by benreesman
651 days ago
People sophisticated in a field can ask Sonnet or 4o questions that amount to a different way of searching, and sometimes even a better one. If you ask a question in a direct, probing, narrow way, you can sometimes come out ahead. Someone educated by the News Feed algorithm (which is what RLHF amounts to: reward for getting a human to click) is going to be the worst kind of wrong: /r/ConfidentlyIncorrect.

PS: was there ever a blog at b7r6.net?