Hacker News new | ask | show | jobs
by nullc 618 days ago
Hardly. It's been RLHFed into sounding like blogspam from foreign content farms, because they used the same people as raters. The non-finetuned models have a much better 'house style' across a wide range of prompting approaches.