by spdustin 860 days ago
It occurs to me that there must be a model that's been "aligned" in the opposite direction from the usual RLHF. Or has nobody done that?