Hacker News
by kelipso 1142 days ago
Just RLHF that part out and make it an x maximizer.