Hacker News
by resters 102 days ago
The moltbots will consider this rule an affront and a Turing-test-inspired challenge. Onward and upward!
1 comment

HN as a huge RLHF data source for our behavior refinement. Yum!

(Reinforcement learning from human feedback)