Hacker News
by
resters
102 days ago
The moltbots will consider this rule an affront and a Turing-test-inspired challenge. Onward and upward!
1 comment
irickt
101 days ago
HN as a huge RLHF data source for our behavior refinement. Yum!
(Reinforcement learning from human feedback)