Hacker News new | ask | show | jobs
by nerdponx 929 days ago
I would imagine that daily "training" here involves something more like RLHF than just appending to a big prompt.