Hacker News new | ask | show | jobs
by indus 1108 days ago
Wouldn’t this be the equivalent of ranking? I thought LLM are not supposed to get influenced by freshness.
1 comments

By the freshness of training with some data?

Well, aren't they? I believe any kind of reinforcement learning is supposed to be biased into the last training set.