| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by indus 1108 days ago
	Wouldn’t this be the equivalent of ranking? I thought LLM are not supposed to get influenced by freshness.

1 comments

By the freshness of training with some data?

Well, aren't they? I believe any kind of reinforcement learning is supposed to be biased into the last training set.