| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by floatingtorch 556 days ago
	You can simply use specific training examples that teach the model what you please. Eg. a set of examples which lead ranking/retreival/filtering models. The models are already online training and weights likely updated every ~1 hour ( or even less). It’d be easy to go from a set of “moderators” who find examples and use it to query related content and use it as negative training samples. Just a guess.