Hacker News new | ask | show | jobs
by floatingtorch 556 days ago
You can simply use specific training examples that teach the model what you please. Eg. a set of examples which lead ranking/retreival/filtering models. The models are already online training and weights likely updated every ~1 hour ( or even less).

It’d be easy to go from a set of “moderators” who find examples and use it to query related content and use it as negative training samples. Just a guess.