Hacker News new | ask | show | jobs
by jemc-dev 1122 days ago
It could be interesting to use this approach in a product that also lets humans pick what they thought was the best answer (in the cases where they are curious about seeing all three).

That data could be gathered internally by that product into an RLHF data set used to train future LLMs.