


	by jemc-dev 1122 days ago
	It could be interesting to use this approach in a product that also lets humans pick which answer they think is best (in the cases where they are curious enough to see all three). The product could log those picks internally and build them into an RLHF dataset for training future LLMs.
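
	To make the data-gathering idea concrete, here is a rough sketch of how one human pick among three answers could be turned into preference records. Everything named here (`PreferencePair`, `pairs_from_pick`, the `chosen`/`rejected` fields) is hypothetical and only for illustration; the chosen/rejected layout is a common convention for preference data, not something any particular product is known to use.

```python
import json
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class PreferencePair:
    """One preference record: for this prompt, `chosen` was preferred over `rejected`."""
    prompt: str
    chosen: str
    rejected: str


def pairs_from_pick(prompt, candidates, picked_index):
    """Turn one human pick into pairwise preference records.

    The picked answer is paired against every other candidate. Nothing is
    inferred about how the non-picked answers rank against each other.
    """
    if not 0 <= picked_index < len(candidates):
        raise IndexError("picked_index out of range")
    chosen = candidates[picked_index]
    return [
        PreferencePair(prompt, chosen, other)
        for i, other in enumerate(candidates)
        if i != picked_index
    ]


def to_jsonl(pairs):
    """Serialize records as JSON Lines, one record per line."""
    return "\n".join(json.dumps(asdict(p)) for p in pairs)


# Hypothetical example: the user saw three answers and picked the second one.
example = pairs_from_pick("What is 2 + 2?", ["5", "4", "22"], picked_index=1)
```

	With three answers, one pick yields two records. The pick says nothing about which of the two losing answers is better, so no pair is emitted for them.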