| HN Mirror



	by jondwillis 356 days ago
	I don’t think so. Either the training data or some filter applied to the output tokens is leading each model to say that it is the best. The self-preference almost certainly comes from post-processing or, more likely, from the model name being inserted into the system prompt.