| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by raincole 1030 days ago
	But the model in OP is fine-tuned by "a proprietary dataset of ~80k high-quality programming problems and solutions". How do we know it's not contaminated by HumanEval too?

1 comments

From the OP:

> Furthermore, we applied OpenAI's decontamination methodology to our dataset to ensure valid results, and found no contaminated examples.