| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by binarymax 446 days ago
	And as we can see from DeepSeek this clause means nothing, outside the realm of OpenAI blocking your access to its models.

1 comments

wizzwizz4 446 days ago

I know someone from OpenAI claimed this, but is there any evidence that DeepSeek actually trained their models on output of the models OpenAI have?

link

binarymax 445 days ago

They talk about some examples in their research.

> “Specifically, we initialized the DeepSeek-Prover using the DeepSeekMath-Base 7B model (Shao et al., 2024). Initially, the model struggled to convert informal math problems into formal statements. To address this, we fine-tuned the DeepSeek-Prover model using the MMA dataset (Jiang et al., 2023), which comprises formal statements from Lean 4’s mathlib2 that were back-translated into natural language problem descriptions by GPT-4. We then instructed the model to translate these natural language problems into formal statements in Lean 4 using a structured approach.”

Section 3.1 in https://arxiv.org/html/2405.14333v1

link

wizzwizz4 445 days ago

I was thinking of their general-purpose models, like DeepSeek-R1 and DeepSeek-V3, for which I haven't found evidence that OpenAI models were used to generate synthetic training data. But I didn't find this, so clearly my searching skills aren't great.

link