Y
Hacker News
new
|
ask
|
show
|
jobs
by
sota_pop
364 days ago
Yes, that does sound very similar. To my knowledge, isn’t that (effectively) how the latest DeepSeek breakthroughs were made? (i.e. by leveraging chatgpt outputs to provide feedback for training the likes of R1)