Hacker News
by ethan_smith 400 days ago
The breakthrough here is eliminating the need for human-labeled reasoning data while still achieving SOTA results. That labeling requirement has been a major bottleneck in developing reasoning capabilities.