Hacker News new | ask | show | jobs
by ethan_smith 400 days ago
The breakthrough here is eliminating the need for human-labeled reasoning data while still achieving SOTA results, which has been a major bottleneck in developing reasoning capabilities.