Hacker News new | ask | show | jobs
by warkdarrior 751 days ago
I think his tweet can be read as "research in (1) scalable oversight, (2) weak-to-strong generalization, and (3) automated alignment".