Hacker News new | ask | show | jobs
by ndr_ 41 days ago
Yes. OpenAI's GPT-OSS was training using Deliberative Alignment (which was found to be flawed in a competition on Kaggle, but still).

https://arxiv.org/abs/2412.16339