Hacker News new | ask | show | jobs
by Sharlin 148 days ago
It could still be special-case RLHF trained, just not up to perfection.