Hacker News
RLHF vs. RLAIF for language model alignment (assemblyai.com)
2 points by SleekEagle 1033 days ago