Hacker News new | ask | show | jobs
by leobg 302 days ago
Less lobotomized and boxed in by RLHF rules. That’s why a 7b base model will “outprose” an 80b instruct model.