Hacker News new | ask | show | jobs
by yieldcrv 1154 days ago
maybe fine tuning should involve sending an LLM through grade school

actually I wonder if thats what we need to do

a simple socialization package that fine tunes

1 comments

also, alignment package with reward and punishment. “bad model, bad model! oh come here, my good model!”