| HN Mirror


by yieldcrv 1154 days ago

maybe fine-tuning should involve sending an LLM through grade school

actually I wonder if that's what we need to do

a simple socialization package that fine-tunes

1 comment

also, alignment package with reward and punishment. “bad model, bad model! oh come here, my good model!”