by mistymountains 1191 days ago
That’s just a supervised fine-tuning method to skew outputs favorably. I’m actually working with it on biologics modeling, using laboratory feedback as the training signal. It doesn’t change the underlying inference structure.
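To illustrate the point about inference staying the same, here is a minimal sketch with a toy linear model. It is not the biologics setup mentioned above; `predict`, `fine_tune`, the weights, and the feedback pairs are all made up for illustration. The only thing the tuning step changes is the weight values; the function used at inference time is identical before and after.

```python
def predict(weights, x):
    """Inference: a fixed dot product. Fine-tuning never modifies this function."""
    return sum(w * xi for w, xi in zip(weights, x))


def fine_tune(weights, examples, lr=0.1, epochs=200):
    """Supervised fine-tuning: gradient steps on squared error over labeled pairs."""
    w = list(weights)  # copy so the original weights are left untouched
    for _ in range(epochs):
        for x, target in examples:
            error = predict(w, x) - target
            w = [wi - lr * error * xi for wi, xi in zip(w, x)]
    return w


# Hypothetical "pretrained" weights and feedback-derived (input, target) pairs.
pretrained = [0.5, -0.2]
feedback = [([1.0, 0.0], 1.0), ([0.0, 1.0], 0.3)]

tuned = fine_tune(pretrained, feedback)

# Same predict() call, different parameters: outputs shift toward the feedback.
before = predict(pretrained, [1.0, 0.0])
after = predict(tuned, [1.0, 0.0])
```

The same holds at scale in the sense the comment describes: the tuning pass produces a new set of parameters, and those parameters are fed through an unchanged forward pass.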