Hacker News new | ask | show | jobs
by refulgentis 400 days ago
Simplifying it down to "adjusting any weights is training, ipso facto this is meaningful" obscures more light than it sheds (as they noted, RL doesn't get you very far, at all)