Hacker News new | ask | show | jobs
by astrange 185 days ago
Continuous pretraining has issues because it starts forgetting the older stuff. There is some research into other approaches.