Hacker News new | ask | show | jobs
Post-Training Layer Scaling Prevents Forgetting and Enhances Model Merging (arxiv.org)
1 points by veryluckyxyz 608 days ago