Hacker News new | ask | show | jobs
by brucethemoose2 1154 days ago
I have seen this same phenomenon mentioned on huggingface: a finetuned large model being worse than its smaller variant.