Hacker News new | ask | show | jobs
by anankaie 692 days ago
Moreover, you get surprisingly out-of-class (size-wise) performance if you fine-tune for your specific problem space. Even if you only train in a parameter-efficient way.