Hacker News new | ask | show | jobs
by ShamelessC 1299 days ago
Is that still the case when all models have a common ancestor (i.e. finetuned) and haven’t yet overfit on new data?