Hacker News new | ask | show | jobs
by janalsncm 1283 days ago
Transformers require a lot of data to converge. There’s a reason tree models are still king of kaggle even though transformers have been around for 5 years now.