Hacker News new | ask | show | jobs
by phowon 2639 days ago
The success of Transformers aside, I'm not sure you should be relying on model titles for anything, lest we forget papers like "One Model To Learn Them All" [1].

[1] https://arxiv.org/abs/1706.05137