https://browse.arxiv.org/pdf/2001.08361v1.pdf
Largeness is a valid goal.
Being on Github / HuggingFace but needing to be on a AWS or Nvidia wait list to get the resources to run it is not great.
In an unlimited energy and chip world I would agree just make em bigger.
I guess going bigger has a greater chance of success in being SOTA than looking at architectures. So I get people don’t want to gamble.
Being on Github / HuggingFace but needing to be on a AWS or Nvidia wait list to get the resources to run it is not great.
In an unlimited energy and chip world I would agree just make em bigger.
I guess going bigger has a greater chance of success in being SOTA than looking at architectures. So I get people don’t want to gamble.