by quickthrower2
974 days ago
Also: costs more for inference, uses more energy, and is less practical for running locally, so there are fewer use cases as a result. That matters especially for an open model: being on GitHub / Hugging Face but needing to be on an AWS or Nvidia waitlist to get the resources to run it is not great. In a world of unlimited energy and chips I would agree, just make 'em bigger. I guess going bigger has a greater chance of reaching SOTA than experimenting with architectures, so I get that people don’t want to gamble.