|
|
|
|
|
by confuseshrink
2026 days ago
|
|
This is a very valid argument but it's hard to know what scaling a transformer will really do without trying (looking at you GPT-3). This is probably an issue for ML in general at this point. I think a more nuanced conversation around these topics will look at exactly what you bring up, how do we properly trade the potential knowledge benefit against the costs? It pains me that entirely valid avenues of research like this get covered up in nonsense and drama and their message seemingly lost in the midst of it. |
|