|
|
|
|
|
by wrsh07
661 days ago
|
|
But to your point - that is how I feel about graph nns vs transformers or the fully connected set (GPUs are so good at transformers and fully connected nns, even if there is a structure that makes sense we don't have the hardware to have it make sense.... Unless grok makes it cheap??) |
|
https://arxiv.org/abs/2201.03545