|
|
|
|
|
by whimsicalism
478 days ago
|
|
Transformers are typically memory-bandwidth bound during decoding. This chip is going to have a much worse memory b/w than the nvidia chips. My guess is that these chips could be compute-bound though given how little compute capacity they have. |
|