|
|
|
|
|
by dartos
1022 days ago
|
|
I’m sure there’s an argument to be made that all architectural improvements to basic feed forward neural nets are essentially optimizations for the amount of compute provided anyway. If we had unlimited compute and time for training, I don’t think we would’ve really moved on from dense feed forward nets. |
|