|
|
|
|
|
by phkahler
623 days ago
|
|
>> These models are very small even by academic standards so any finding would not necessarily extend to current LLM scales. Emphasis on not necessarily. >> The main conclusion is that RNN class networks can be trained as efficiently as modern alternatives but the resulting performance is only competitive at small scale. Shouldn't the conclusion be "the resulting competitive performance has only been confirmed at small scale"? |
|