|
|
|
|
|
by SpaceManNabs
769 days ago
|
|
> For decoding/inference both are very close to Mamba as xLSTM is a recurrent architecture Can you explain this statement more if you have time? Are you saying the recurrent architecture of xLSTM enables fast inference on par with Mamba? Or the xLSTM architecture slows it down so that its inference is as slow as mamba? |
|