|
|
|
|
|
by marmaduke
854 days ago
|
|
The math is not designed to intimidate but rather approach the "how to build sequence model" in a principled way from state space models, which draws from an arguably longer literature than neural networks. Some of concepts are better explained here than anywhere else, and make it straightforward to make sense of Mamba, which is increasingly popular. |
|