Any thoughts on State Space Models?
E.g.:
https://github.com/havenhq/mamba-chat
https://arxiv.org/abs/2311.18257