Hacker News new | ask | show | jobs
by radarsat1 861 days ago
This was an excellent write up thanks. It'll help me understand the Mamba work a lot more.

I still find it really confusing how a linear model can perform so well.