by solomatov 432 days ago
Does anyone know if there have been any attempts to test Mamba at a really large scale? To me, this model looks like the most promising successor to the transformer architecture. Does anyone know why that might not be the case, or what the other alternatives are?
1 comment

Tencent's 'Hunyuan-T1'–The First Mamba-Powered Ultra-Large Model: https://news.ycombinator.com/item?id=43447254