Hacker News new | ask | show | jobs
by phamilton 47 days ago
Gemma4, qwen3.6, deepseek v4, mimo, glm 5/5.1 all do MTP.
1 comments

Thank you, I just realised we are talking about MTP. It seems that it's not that clear though. "Currently, the MTP capabilities are primarily accessible through Google's proprietary LiteRT framework, rather than the open-weights versions... Despite the missing MTP heads in the open release, Gemma 4 (specifically the 26B-A4B variant) still demonstrates high efficiency"