Hacker News new | ask | show | jobs
by pbgcp2026 56 days ago
It seems to me it's only Grok 4.20 that does this currently? Which other models did you have in mind, if I may ask?
1 comments

Gemma4, qwen3.6, deepseek v4, mimo, glm 5/5.1 all do MTP.
Thank you, I just realised we are talking about MTP. It seems that it's not that clear though. "Currently, the MTP capabilities are primarily accessible through Google's proprietary LiteRT framework, rather than the open-weights versions... Despite the missing MTP heads in the open release, Gemma 4 (specifically the 26B-A4B variant) still demonstrates high efficiency"